Cloud Operations Engineer at HYCU maintaining AWS and Google Cloud environments for data protection. Responsibilities include automation, monitoring, and troubleshooting for cloud operations compliance.
Responsibilities
Assist in managing and maintaining cloud environments (AWS and Google Cloud), ensuring uptime, performance, and scalability
Work with the Senior Cloud Operations Manager to ensure that all operational procedures and infrastructure meet industry and federal government security guidelines and requirements
Monitor cloud infrastructure, detect issues, and troubleshoot problems to minimize downtime and ensure system stability
Implement automation tools and CI/CD pipelines to streamline operations, deployments, and system updates
Support security initiatives such as patching, vulnerability scanning, and remediation in compliance with industry standards
Respond to infrastructure incidents, resolve issues quickly, and provide reports on root causes and resolution strategies
Ensure backup systems are operational, and disaster recovery plans are in place and regularly tested
Assist in maintaining comprehensive documentation of system configurations, operational procedures, and compliance-related activities
Work closely with the development and security teams to ensure seamless integration between infrastructure and applications
Requirements
3+ years of experience in cloud operations, infrastructure management, or DevOps, with a strong emphasis on AWS
Familiarity with SOC2, ISO 27001 , FedRAMP or other government cloud security frameworks is highly desirable
Strong knowledge of AWS services such as EC2, S3, RDS, Cognito, and Load Balancer; and familiarity with Google Cloud services such as GCE, CloudRun, and BigQuery is a plus
Experience with automation tools such as Terraform, CloudFormation, or Ansible, and familiarity with CI/CD pipelines
Proficiency in scripting languages (Python, Bash, etc.) for automation and system management
Understanding of cloud security principles, encryption, and compliance requirements
Experience with monitoring tools (CloudWatch, Prometheus) for tracking infrastructure performance and resolving issues
Strong troubleshooting skills with the ability to resolve technical issues quickly and efficiently
Ability to collaborate effectively with cross-functional teams, and take direction from senior leadership
Bachelor’s degree in Computer Science, Information Systems, or a related field, or equivalent work experience
Senior Software Engineer at PayPal managing cloud infrastructure and DevOps solutions. Delivering complete SDLC solutions and guiding engineering teams for scalable and reliable services.
Senior Site Reliability Engineer at Diligent leading reliability, automation, and observability across cloud infrastructure. Build tools for incident response and enhance performance in fast - paced environments.
Perception Deployment Engineer deploying deep learning models on embedded systems at Caterpillar. Collaborating with cross - functional teams for integration and optimization of perception modules in vehicles.
Principal Site Reliability Engineer at AT&T required to design scalable solutions for critical operations with minimal downtime. Collaborating with teams to monitor and improve system performance in cloud environments.
DevOps Engineer managing AI SaaS infrastructure at a high - growth European company. Supporting AI model deployment and ensuring platform security and compliance with multiple systems integration.
Engineering Manager leading teams for observability platforms at LexisNexis. Owns operational excellence across software delivery lifecycle in Raleigh, NC.
Reliability Engineer optimizing site facility infrastructure and utility systems at Roche. Conducting root cause analyses and developing maintenance plans to enhance reliability and efficiency.
DevOps SME designing, implementing, and operating multi - cloud platforms for The Missing Link. Collaborating with engineering, security, and operations teams while embedding DevOps best practices.
Site Reliability Engineer improving reliability of cloud infrastructure for an AI - specialized company. Taking ownership of monitoring and incident response processes in hybrid - working style.
DevOps Engineer leading automation for sophisticated release/deployment pipelines at Securonix. Focused on Python, Ansible, and cloud services to enhance security operations.