Cloud Operations Engineer responsible for maintaining cloud infrastructure for Old Mutual. Collaborating with teams for optimal system performance and compliance in cloud operations.
Responsibilities
Manage and maintain cloud infrastructure to ensure high availability, performance, and security.
Monitor cloud systems and services to ensure optimal performance.
Respond to and resolve cloud-related incidents and service request per SLA.
Implement and manage cloud automation tools to streamline operations and improve efficiency.
Implement and manage monitoring systems and monitoring reporting.
Monitor, report and manage cloud platform and shared services capacity.
Collaborate with development and engineering teams to support cloud-based applications and services.
Assist in the deployment, configuration, and management of cloud resources.
Perform regular system maintenance, updates, backups and patching to keep cloud environments secure and up-to-date.
Develop and maintain documentation for cloud operations processes and procedures.
Optimize cloud resource usage and costs, providing recommendations for improvements.
Ensure compliance with security policies and industry standards.
Participate in on-call rotations to provide 24/7 support for critical cloud operations.
Requirements
A numerate Bachelor's Degree (e. g. Computer Science, Mathematics, Engineering) with minimum or equivalent technical qualification
5 years of professional experience
Relevant cloud certification
Cloud platforms such as AWS, Azure or Google
DevOps Cloud Operations practices
Problem solving and trouble shooting skills
Infrastructure as code (Eg, Terraform, CloudFormation)
Cloud security
Cloud Networking
Agile/SAFe
Coding and scripting (Eg. Python, Bash, Powershell)
Scripting and Automation (Eg. Python, Bash, Powershell)
Cloud Monitoring
Cloud Compliance Skills
Adaptive Thinking, Change Management, Cloud Computing, Cloud Infrastructure Management, Computer Network Security, Cost Account Management, Cost Budgeting, Data Analysis, Data Collection Methods, Enterprise Application Integration (EAI), IT Network Security, Performance Improvements, Project Integration Management, Project Life Cycle Management, System Architecture Analysis, System Requirements Analysis, Virtualization, Virtual Private Networks (VPNs)
Full - Stack Engineer enhancing engineering productivity at Fidelity. Building internal tools for SRE teams to improve operational efficiency and reliability.
DevOps Engineer at Cloudogu working with development and operations for reliable software delivery. Focusing on CI/CD, infrastructure automation, and platform services in an agile environment.
Jr. DevOps Engineer supporting and improving CI/CD pipelines and Linux systems at Swift. Collaborating with senior engineers in a hands - on learning environment.
Senior DevOps Engineer I managing automation tooling and multi - cloud infrastructure at Spring Health. Collaborating with AI and Infrastructure teams in a hybrid Seattle office.
Site Reliability Engineer for cloudified backup platform using Commvault technology at Expleo. Joining a dynamic team to ensure backup infrastructure scalability and reliability.
Site Reliability Engineer responsible for designing and maintaining scalable services with high availability. Collaborating with development teams to enhance reliability and operational excellence.
Technical Staff leading the architecture, reliability, and modernization of enterprise ALM and DevOps tools. Driving strategy and influencing product development in collaboration with various teams.
Site Reliability Engineer responsible for reliability and availability, collaborating with development teams on scalable systems. Applying software engineering practices to improve production operations.
DevOps Engineer in the Security Data and AI Lab at Lloyds Banking Group driving data and cloud infrastructure's influence on product operations and customer service improvements.