Site Reliability Engineer managing stable, resilient applications with a focus on customer journeys. Collaborating with teams to ensure reliable service delivery and implementation of observability solutions.
Responsibilities
manage the provision of stable, resilient, reliable applications
identify and automate manual tasks
implement observability solutions
collaborate with feature teams to understand application changes
participate in delivery activities
address production issues
contribute to site reliability operations including production support, incident response, on-call rota, toil reduction, application performance
lead improvement to release quality into production
Requirements
experience of supporting live production services serving customer journeys
demonstrable knowledge of ITIL processes
IT Security principles
hands on experience with Azure Cloud
full-stack observability using tools such as Log Analytics, Application Insights, and Grafana
expertise in AWS infrastructure
containerisation platforms like Docker, Kubernetes, OpenShift
strong hands-on experience in cloud security governance
WAF implementation
designing, implementing, securing, and optimising cloud-native environments
strong understanding of the Identity and Access Management domain
Senior DevOps Engineer at Parser focusing on deploying and maintaining cloud - based products with AWS. Collaborating across technical teams and ensuring robust solutions for business needs.
Safety and Reliability Engineer focusing on safety assessments and reliability evaluations at Collins Aerospace. Lead analyses and ensure designs meet certification standards.
Deployment Engineer responsible for client solution deployment and integration at ng - voice. Work includes planning, configuration, and operational efficiency tasks.
DevOps Engineer participating in structuring Terraform practices at EOLEN, a consulting firm in engineering and IT. Focused on Cloud, Data, Cybersécurité, software development and IT infrastructure.
DevOps Developer coordinating IT support and developing pipelines and delivery processes for Saab. Focused on collaboration, technical solutions, and communication to achieve high - quality results.
Senior Infrastructure Engineer focused on design automation and software infrastructure at Intel Foundry. Collaborating with development teams to improve reliability and velocities in engineering processes.
Site Reliability Engineer at Personio focusing on automated infrastructure and collaboration across engineering teams. Shape the future of HR technology with meaningful impact and ownership.
Site Reliability Engineering Senior Manager leading multiple SRE teams at Netwealth. Shaping strategy and operational practices in a collaborative environment.