Site Reliability Engineer ensuring reliability and performance of critical systems and services. Collaborating with teams to improve system resilience and automate operations.
Responsibilities
Design and implement monitoring and alerting systems for critical services
Automate operational tasks to improve efficiency and reduce manual effort
Collaborate with development teams to enhance system reliability and performance
Manage incident response and post-incident reviews
Analyse system metrics to identify trends and areas for improvement
Contribute to capacity planning and scalability strategies
Requirements
Experience in site reliability engineering or DevOps roles
Strong scripting and automation skills (e.g., Python, Bash)
Knowledge of monitoring tools and observability practices
Understanding of cloud infrastructure and containerisation
Excellent problem-solving and analytical abilities
Commitment to continuous improvement and operational excellence
Benefits
26 days annual leave plus bank holidays (increasing with length of service)
Flexible working options
Private health care
Competitive pension scheme – Anglian Water double-matches your contributions up to 6%
Life assurance at eight times your salary
Annual bonus
Personal medical assessments
Virtual GP service
Cancer screening
Financial wellbeing support and salary finance benefits
Lifestyle savings, including discounts on retail, travel, and utilities