Reliability Engineer responsible for availability and performance of U.S. Air Force Cloud services. Collaborates with teams to deliver reliable mission-critical systems in a hybrid environment.
Responsibilities
This role supports the U.S. Air Force Cloud One Architecture and Common Shared Services contract and currently has an opening for a **Reliability Engineer**.
The Reliability Engineer is responsible for ensuring the availability, performance, scalability, and resiliency of mission‑critical systems.
This role applies software engineering principles to infrastructure and operations, with a strong emphasis on automation, monitoring, incident response, and continuous reliability improvement.
The reliability engineer serves as the bridge between development, operations, and platform teams to ensure production systems consistently meet defined service level objectives (SLOs) while supporting rapid, safe delivery of new capabilities.
Requirements
Bachelors and eight (8) years or more of experience; Masters and six (6) years or more of experience. Additional experience may be accepted in lieu of degree.
Active Secret clearance at a minimum required to start
US citizenship required
Experience with cloud platforms (AWS, Azure, OCI, or GCP), including managed services
Experience with containerized environments (Docker, Kubernetes)
Familiarity with CI/CD pipelines and deployment automation
SLOs and error budgets
Capacity modeling and performance testing
Strong understanding of:
Distributed systems and high‑availability architectures
Safety and Reliability Engineer focusing on safety assessments and reliability evaluations at Collins Aerospace. Lead analyses and ensure designs meet certification standards.
Deployment Engineer responsible for client solution deployment and integration at ng - voice. Work includes planning, configuration, and operational efficiency tasks.
DevOps Engineer participating in structuring Terraform practices at EOLEN, a consulting firm in engineering and IT. Focused on Cloud, Data, Cybersécurité, software development and IT infrastructure.
DevOps Developer coordinating IT support and developing pipelines and delivery processes for Saab. Focused on collaboration, technical solutions, and communication to achieve high - quality results.
Senior Infrastructure Engineer focused on design automation and software infrastructure at Intel Foundry. Collaborating with development teams to improve reliability and velocities in engineering processes.
Site Reliability Engineer at Personio focusing on automated infrastructure and collaboration across engineering teams. Shape the future of HR technology with meaningful impact and ownership.
Site Reliability Engineering Senior Manager leading multiple SRE teams at Netwealth. Shaping strategy and operational practices in a collaborative environment.
DevOps Engineer automating software development lifecycle in multi - cloud Kubernetes environments. Building and maintaining DevSecOps pipeline using Infrastructure as Code and modern tools.