Senior Site Reliability Engineer at Rootly embedding with teams to enhance service performance and reliability. Own CI/CD pipelines and drive capacity planning efforts in a fast-paced environment.
Responsibilities
Embed with product teams to enhance observability, reliability, and performance of their services.
Own our CI/CD pipelines, observability tooling, monitoring systems, and incident response processes.
Build tools and automation to eliminate manual toil, improve engineering velocity and developer experience, and improve system reliability.
Collaborate deeply across engineering to understand systems at the code level and surface cross-cutting reliability, performance, and scaling concerns.
Architect and scale our infrastructure, ensuring best-in-class performance, availability, and operational excellence.
Drive capacity planning efforts to ensure our infrastructure is resilient and scalable as we grow.
Define and manage SLOs and error budgets in partnership with Engineering teams who own production services.
Be vocal - act as a strong voice and force of reliability, quality, performance, and scalability.
Requirements
5+ years of experience in an SRE, Platform, or Infrastructure Engineering role.
5+ years of experience writing software in a production environment.
Strong technical knowledge of cloud infrastructure, distributed systems, and reliability practices.
Strong understanding of observability, performance tuning, and scaling strategies.
Deep familiarity with incident response, monitoring, and CI/CD systems.
Hands-on experience supporting web or RPC services at meaningful scale.
You write code to solve infrastructure problems; not shell scripts alone, but production-grade software.
Benefits
Competitive compensation and early equity in a fast-growing, venture-backed company.
Comprehensive medical, dental, and vision coverage.
3 weeks of vacation, plus unlimited sick and mental health days, and a company-wide end-of-year shutdown to recharge.
$500 stipend for home office setup.
A fast-moving, high-impact environment where your leadership and ideas directly shape the future of the company.
Senior DevOps Engineer at Parser focusing on deploying and maintaining cloud - based products with AWS. Collaborating across technical teams and ensuring robust solutions for business needs.
Safety and Reliability Engineer focusing on safety assessments and reliability evaluations at Collins Aerospace. Lead analyses and ensure designs meet certification standards.
Deployment Engineer responsible for client solution deployment and integration at ng - voice. Work includes planning, configuration, and operational efficiency tasks.
DevOps Engineer participating in structuring Terraform practices at EOLEN, a consulting firm in engineering and IT. Focused on Cloud, Data, Cybersécurité, software development and IT infrastructure.
DevOps Developer coordinating IT support and developing pipelines and delivery processes for Saab. Focused on collaboration, technical solutions, and communication to achieve high - quality results.
Senior Infrastructure Engineer focused on design automation and software infrastructure at Intel Foundry. Collaborating with development teams to improve reliability and velocities in engineering processes.
Site Reliability Engineer at Personio focusing on automated infrastructure and collaboration across engineering teams. Shape the future of HR technology with meaningful impact and ownership.
Site Reliability Engineering Senior Manager leading multiple SRE teams at Netwealth. Shaping strategy and operational practices in a collaborative environment.