Technical Lead for the DevOps team at Atria Health, ensuring reliable, scalable systems and mentoring engineers. Leads cloud infrastructure initiatives using Google Cloud Platform and Terraform.
Responsibilities
Define and drive the technical vision for DevOps practices across the organization
Lead architecture decisions for infrastructure, CI/CD pipelines, and cloud resources
Serve as a technical escalation point for complex infrastructure challenges
Conduct design reviews and provide guidance on reliability, security, and scalability
Design, build, and maintain cloud infrastructure on Google Cloud Platform using Terraform
Own and improve CI/CD pipelines to enable fast, safe deployments
Implement and maintain monitoring, alerting, and observability systems
Drive incident response processes and lead post-mortems to improve system resilience
Partner with product engineering teams to understand their infrastructure needs and translate them into scalable solutions
Work closely with Security to implement and maintain compliance and security best practices
Collaborate with Product and Engineering leadership on capacity planning and technical roadmaps
Mentor and coach DevOps engineers, fostering growth and technical development
Establish and document DevOps standards, runbooks, and best practices
Champion a culture of reliability, automation, and continuous improvement
Requirements
7+ years of software engineering experience, with 3+ years focused on DevOps, SRE, or infrastructure engineering
Deep experience with cloud platforms (GCP strongly preferred; AWS or Azure acceptable)
Proficiency with infrastructure-as-code tools, particularly Terraform
Strong experience with container orchestration (Kubernetes) and CI/CD systems
Demonstrated ability to lead technical initiatives and influence without direct authority
Excellent communication skills and ability to translate complex technical concepts for varied audiences
Experience in healthcare technology or other regulated industries (Preferred)
Familiarity with our backend stack (Node, TypeScript, Express) (Preferred)
Experience building and scaling observability platforms (Preferred)
Track record of improving developer experience and deployment velocity (Preferred)
Benefits
Excellent health and wellness benefits, 100% paid by Atria, effective from date of hire
OneMedical membership for employees and dependents, giving access to 24/7 virtual care
Fertility & family planning
Company-covered preventive health screenings through partner hospitals (calcium score)
Fitness perks, including Wellhub+
401(k) contributions and a 4% match starting after 6 months
Flexible Time Off
Continuing medical education (CME) and CEU support for professional licensure
Time to give back and make an impact in underserved communities