Senior Software Engineer focusing on DevOps, managing the application lifecycle and CI/CD for GMS. Collaborating with teams to ensure secure, reliable releases with a customer-centric approach.
Responsibilities
Own the container-based application lifecycle, bi-weekly releases, and CI/CD pipelines for GMS.
Organize and execute migrations as the service evolves.
Manage deployments on customer-isolated Kubernetes clusters running stateful applications, persistent storage, and infrastructure-as-code (manifests/Makefiles), closely partnering with infrastructure teams to ensure operational and performance needs are met.
Ensure high availability and performance by meeting contractual SLAs through proactive monitoring and alert response, including participation in a 24x7 on-call rotation.
Rapidly debug and resolve complex issues surfaced by customers or internal monitoring.
Own auth for GMS resources and provide full stack technical expertise to customer-facing teams to fulfill service obligations.
Manage terraform deployments for data pipelines.
Requirements
10+ years of software engineering experience with a focus on infrastructure or DevOps, specifically deploying and managing containers.
Bachelor’s Degree in Computer Science or a similar field.
Hands-on experience with Kubernetes and stateful applications, persistent storage, and node pool isolation in Kubernetes.
Track record of owning and optimizing CI/CD pipelines and managing infrastructure via Makefiles and manifests.
Practical knowledge of secrets management and security best practices within a cloud-native environment.
Experience participating in on-call rotations and participating in blameless postmortems.
Ability to monitor complex distributed systems with established SLAs and SLOs and a commitment to meeting strict performance SLOs and contractual obligations.
A high degree of independence and a sense of ownership over the full deployment lifecycle.
Experience deploying in secured customer environments.
Willingness to collaborate across multiple time zones and travel quarterly for team alignment.
Evolve systems and services intentionally and responsibly using Architectural Decision Records among stakeholders.
Experience managing data pipelines, particularly with dbt.
Benefits
Paid time off including vacation, holidays and company-wide days off
Employee Wellness Program
Home Office Reimbursement
Monthly Phone and Internet Reimbursement
Tuition Reimbursement and access to LinkedIn Learning
Graduate Site Reliability Engineer at SiXworks developing skills in automation and cloud technologies while working in a collaborative team environment. Focus on supporting scalable systems and services through best practices in DevOps.
Engineer supporting enterprise - scale Microsoft 365 environment at NIH. Implementing automated testing frameworks and secure development practices in Federal Government program.
Senior Cloud Engineer developing cloud - native applications and optimizing CI/CD pipelines at GRAYOAK. Collaborating in interdisciplinary teams on innovative cloud projects with a focus on data and AI.
Senior Manager Site Reliability Engineering at WEX ensuring system scalability and resilience while leading engineering best practices. Collaborating with cross - functional teams to enhance reliability across platforms.
SRE DevOps Engineer developing scalable solutions for Consumer Products and Retail Services at Capgemini. Focusing on Kubernetes, Terraform, and CI/CD automation with a flexible work culture.
DevOps Analyst at SONDA managing integrations of technological solutions in Brasília. Focused on infrastructure management and continuous improvement of processes.
DevOps Engineer focusing on hybrid projects in a dynamic team responsible for leading DevOps technologies. Collaborating to optimise large - scale websites and applications in the United Kingdom.
Senior DevOps Engineer optimizing CI/CD workflows and collaborating with development and security teams. Focused on building robust pipelines and implementing DevSecOps best practices.
Senior DevOps Developer at Boeing focused on designing, implementing, and maintaining AWS Cloud solutions. Collaborating with teams to streamline operations and enhance system reliability.