Senior Production Engineer (SRE) at Legion building and operating a secure AWS/Kubernetes platform. Focused on automation, reliability, and infrastructure as code.
Responsibilities
Support and operate Legion’s AWS-based cloud platform and Kubernetes (EKS) environments.
Leverage GenAI tools (e.g., Claude Code, Codex, or similar) to accelerate infrastructure development, automation, and auto-remediation of common production issues.
Build and maintain infrastructure-as-code using Terraform.
Develop automation and internal tooling using Go or Python.
Improve CI/CD pipelines to increase deployment safety and velocity.
Define and improve monitoring, alerting, and observability systems.
Respond to production incidents, conduct root cause analysis, and implement systemic improvements.
Develop and automate operational runbooks and remediation workflows.
Support production deployments, including during off-hours as needed.
Requirements
5-8+ years of experience in SRE, DevOps, or SaaS production operations.
3+ years of hands-on experience operating production workloads in AWS.
5+ years of experience with observability tools such as Datadog, CloudWatch, ELK stack, Prometheus, or similar.
3+ years experience with Terraform and infrastructure-as-code practices, including managing complex multi-region deployments with module based configurations.
3+ years of experience with containerized environments using Docker and Kubernetes (EKS preferred); familiarity with Helm.
Proficiency in Go or Python (or similar programming language).
Experience building and maintaining CI/CD systems (Git-based workflows, Argo, Jenkins or similar).
Strong Linux/Unix systems experience.
Bachelor’s degree in Computer Science or equivalent practical experience.
Production Support Engineer ensuring system stability and reliability for Manulife's critical services. Collaborative role bridging development and infrastructure, providing seamless service for customers.
Production Engineer managing database operations at Palantir, ensuring reliability and availability of data systems. Involved in architecture, design, and maintenance of production databases in various environments.
Production Engineer PCB managing first - line technical support for PCB assembly processes. Assisting with product introduction and implementing process improvements in a leading transport solutions company.
Senior Production Support / DevOps Engineer at Keyrus focusing on application reliability and cloud operations. Support enterprise Java - based platforms in collaboration with development teams.
Lead Production Engineer managing production optimization initiatives across the enterprise for oil and gas. Act as the key authority in autonomous and semi‑autonomous production engineering standards.
Production Engineer in open pit mining at St Ives Gold Mine. Responsible for drill and blast designs aligning with production plans and continuous improvement.
Production Engineer ensuring compliance with manufacturing procedures and standards at Galderma. Optimizing production processes and supporting autonomous work cells for operational improvements.
Production Support Engineer ensuring reliability of Ruby on Rails platform at HHAeXchange. Supporting operational health and handling incident response for production systems.
Ingénieur systèmes rejoignant une équipe pour l'exploitation et la mise en place de solutions numériques. Environnement stimulant avec une culture technique forte.
Senior Production Systems Engineer at DRS RADA Technologies developing radar solutions for defense applications. Acting as a key interface between systems engineering and manufacturing engineering organizations.