Senior SRE Engineer managing cloud infrastructure and driving Infrastructure-as-Code adoption for Resideo. Designing resilient systems while ensuring the health of cloud platforms.
Responsibilities
Maintain public cloud infrastructure on at least one cloud platform: Azure, AWS, or Google Cloud (GCP).
Build and maintain cloud infrastructure automation (IaC) using Terraform, ARM templates, or similar.
Build and maintain IT automation using tools like Ansible or Chef, or manage complex container-based applications with tools like Helm for Kubernetes.
Handle build, delivery, and deployment using modern technologies like Git, GitHub Actions, Jenkins, Octopus, Ansible, Docker, Kubernetes, or similar.
Build and maintain observability and monitoring across different IT platforms by using Grafana, Prometheus, Elastic, DataDog or similar.
Be part of an L2 team that provides 24/7 support in troubleshooting IT platform issues when required (less than 20% of working time).
Oversee all planned outages, assess root cause analyses (RCAs), and assist with major upgrades to ensure minimal downtime.
Requirements
Minimum of 3 years of working experience with at least one of the public cloud platforms (Azure preferred but not required).
Minimum of 5 years of Windows/Linux experience.
Minimum of 2 years of experience with Terraform or other IaC platforms.
Strong knowledge of Elastic, Grafana, Prometheus or other observability platforms (Datadog, Dynatrace, etc.).
Proven experience with running and/or managing large IT platform services with multiple availability regions.
Experience with container and orchestration platforms such as Docker, Kubernetes, or similar.
Strong English communication (written and oral) skills are required.
Benefits
Employment in a strong, well-known international company as part of a global team.
Unlimited access to online training.
Flexible hybrid working arrangement to support work-life balance.
Meal ticket for each day worked.
Medical coverage to support your health and wellbeing.
Associate DevOps Engineer enhancing application operations for secure digitization solutions at Bundesdruckerei GmbH. Collaborating on CI/CD processes in an agile team setting.
Support AI and DevOps platforms at Citi Finance, ensuring operational stability and effective incident resolution, while collaborating with engineering teams.
SRE / Observability Engineer at leading financial services organization focusing on observability and reliability. Building scalable digital platforms and ensuring system stability and user experience.
AWS DevOps Engineer responsible for building AI platform infrastructure focusing on automation and scalability at Brillio. Join a leading digital technology service provider in the US.
Site Reliability Engineer working on a Cloud SaaS environment, prioritizing IT Security. Collaborate with development teams in a hybrid model from Aachen or Paris.
Lead DevOps Engineer at ONIQ managing multi-cloud infrastructure and data strategy. Collaborating directly with the senior development team to shape architectural decisions and operational excellence.
Engineering Manager overseeing a specialized SRE & Infrastructure team in Cologne or remote from Germany. Ensuring stable operations and collaboration with other engineering leaders.
Cloud & DevOps Engineer at HiQ driving cloud service portfolio and implementing modern cloud solutions. Leading development and ensuring quality in customer projects across various cloud environments.
Own cloud systems and CI/CD processes ensuring scalability and performance. Collaborate with teams to streamline deployments and maintain service reliability at journaway.