DevOps SRE focusing on reliability and performance of critical systems in a hybrid setup. Collaborating with development and infrastructure teams to enhance application observability and efficiency.
Responsibilities
Ensure the reliability, availability and performance of applications in production;
Define, monitor and evolve SLAs, SLOs and SLIs;
Implement and maintain observability practices (metrics, logs, tracing and alerts);
Develop automations to reduce toil and increase operational efficiency;
Lead incident management, perform root cause analysis and produce post‑mortems;
Collaborate with development, DevOps and infrastructure teams;
Contribute to security, resilience and compliance improvements;
Support FinOps initiatives to optimize cloud costs;
Promote SRE and DevOps best practices across squads.
Requirements
Experience with on‑premises and cloud environments (preferably AWS);
Strong knowledge of observability (Prometheus, Grafana, Dynatrace, Datadog, OpenTelemetry);
Experience with automation and scripting (Python, Go, Bash and/or PowerShell);
Knowledge of Linux and Windows;
Experience with Docker and Kubernetes;
Experience with SRE practices (error budgets, toil reduction, post‑mortems);
Experience with monitoring, alerting and dashboards;
Knowledge of networking, security and advanced troubleshooting;
Bachelor’s degree in Computer Science, Engineering or related fields;
Desirable: AWS, Observability or Kubernetes certifications;
Experience with CI/CD (GitLab, GitHub Actions, Jenkins);
Experience with IaC (Terraform, CloudFormation);
Knowledge of distributed architectures and microservices;
Experience with FinOps;
Familiarity with advanced SRE practices (Chaos Engineering, fault injection).
Benefits
Multi‑benefit card — choose how and where to use it.
Study grants for undergraduate, graduate, MBA and language courses.
Certification incentive programs.
Flexible working hours.
Competitive salaries.
Annual performance reviews with a structured career plan.
Systems Administrator managing Azure DevOps Server for Saab. Collaborating with development teams to optimize CI/CD pipelines and resolve technical issues.
Intern assisting the engineering team with hands-on experience in DevOps and software testing. Works closely with engineers on various assignments in Germantown, Maryland.
Platform Engineer handling Azure cloud platform responsibilities for Hiscox, a global specialist insurer. Responsibilities include design, deployment, maintenance, and optimization of cloud infrastructure.
Network DevOps Engineer designing, implementing, and supporting networking services for data centre networks. Collaborating on network infrastructure to ensure reliability, scalability, and security.
Software Consultant focusing on DevOps at ParentPay Group, Europe’s education tech leader. Engaging in production code, automation, and cloud management processes.
Python DevOps Engineer supporting a platform used by over 1,500 users enabling data-driven innovation at Rabobank. Designing analytics platforms and building Python-based automation in a collaborative environment.
DevOps Engineer managing the cloud infrastructure for gaming solutions. Responsible for deployment standardization, developer support, and system observability across environments.
DevOps Engineer ensuring reliable operation of cloud platforms while supporting development teams at azeti. Part of a growing DevOps team with responsibilities for CI/CD, automation, and infrastructure management.
DevOps Engineer managing AWS infrastructure and CI/CD processes for Sensorfact's smart monitoring platform. Collaborating with development teams to optimize energy efficiency in a modern cloud architecture.
DevOps Integrator responsible for deploying software applications and managing infrastructure at RATP. Engaging in CI/CD processes and collaborating with internal teams for digital solutions.