Senior SRE managing reliability of 300+ servers powering client Odoo ERP systems. Lead incident response and guide a team in building reliable systems.
Responsibilities
Define and track SLOs/SLIs and guard error budgets
Build a complete observability stack (metrics, logs, tracing, alerting)
Lead incident response, run blameless postmortems and raise the bar on operational excellence
Set standards for deployments, rollbacks and change management
Guide and mentor a team of three engineers
Improve CI/CD pipelines, Docker environments, Harbor registry and automation processes
Drive infrastructure automation: provisioning, backups, DR, security hardening, self-healing systems
Support our BI infrastructure and contribute to our self-service client platform
Requirements
4+ years in SRE/DevOps/infrastructure engineering
Strong Linux fundamentals and container expertise (Docker)
Deep experience with observability stacks (Prometheus, Grafana, etc.)
CI/CD experience (Jenkins, GitHub Actions, or similar)
Solid scripting skills (Python, Bash)
Experience with on-call rotations, incident management and root cause analysis
Ability to work across bare-metal and cloud environments (Hetzner, AWS, or similar)
A mindset that prioritizes reliability and sustainable operations
Ready to grow into people leadership
Fluent in English
Benefits
A real leadership path: Step into your first technical leadership role with direct mentorship
Real production scale: Own the reliability of 300+ servers running client Odoo ERP instances
Strategic impact: Play a critical role in our self-service client platform
Impact from day one: You shape monitoring, alerting, incident response, and rollout standards
Flexible working: We value results over hours — structure your work around your life
Competitive salary: Total expected compensation of €50,000 to €65,000 per year, with potential for growth as you take on leadership responsibilities
Plenty of other benefits: Variable compensation scheme, learning budget, excellent private health insurance, and state-of-the-art equipment. We go beyond fruit baskets and free drinks