Platform Engineer responsible for developing and managing Kubernetes environments for AI solutions in healthcare. Collaborating with teams to enhance core infrastructure and streamline deployments.
Responsibilities
Design, deployment, and management of scalable and secure Kubernetes clusters on OVHcloud.
Ownership and advancement of our CI/CD pipelines for automated, reliable application and infrastructure deployments.
Implementation and management of our GitOps workflows using tools like ArgoCD or Flux.
Management and scaling of GPU workloads in Kubernetes, ensuring optimal performance and resource utilization for our ML teams.
Development and maintenance of our observability stack (VictoriaMetrics, VictoriaLogs, Grafana, Tracing) to ensure deep visibility into system health.
Management of our cloud infrastructure on OVHcloud, focusing on automation (Infrastructure as Code), cost optimization, and security.
Lifecycle management of core platform services, including message brokers (RabbitMQ), databases (PostgreSQL, Redis), and authentication systems (Okta, OIDC, OAuth2).
Acting as a key responder for infrastructure incidents; debugging and troubleshooting complex production issues across distributed systems.
Supporting and empowering development teams by providing robust self-service tools, clear documentation, and collaborative support.
Requirements
3-5+ years of professional experience in a Platform Engineering, DevOps, or SRE role
Deep, hands-on experience with Kubernetes in a production environment (cluster management, networking, security, scheduling)
Proven experience managing infrastructure on a cloud provider (OVHcloud is a strong plus; AWS, GCP, or Azure experience is also valued)
Strong practical knowledge of CI/CD systems (e.g. GitHub Actions) and GitOps principles (ArgoCD, Flux)
Proficiency with Infrastructure as Code (IaC) tools like Terraform or Pulumi
Solid understanding of observability principles and tools (e.g. VictoriaMetrics, VictoriaLogs, OpenTelemetry/Tracing, Grafana)
Experience managing stateful services in production (e.g. PostgreSQL, Redis, RabbitMQ)
Solid scripting skills in Python
Benefits
Full ownership of a mission-critical platform
A team that values curiosity, learning, and experimentation
Remote-first setup with the option to work in our Berlin office
Lead Platform Engineer responsible for AWS infrastructure to support business applications. Collaborating with teams to ensure optimal performance and security of cloud resources.
Senior Brand Management Platform Engineer at Baranek & Renger GmbH focusing on technical development of Brand Management systems. Collaborating with clients and providers to enhance brand management capabilities.
Brand Management Platform Engineer optimizing brand management systems like Frontify and Papirfly for Baranek & Renger GmbH. Working in a hybrid role that enhances digital branding processes.
Senior Machine Learning Platform Engineer at Strava developing AI/ML systems for fitness applications. Driving innovative solutions using machine learning models and large datasets to enhance user experience.
Databricks Platform Engineer managing AWS integration and data pipelines for advanced analytics. Driving innovation and optimization in Databricks ecosystem with a focus on machine learning capabilities.
Senior Dynamics 365 and Power Platform Developer at Mitacs leading digital transformation initiatives. Collaborating to build robust solutions and ensure the reliability of business systems.
Senior Developer implementing Microsoft Dynamics 365 and Power Platform solutions at Mitacs. Contributing to digital transformation initiatives and ensuring reliable business systems.
Senior Dynamics 365 and Power Platform Developer at Mitacs working on major enterprise and digital transformation. Ensuring reliability and security of core business systems through innovative solutions.
Senior Developer contributing to enterprise transformation and modernizing business applications at Mitacs. Engaging with business stakeholders and technical teams to deliver robust solutions in Microsoft Dynamics 365 and Power Platform.
Senior iOS Engineer developing user - facing applications at Qlose with global impact. Collaborating in a fast - paced developing environment with strong technical ownership expectations.