MLOps Engineer maintaining and optimizing machine learning models for healthcare applications. Collaborating with engineers to deliver reliable and effective AI solutions.
Responsibilities
Own and manage the full lifecycle of both ML models and core infrastructure – from development and deployment to monitoring and continuous improvement
Build and maintain robust CI/CD pipelines for both software and ML workflows
Ensure reliability, scalability, observability, and security of production systems and ML infrastructure
Automate deployment, orchestration, and environment management using modern DevOps tooling
Collaborate closely with software engineers, ML engineers, and product teams to bring ML-powered features to production
Proactively detect, troubleshoot, and resolve infrastructure and model performance issues
Stay up to date with industry best practices in DevOps, MLOps, and infrastructure engineering
Document infrastructure, workflows, and operational procedures clearly and thoroughly
Requirements
Experience deploying machine learning models into production and managing their lifecycle
Experience implementing model governance, including versioning, monitoring, drift detection, and reporting
Familiarity with MLOps tools such as MLflow, Kubeflow, or DVC
Solid understanding of CI/CD systems (e.g., GitHub Actions, ArgoCD) and infrastructure-as-code tools (e.g., Terraform, Helm)
Familiarity with data engineering concepts such as ETL pipelines, data lakes, and large-scale batch/stream processing
Experience mentoring or supporting colleagues to help them grow their technical skills
Proven experience in a senior-level DevOps, MLOps, or related infrastructure-focused engineering role
Strong proficiency in Python
Deep experience with cloud platforms (AWS, GCP, or Azure) and container orchestration tools (Docker, Kubernetes)
Ability to design scalable, secure, and observable systems in fast-moving environments
Strong debugging and problem-solving skills across distributed systems
Excellent collaboration and communication skills, with experience working in cross-functional teams
Understanding of security and compliance best practices for both software and ML systems
Benefits
Hybrid working environment in Copenhagen, London and New York
Senior Staff Machine Learning Engineer leading technical architecture for GEICO's AI Agent Platform. Driving innovation and enhancing productivity for internal associates and customers.
Staff Machine Learning Engineer developing the next generation of AI Agent OS and SDKs for GEICO. Key responsibilities include architecting scalable systems and implementing observability frameworks.
Senior Machine Learning Engineer at Bumble developing scalable AI systems for personalized user interactions. Leading machine learning model development and deployment from exploration to production.
Lead Machine Learning Engineer at Bumble shaping user connections through machine learning. Driving end - to - end AI solutions while mentoring engineers in a hybrid work environment.
Designing and operating cloud - based MLOps capabilities supporting analytical and generative AI models. Collaborating with data science and business teams for high - impact AI solutions.
Machine Learning Engineer analyzing data structures and developing ML models for customer profiling in Azerbaijan. Collaborating on probabilistic modeling and data quality improvement.
Machine Learning Engineer at HackerRank working on integrity systems to improve model quality. Collaborating on strategies for new signals like audio analysis and behavioral anomalies.
Machine Learning Engineer developing integrity systems for assessing model quality at HackerRank. Collaborating on multimodal signal processing and improving model performance.
Architect designing enterprise - grade AI/ML architectures for Quantiphi. Leading AI applications and ML strategy with a focus on scalability, security, and integration.
Software Engineer for ML Infrastructure at Slack, architecting systems to support large scale AI deployment and reliability. Engage in deep systems engineering focusing on ML lifecycle and infrastructure scalability.