Senior MLOps Engineer building and operating the platform for identity-verification products at Entrust. Focused on bridging ML research and production environments with an emphasis on developer experience.
Responsibilities
Run and evolve our ML compute layer on Kubernetes/EKS (CPU/GPU) for multi-tenant workloads, and make workloads portable across regions (region-aware scheduling, cross-region data access, and artifact portability).
Operate Argo Workflows and Dask Gateway as reliable, self-serve services used by engineers and researchers to orchestrate data prep, training, evaluation, and large-scale batch compute (installation, upgrades, security, quotas, autoscaling).
Build GitOps -native delivery for ML jobs and platform components (GitLab CI, Helm, FluxCD ) with fast rollouts and safe rollbacks.
Design and maintain our data platform built on LakeFS to enable experiment reproducibility, data lineage tracking, and automated governance processes.
Own developer experience and enablement by creating clear APIs/CLIs and minimal UIs, maintaining comprehensive templates and documentation.
Requirements
You will have some MLOps experience as well as
You value developer experience and enjoy talking to users (engineers/scientists), removing friction, and treating the platform like a product.
Production experience with AWS and Kubernetes (EKS), including GPU workloads.
Proficiency in Python (e.g., FastAPI /Django) and solid CS fundamentals (performance, concurrency, data structures).
Experience building/operating data pipelines (idempotency, retries, backfills, reproducibility).
Working knowledge of Terraform, Helm, Docker, Git, and GitLab CI/CD.
Observability experience with Prometheus/Grafana and logs (e.g., Loki/ Promtail or Splunk/Sentry) with sensible alerting.
Good grasp of networking and security concepts and Linux systems administration.
Benefits
25 days annual leave plus + RTT + 1 day off for your birthday
Two paid volunteering days per year*
Meal Vouchers provided by Swile. 50% Covered and 50% is deducted from your payroll.
Health Insurance (Mutuelle) provided by ALAN
Disability & Life insurance (Prevoyance) provided by ALAN (3x Base Salary)
Commuter reimbursement up to €40 per month
Life enrichment allowance of up to €95 per month to use for services including gym, yoga, fitness classes, massages, childcare, and therapy
Dedicated learning opportunities including using tools like Linkedin Learning with availability to use for learning resources such as books, coaches, conferences, courses, podcasts, and more
Our open and transparent culture is reflected in our “Better Together” motto
Expense up to £300 (or local equivalent) to purchase workstation setup equipment
The opportunity to become a member of Entrust’s resource groups in order to learn different skills in our belonging groups
Machine Learning Engineer supporting the Enterprise Machine Learning team in developing advanced solutions. Collaborating with stakeholders and driving data science initiatives in a hybrid work environment.
Contract AI/ML Engineer focused on delivering machine learning projects for clients at AND Digital. Aiming to enhance AI - powered solutions and contribute to digital skills development.
Senior Machine Learning Engineer at TomTom building data pipelines for autonomous vehicle mapping solutions. Collaborating in a diverse team to innovate and implement machine learning technologies.
Machine Learning Manager improving automated decision - making and managing a team. Driving innovations in credit risk modeling for Monzo’s borrowing products.
Machine Learning Engineer responsible for ML model design and deployment at SiGMA Group. Enhancing event experiences through AI - driven solutions in iGaming and tech.
Machine Learning Intern at Nomagic tackling physical manipulation challenges in AI robotics with top professionals. Engage in innovative projects while shaping robotic technology.
Machine Learning Engineer developing and deploying AI systems for Personio's HR platform. Collaborating with teams to integrate ML features and products ensuring data privacy and security.
Principal Machine Learning Engineer leading AI and Machine Learning systems at Bumble for recommendations and personalization. Driving improvements in user engagement and safety across Bumble products.
Software Engineer delivering MLOps solutions for Generative AI at DataGalaxy. Focusing on reliability and collaboration with product engineering teams in a hybrid environment.
Principal Machine Learning Engineer at Qodea responsible for leading ML model lifecycle and collaborating on AI solutions in Buenos Aires delivery center.