MLOps Engineer leading large-scale model deployments and managing CI/CD pipelines in GCP ecosystem. Focus on operational excellence and implementing observability frameworks for AI systems.
Responsibilities
Architect and manage the end-to-end deployment of machine learning models across production environments, ensuring scalability and high availability.
Design, build, and maintain automated CI/CD pipelines using Tekton to streamline model development, testing, and release cycles.
Implement and manage comprehensive observability and traceability frameworks to monitor model health, data drift, and system performance in real-time.
Configure advanced monitoring solutions using Dynatrace and centralized logging systems to track latency, resource utilization, and system errors.
Develop and maintain MLOps infrastructure exclusively within the GCP ecosystem, utilizing Vertex AI, Google Kubernetes Engine (GKE), and BigQuery.
Automate model retraining, validation, and deployment workflows to ensure models remain accurate and performant in production.
Partner with data scientists and software engineers to transition models from research/prototypes to robust, enterprise-grade production assets.
Requirements
3+ years of professional experience in MLOps, DevOps, or Software Engineering, with a specific focus on the industrialization of machine learning models.
Bachelor’s or Master’s degree in a quantitative field (e.g., Computer Science, Engineering, Statistics, or Mathematics).
Proven track record of building and maintaining complex, automated pipelines using Tekton or similar orchestration tools.
Demonstrated experience implementing enterprise-grade monitoring, logging, and distributed tracing in a professional environment.
Deep understanding of the GCP stack, particularly services related to model hosting, orchestration, and data management.
Senior Software Engineer designing and operating ML infrastructure for Plaid's AI initiatives. Collaborating with product teams to accelerate AI - powered financial experiences and ensure scalable ML systems.
Senior ML Engineer serving as an individual contributor in generative AI at GEICO. Collaborating with teams to design, develop, and deploy AI systems that drive business value.
Senior Staff Machine Learning Engineer at GEICO, enhancing service productivity through AI technologies. Collaborating with dynamic teams to develop and deploy scalable AI workflows across Geico.
Staff AI Engineer at GEICO designing and deploying AI platforms for virtual agent workflows. Collaborating with teams to improve service for millions of customers.
Machine Learning Engineer at Tilt, developing personalisation solutions across various app surfaces. Collaborate with teams to enhance recommendation systems on a video - first shopping platform.
Senior Machine Learning Engineer architecting next - generation AI platforms for healthcare and fintech with Nitra's diverse team. Focused on data pipelines, ML infrastructure, and production - ready AI systems.
Senior Machine Learning Engineer architecting and building Nitra's data and AI platform. Driving intelligent products across healthcare and fintech industries with applied AI and platform engineering.
Machine Learning Engineer developing and implementing ML models for lending at Blue Whale Lending LLC. Collaborating with teams to enhance data insights and validate model performance.
Applied ML Engineer contributing to machine learning and perception tasks for edge - intelligent maritime systems. Collaborating with cross - functional teams to deliver real - world AI solutions.
AI/ML Engineer building data science and AI solutions for Pharma and MedTech clients on Azure. Collaborating with teams to deliver end - to - end machine learning projects.