Principal ML Ops Engineer at Pragmatike | Hybrid Hired

About the role

Lead ML Ops Engineer for a fast-growing AI startup focused on scalable infrastructure. Drive hands-on execution across the entire model lifecycle in a collaborative environment.

Responsibilities

Architect, build, and scale the end-to-end ML Ops pipeline, including training, fine-tuning, evaluation, rollout, and monitoring.
Design reliable infrastructure for model deployment, versioning, reproducibility, and orchestration across cloud and on-prem GPU clusters.
Optimize compute usage across distributed systems (Kubernetes, autoscaling, caching, GPU allocation, checkpointing workflows).
Lead the implementation of observability for ML systems (monitor drift, performance, throughput, reliability, cost).
Build automated workflows for dataset curation, labeling, feature pipelines, evaluation, and CI/CD for ML models.
Collaborate with researchers to productionize models and accelerate training/inference pipelines.
Establish ML Ops best practices, internal standards, and cross-team tooling.
Mentor engineers and influence architectural direction across the entire AI platform.

Requirements

Deep hands-on experience designing and operating production ML systems at scale (Staff/Principal-level expected).
Strong background in ML Ops, distributed systems, and cloud infrastructure (AWS, GCP, or Azure).
Proficiency with Python and familiarity with TypeScript or Go for platform integration.
Expertise in ML frameworks: PyTorch, Transformers, vLLM, Llama-factory, Megatron-LM, CUDA / GPU acceleration (practical understanding)
Strong experience with containerization and orchestration (Docker, Kubernetes, Helm, autoscaling).
Deep understanding of ML lifecycle workflows: training, fine-tuning, evaluation, inference, model registries.
Ability to lead technical strategy, collaborate cross-functionally, and operate in fast-paced environments

Benefits

Competitive salary & equity options
Sign-on bonus
Health, Dental, and Vision
401k

Similar roles

Browse all Machine Learning Engineer jobs

43 minutes ago

MH

Machine Learning Engineer – Training & Simulation Systems

Mission Technologies, a division of HII

Machine Learning Engineer designing and deploying advanced training capabilities to support U.S. Navy operational readiness. Collaborate on machine - learning models to enhance combat system training environments.

Onsite Role

Virginia Beach United States Machine Learning Engineer

$95,004 - $128,000 per year

yesterday

AI

Cloud MLOps Engineer

American Family Insurance

Cloud MLOps Engineer supporting Data Science and Engineering teams by automating CI/CD pipelines and managing multi - cloud infrastructure for ML production.

Hybrid Role

Madison United States Machine Learning Engineer

$80,000 - $131,000 per year

2 days ago

CI

Staff AI/ML Engineer, LLMs

CACI International Inc

Lead development of Agentic AI capabilities and LLM applications for multiple mission management applications. Mentor teams to implement ML algorithms addressing customer challenges.

Hybrid Role

Aurora United States Machine Learning Engineer

$98,500 - $206,800 per year

2 days ago

CI

Staff AI/ML Engineer

CACI International Inc

Staff AI/ML Engineer at CACI responsible for developing AI/ML algorithms and analyzing datasets. Join a high - performing team supporting national safety missions.

Hybrid Role

Philadelphia United States Machine Learning Engineer

$98,500 - $206,800 per year

2 days ago

CI

AI/ML Engineer

CACI International Inc

AI/ML Engineer at CACI developing machine learning algorithms for multiple applications. Collaborating with a research team to implement cutting - edge AI/ML solutions for customer missions.

Hybrid Role

Aurora United States Machine Learning Engineer

$82,100 - $172,400 per year

2 days ago

CI

Senior Computer Vision AI/ML Engineer

CACI International Inc

Senior Computer Vision AI/ML Engineer leading a team in AI/ML algorithm implementation for remote sensing solutions. Responsibilities include training models and analyzing datasets with a focus on defense and commercial applications.

Hybrid Role

Philadelphia United States Machine Learning Engineer

$82,100 - $172,400 per year

2 days ago

SG

Machine Learning Operations Engineer II

S&P Global

MLOps Engineer working on ML processes and robust workflows at Kensho. Collaborating with engineers to enhance tooling, services, and frameworks for machine learning.

Hybrid Role

Cambridge United States Machine Learning Engineer

$130,000 - $175,000 per year

2 days ago

BO

Artificial Intelligence / Machine Learning Intern

BLUE ORIGIN

Intern working on avionics hardware and software for Blue Origin. Gaining hands - on experience while contributing to space development projects.

Hybrid Role

Los Angeles United States Machine Learning Engineer

$38 per hour

3 days ago

MG

機械学習エンジニア

Match Group

Machine Learning Engineer engaged in recommendation and search, NLP, and image processing. Contributing to a major online dating app, Pairs, in Japan.

Hybrid Role

Tokyo Japan Machine Learning Engineer

3 days ago

TA

Senior Scientist II, Applied Machine Learning, Translational Agentic AI

Tempus AI

Senior Scientist II leading innovative AI and machine learning projects in oncology at Tempus. Collaborating with teams to advance predictive modeling and drug R&D initiatives.

Hybrid Role

New York City United States Machine Learning Engineer

$125,000 - $185,000 per year