Director, AI – ML Infrastructure at Allen Institute | Hybrid Hired

About the role

AI/ML Infrastructure Architect at the Allen Institute developing engineering infrastructure for AI/ML applications. Collaborating with cross-functional teams to support bioscience research.

Responsibilities

Develop and lead a cloud-agnostic, state-of-the-art engineering infrastructure at the Allen Institute to support AI/ML research and applications.
Procure and deploy GPUs to meet computational demands.
Coordinate infrastructure implementation with external partners.
Lead data management, software infrastructure and AI/ML workflow best practices and policies.
Manage and lead a team of engineers.
Develop and implement policies and software for efficient management, prioritization, and scheduling of AI workloads.
Implement cost tracking and reporting to provide transparency and prevent overruns.
Collaborate with science unit teams to accelerate adoption of the new AI pipeline by providing training and support.
Ensure integration of AI infrastructure with existing platforms.
Develop and oversee a governance framework to ensure that use of GPU resources aligns with the institute's scientific priorities.
Regularly review and adjust resource allocation based on governance inputs.
Help establish community standards for scalability in developing, disseminating, and evaluating AI/ML/computational methods for scientific problems.
Participate in institute-wide initiatives, workshops, and seminars to promote engineering excellence through technical leadership, cross-disciplinary collaboration and knowledge sharing.

Requirements

Bachelor's degree in Computer Engineering or a related technical field, or equivalent experience
7 years of experience working with MLOps in medium- to large-scale GPU clusters and/or cloud-based ML deployments
Experience with building, deploying and maintaining machine learning models
Proficiency with cloud computing (AWS, GCP or Azure) and with on-prem clusters
Experience with databases and large-scale data management
Working knowledge of custom AI/ML libraries and AI/ML execution platforms
Proven ability to work independently and manage multiple projects simultaneously while meeting deadlines
Excellent written and verbal communication skills, with the ability to collaborate effectively in a multidisciplinary team environment

Benefits

Medical, dental, vision, and basic life insurance
401(k) plan
Paid time off

Similar roles

7 hours ago

AI Staff Machine Learning Engineer – General AI, ML, Big Data

C-Serv

AI/ML Engineer at a global networking leader, shaping ML strategy and building high-performance systems. Innovating with AI technology to enhance network management and develop flagship products.

Hybrid Role

Vancouver Canada Machine Learning Engineer

11 hours ago

Senior Staff Machine Learning Engineer, AI Agent Platform

GEICO

Senior Staff Machine Learning Engineer leading technical architecture for GEICO's AI Agent Platform. Driving innovation and enhancing productivity for internal associates and customers.

Hybrid Role

New York City United States Machine Learning Engineer

$130,000 - $300,000 per year

11 hours ago

Staff Machine Learning Engineer, AI Agent Platform

GEICO

Staff Machine Learning Engineer developing the next generation of AI Agent OS and SDKs for GEICO. Key responsibilities include architecting scalable systems and implementing observability frameworks.

Hybrid Role

New York City United States Machine Learning Engineer

$115,000 - $260,000 per year

19 hours ago

Senior Machine Learning Engineer

Bumble Inc.

Senior Machine Learning Engineer at Bumble developing scalable AI systems for personalized user interactions. Leading machine learning model development and deployment from exploration to production.

Hybrid Role

Austin United States Machine Learning Engineer

$220,000 - $250,000 per year

19 hours ago

Lead Machine Learning Engineer

Bumble Inc.

Lead Machine Learning Engineer at Bumble shaping user connections through machine learning. Driving end-to-end AI solutions while mentoring engineers in a hybrid work environment.

Hybrid Role

Austin United States Machine Learning Engineer

$255,000 - $280,000 per year

yesterday

Machine Learning Operations Engineer II

GM Financial

Designing and operating cloud-based MLOps capabilities supporting analytical and generative AI models. Collaborating with data science and business teams for high-impact AI solutions.

Hybrid Role

Irving United States Machine Learning Engineer

yesterday

Machine Learning Engineer

UltraFix Appliance Repair, LLC

Machine Learning Engineer analyzing data structures and developing ML models for customer profiling in Azerbaijan. Collaborating on probabilistic modeling and data quality improvement.

Hybrid Role

Baku Azerbaijan Machine Learning Engineer

yesterday

Machine Learning Engineer, Integrity

HackerRank

Machine Learning Engineer at HackerRank working on integrity systems to improve model quality. Collaborating on strategies for new signals like audio analysis and behavioral anomalies.

Hybrid Role

Santa Clara United States Machine Learning Engineer

yesterday

Machine Learning Engineer, Integrity

HackerRank

Machine Learning Engineer developing integrity systems for assessing model quality at HackerRank. Collaborating on multimodal signal processing and improving model performance.

Hybrid Role

Bangalore India Machine Learning Engineer

yesterday

Architect – Machine Learning

Quantiphi

Architect designing enterprise-grade AI/ML architectures for Quantiphi. Leading AI applications and ML strategy with a focus on scalability, security, and integration.

Hybrid Role

Mumbai India Machine Learning Engineer