Staff/Principal Machine Learning Engineer at Inworld optimizing real-time AI models and orchestration. Engaging in deep tech projects in a dynamic, collaborative environment.
Responsibilities
Make unclear problems clear through design and prototyping.
Treat performance, latency, and reliability as product features.
Engage in in-person collaboration to solve complex problems and foster team culture.
Support sharing work and open-source contributions to advance the field.
Requirements
Deep understanding of modern serving frameworks, such as vLLM or TRT-LLM, and the techniques they implement.
Hands-on experience with quantization, distillation, caching strategies, continuous batching, paged attention, and speculative decoding.
Proficiency in C++, CUDA, Rust, or highly optimized Python.
Experience with Kubernetes, Ray, custom load balancing, multi-GPU/multi-node inference, and reliably handling thousands of concurrent connections.
Non-trivial systems programming projects, open-source contributions to major inference engines, or deep-dive technical write-ups.
Full-cycle ownership of model deployment from research to production.
PhD in CS, Physics, Math, or equivalent practical experience building backend or ML systems.
AI/GenAI - ML Engineer at Quento Technologies S.A. building and maintaining scalable and efficient machine learning pipelines for various AI applications.
Senior Machine Learning Engineer developing and integrating solutions for MoonPay's payments platform. Supporting fraud prevention and collaborating across teams in Fintech.
Lead Machine Learning Engineer at Disney Ad Platforms driving AI innovation and machine learning solutions for advertising. Innovating ad technology while mentoring junior engineers in a collaborative environment.
Perception and SLAM Machine Learning Engineer developing advanced perception algorithms for autonomous machinery at AIM. Collaborating on state-of-the-art machine learning methods in challenging environments.
Machine Learning Engineer developing innovative computer vision solutions for defense and commercial applications. Collaborating with R&D on algorithms for detection, classification, and tracking capabilities.
Senior Machine Learning Engineer developing deep learning models for camera-based perception in autonomous trucks. Building robust camera models for safe and reliable autonomous driving.
Senior Machine Learning Engineer at Disney designing machine learning models for self-healing infrastructure. Collaborating with cross-functional teams to enhance enterprise technology strategies.
Machine Learning Engineer at Capital One responsible for productionizing ML applications in Agile teams. Focused on ML architectural design, coding, and maintaining high availability of models.
Lead Machine Learning Engineer at Capital One focused on machine learning applications and systems. Collaborating with Agile teams to develop scalable solutions for business problems.
Machine Learning Systems Research Intern at Red Hat working on AI inference and model optimization techniques. Collaborating with experts in the field while gaining hands-on experience in applied ML research.