Hybrid Staff / Principal Machine Learning Engineer

Posted 7 hours ago

About the role

  • As a Staff/Principal Machine Learning Engineer at Inworld, you will optimize real-time AI models and orchestration, working on deep tech projects in a dynamic, collaborative environment.

Responsibilities

  • Turn ambiguous problems into well-defined ones through design and prototyping.
  • Treat performance, latency, and reliability as product features.
  • Engage in in-person collaboration to solve complex problems and foster team culture.
  • Support sharing work and open-source contributions to advance the field.

Requirements

  • Deep understanding of modern serving frameworks, such as vLLM or TRT-LLM, and the techniques they use.
  • Hands-on experience with quantization, distillation, caching strategies, continuous batching, paged attention, and speculative decoding.
  • Proficiency in C++, CUDA, Rust, or highly optimized Python.
  • Experience with Kubernetes, Ray, custom load balancing, multi-GPU/multi-node inference, and reliably handling thousands of concurrent connections.
  • A track record of non-trivial systems programming projects, open-source contributions to major inference engines, or deep-dive technical write-ups.
  • Full-cycle ownership of model deployment from research to production.
  • PhD in CS, Physics, Math, or equivalent practical experience building backend or ML systems.

Benefits

  • Relocation assistance

Job title

Staff / Principal Machine Learning Engineer

Experience level

Lead

Salary

$270,000 - $500,000 per year

Degree requirement

Postgraduate Degree
