AI Researcher developing multimodal perception models for Tavus' core AI team. Focus on foundational multimodal models in conversational settings and collaborating with ML teams.
Responsibilities
Conduct research on Foundational Multimodal Models in the context of Conversational Avatars (e.g., Neural Avatars, Talking-Heads).
Model video, audio, and language sequences using Autoregressive, Predictive Architectures (e.g., V-JEPA), and/or Diffusion paradigms with an emphasis on temporal and sequential data rather than static images.
Collaborate with the Applied ML team to bring your work to life in production systems.
Stay at the cutting edge of multimodal learning and help us define what “cutting edge” means next.
Requirements
A PhD (or near completion) in a relevant field, or equivalent hands-on research experience.
Experience modeling human behavior and generation (facial expressions, affect, or speech). Ideally in conversational or interactive settings.
Deep understanding of sequence modeling in video/audio/language domains.
Familiarity with large model training, especially LLMs or VLMs.
Strong background in Deep Learning (from Transformers to Diffusion Models) and how to make them work in practice.
Excellent programming skills, especially in PyTorch.
AI Research Scientist developing advanced technologies related to multimodal models at Mercari's R4D team. Collaborating on machine learning and computer vision projects that impact e - commerce platforms.
AI Research Intern at Toyota Research Institute exploring AI applications in enhancing wellbeing. Collaborate in developing innovative approaches within the Human - Centered AI Division.
Machine Learning Researcher at Astera Institute focusing on data - efficient and general model induction. Collaborating on innovative architectures with a focus on performance and throughput.
AI Research Intern focusing on advancing deep learning techniques in financial products at TD. Collaborating on large - scale datasets and representing the team at ML conferences.
AI Research Engineer designing downstream AI models operationalizing clinical endpoints in breast cancer care. Collaborating with clinical experts to enhance healthcare innovation in medical imaging.
Machine Learning Engineer designing, implementing, and deploying ML solutions for GEICO. Collaborating with cross - functional teams to integrate ML models and ensure business impact.
Intern AI Researcher at Analog Devices exploring AI models for efficient edge computing. Collaborating with experts to drive breakthroughs in model optimization and compression.
Senior AI Researcher at Dolby developing audio and video technologies with a focus on deep learning. Partnering with experts to innovate in multimedia analysis, processing, and rendering.
AI Research Lead guiding development and evaluation of African - first open language models at GSMA. Collaborating with researchers and operators to ensure high - quality outcomes across diverse linguistic structures.
AI Scientist Intern focusing on automated speech processing for educational applications at Pearson. Involves mentoring, training ML models, and contributing to prototype ideas.