Senior AI Researcher leading research on multimodal perception models for AI Humans. Building models that generate audio, visual, and language responses in a conversational context.
Responsibilities
Lead research on Foundational Multimodal Models for Conversational Avatars — systems that can perceive, reason, and generate across video, audio, and language.
Build and train models using Autoregressive, Predictive (e.g., V-JEPA), and Diffusion-based architectures with a deep focus on temporal and sequential data (not static frames).
Design and execute experiments to predict and control the visual, auditory, and linguistic responses of avatars.
Partner with the Applied ML team to bring research into real-world use cases.
Mentor other researchers and drive excellence across the team.
Requirements
A PhD plus 2–3+ years working hands-on with LLMs, VLMs, or multimodal systems.
Previous experience leading research efforts or mentoring teams.
Expertise in sequence modeling across video, audio, and text — with strong understanding of autoregressive, predictive, and diffusion frameworks.
Experience with large-scale model training and optimization for performance and real-time generation.
Proven ability to translate research ideas into production-grade systems.
Publications in top-tier venues (CVPR, ICCV, NeurIPS, ECCV, ACMMM).
Strong PyTorch skills and comfort moving fluidly between research and engineering.
Benefits
flexible work schedule
unlimited PTO
competitive healthcare
gear stipends
Job title
Senior AI Researcher, Multimodal Perception Models
Advanced AI Scientist at HP responsible for architecture and leadership of AI ecosystems. Leading projects in data mining, modeling techniques, and automation systems to drive business innovation.
Graduate Machine Learning Researcher at Longshot Systems designing and improving predictive models for sports betting analytics with a focus on innovation and R&D.
AI Research Scientist developing advanced technologies related to multimodal models at Mercari's R4D team. Collaborating on machine learning and computer vision projects that impact e - commerce platforms.
AI Research Intern at Toyota Research Institute exploring AI applications in enhancing wellbeing. Collaborate in developing innovative approaches within the Human - Centered AI Division.
Machine Learning Researcher at Astera Institute focusing on data - efficient and general model induction. Collaborating on innovative architectures with a focus on performance and throughput.
AI Research Intern focusing on advancing deep learning techniques in financial products at TD. Collaborating on large - scale datasets and representing the team at ML conferences.
AI Research Engineer designing downstream AI models operationalizing clinical endpoints in breast cancer care. Collaborating with clinical experts to enhance healthcare innovation in medical imaging.
Machine Learning Engineer designing, implementing, and deploying ML solutions for GEICO. Collaborating with cross - functional teams to integrate ML models and ensure business impact.
Intern AI Researcher at Analog Devices exploring AI models for efficient edge computing. Collaborating with experts to drive breakthroughs in model optimization and compression.