Senior Engineer in generative AI focused on foundation models and AI systems design at NVIDIA. Collaborating on large-scale training and contributing to open-source projects.
Responsibilities
Design and post-train foundation models (LLMs, VLMs, VLAs and DiTs)
Contribute to large-scale training infrastructure and high-efficiency inference pipelines
Collaborate on open-source and internal projects, author technical papers or patents
Prototype and iterate rapidly on experiments across AI domains
Design and implement model distillation algorithms for size reduction and diffusion step optimization
Profile and benchmark training and inference pipelines
Requirements
Minimum 8 years industry or 5+ years research/postdoc in generative AI systems
Proficiency in PyTorch, JAX, or other deep learning frameworks
AI Research Engineer developing sophisticated workflows leveraging LLM models at Cisco. Collaborating with teams to ensure security and scalability of AI solutions.
Senior Applied AI Scientist focusing on AI Learning at Preply, leveraging deep learning for personalized education solutions. Collaborating with teams to enhance learning experiences globally.
AI Scientist at Preply applying deep learning and NLP for personalized learning solutions. Collaborating with cross - functional teams to translate AI research into impactful educational tools.
Advanced AI Scientist at HP responsible for architecture and leadership of AI ecosystems. Leading projects in data mining, modeling techniques, and automation systems to drive business innovation.
Graduate Machine Learning Researcher at Longshot Systems designing and improving predictive models for sports betting analytics with a focus on innovation and R&D.
AI Research Scientist developing advanced technologies related to multimodal models at Mercari's R4D team. Collaborating on machine learning and computer vision projects that impact e - commerce platforms.
AI Research Intern at Toyota Research Institute exploring AI applications in enhancing wellbeing. Collaborate in developing innovative approaches within the Human - Centered AI Division.
Machine Learning Researcher at Astera Institute focusing on data - efficient and general model induction. Collaborating on innovative architectures with a focus on performance and throughput.
AI Research Intern focusing on advancing deep learning techniques in financial products at TD. Collaborating on large - scale datasets and representing the team at ML conferences.