Senior AI Software Engineer developing open-source AI frameworks for Large Language Models at NVIDIA. Collaborating on optimizing model training and providing innovative solutions in AI applications.
Responsibilities
Design and develop the GenAI open source Megatron Core and NeMo Framework
Solve large-scale, end-to-end AI training and inference challenges, spanning the full model lifecycle from initial orchestration, data pre-processing, and running of model training and tuning, to model deployment.
Work at the intersection of AI applications, libraries, frameworks, and the entire software stack.
Innovate and improve model architectures, distributed training algorithms, and model parallel paradigms.
Accelerate foundation model training and finetuning with mixed precision recipes and next-gen NVIDIA GPU architectures.
Performance tuning and optimizations of deep learning framework and software components.
Research, prototype, and develop robust and scalable AI tools and pipelines.
Requirements
MS, PhD or equivalent experience in Computer Science, AI, Applied Math, or related fields and 5+ years of industry experience.
Experience with AI Frameworks (e.g. PyTorch, JAX), and/or inference and deployment environments (e.g. TRTLLM, vLLM, SGLang).
Proficient in Python programming, software design, debugging, performance analysis, test design and documentation.
Consistent record of working effectively across multiple engineering initiatives and improving AI libraries with new innovations.
Strong understanding of AI/Deep-Learning fundamentals and their practical applications.
Senior network security Engineer for Zero Trust and Network security architecture team at Pitney Bowes. Ensuring implementation, operation, and optimization of zero trust solutions.
Full Stack Developer working on impactful software solutions for top brands in Australasia. Join Sandfield where diverse projects await and personal growth is fostered.
Senior Developer/Tech Lead focusing on AI - driven software solutions at Datacom. Collaborate with teams to design and deliver innovative projects addressing complex challenges.
Full - Stack Developer responsible for actively developing and integrating features on the PULSE platform for business process automation. Collaborating closely with customers to deliver tailored software solutions.
Software Developer Engineer in Networking at NVIDIA designing and verifying high - speed communication devices. Working closely with customers on product solutions across multiple platforms.
Software Engineer designing and developing AI networking protocols for NVIDIA's cutting - edge technology. Collaborate with customers and handle all aspects of network driver development.
Senior Software Engineer developing high - performance diagnostic tools for NVIDIA’s networking platforms. Collaborating with teams for innovative solutions and ensuring hardware stability in high - performance computing environments.
Full - Stack Developer responsible for developing features and improving processes at GovTech startup SUMM AI. Building AI solutions that create societal value in the public sector.
Senior Engineer developing AI tools for an early stage startup in Munich. Expected to build AI Agents and enhance frontend and backend applications while collaborating with the Co - Founder.
Senior Software Engineer at Anansi Solutions developing impactful client projects in a hybrid environment. Collaborating with teams and building internal tools while mentoring junior professionals.