Senior Software Architect leading the design of scalable communications software at NVIDIA. Focusing on AI and HPC performance improvements for cutting-edge technology solutions.
Responsibilities
Investigate opportunities to improve communication performance by identifying bottlenecks in today's systems
Design and implement new communication technologies to accelerate AI and HPC workloads
Explore innovative solutions in HW and SW for our next generation platforms as part of co-design efforts involving GPU, Networking, and SW architects
Build proofs-of-concept, conduct experiments, and perform quantitive modeling to evaluate and drive new innovations
Use simulation to explore performance of large GPU clusters (think scales of 100s of 1000s of GPUs)
Requirements
M.S./Ph.D. degree in CS/CE or equivalent experience
5+ years of relevant experience
Excellent C/C++ programming and debugging skills
Experience with parallel programming models (MPI, SHMEM) and at least one communication runtime (MPI, NCCL, NVSHMEM, OpenSHMEM, UCX, UCC)
Deep understanding of operating systems, computer and system architecture
Solid in fundamentals of network architecture, topology, algorithms, and communication scaling relevant to AI and HPC workloads
Strong experience with Linux
Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment
Benefits
equity
benefits
Job title
Senior Software Architect – Deep Learning, HPC Communications
Fullstack Engineer developing Customer & Marketing web applications for a leading European retail company. Collaborating with teams on innovative AI solutions and technology integration.
Technical Lead in hybrid role for Weekday's client, focusing on software development in Java/Python and cybersecurity solutions. Leading initiatives for scalable systems and best practices across teams.
Senior Staff Engineer driving architectural excellence within Matillion's AI - powered data integration platform. Leading technical design and strategy for scalable data solutions with a focus on collaboration.
Software Engineer developing high - quality legal software solutions for Cisco. Partnering with cross - functional teams and applying secure coding principles for system integrity and performance.
Staff Engineer leading a product team at Beamery, a transformational AI platform in HR technology. Designing scalable software and providing technical mentorship in a hybrid role.
iOS Engineer developing new financial services with Merpay, focusing on individual credit business in Japan. Collaborating with cross - functional teams to improve user experience and product quality.
Tech Lead managing development teams across mobile, web, and backend at Lotus's. Overseeing software solutions while ensuring technical excellence and high - quality code across projects.
Staff Engineer developing solutions with agile teams and mentoring junior engineers. Focused on leading development initiatives utilizing CI/CD, .NET, and web services.
Software Engineer developing and supporting client - server applications for gaming technology at Light & Wonder. Collaborating with teams to build reliable and scalable software solutions.
Associate Director role leading software development and team collaboration at RBC. Designing and building robust Java applications while mentoring a high - performing development team.