Senior Deep Learning Software Engineer focused on optimizing PyTorch with TensorRT for NVIDIA accelerators. Collaborating with diverse teams to enhance performance in generative AI and more.
Responsibilities
Analyze performance issues and identify performance optimization opportunities inside Torch-TensorRT/TensorRT.
Contribute features and code to NVIDIA/OSS inference frameworks including but not limited to Torch-TensorRT/TensorRT/PyTorch.
Work with cross-collaborative teams inside and outside of NVIDIA across generative AI, automotive, robotics, image understanding, and speech understanding to develop innovative inference solutions.
Scale performance of deep learning models across different architectures and types of NVIDIA accelerators.
Requirements
Bachelors, Masters, PhD, or equivalent experience in relevant fields (Computer Science, Computer Engineering, EECS, AI).
At least 4 years of relevant software development experience.
Excellent Python/C++ programming, software design and software engineering skills.
Experience with a DL framework like PyTorch, JAX, TensorFlow.
Experience with performance analysis and performance optimization.
Benefits
Equity
Benefits
Job title
Senior Deep Learning Software Engineer, PyTorch - TensorRT Performance
Staff Software Engineer driving development of Cloudera's AI and machine learning platform. Collaborating with cross - functional teams to create scalable enterprise applications.
Staff OpenSearch Engineer driving technical vision and mentoring at Cloudera. Leading scalable search infrastructure design for data discovery and analytics.
Software Engineer contributing to Cloudera's Data Engineering Experience and Apache Spark Team. Implementing scalable solutions and collaborating with distributed teams on large - scale data challenges.
Tech Lead responsible for guiding global teams in agile software delivery and technical discussions. Focused on engineering excellence and mentoring within Fidelity's architecture team.
Software Engineer developing a digital maintenance assistant that reduces unplanned downtime through predictive maintenance. Analyzing machine data and enhancing customer applications with ownership of the data warehouse.
Senior Software Engineer building an AI - powered content generation platform for educators. Developing features with React and TypeScript, ensuring high standards for code quality.
Software Engineering & AI Intern developing internal automation and AI - driven solutions at Aspen Power. Supporting operational efficiency through workflows, applications, and collaboration with teams.
Senior FullStack Engineer developing Python backend services for fintech reimagining consumer lending. Collaborating across teams to support financial operations and AI integrations.
Principal Engineer developing firmware for Flashtec NVMe Controllers at Microchip Technology Inc. Involved in design and implementation of controller firmware within a global organization environment.