Solutions Architect at NVIDIA driving AI and ML solutions on cloud platforms. Collaborating with multi-functional teams and mentoring customers to improve GPU-enabled machine learning workflows.
Responsibilities
Help cloud customers craft, deploy, and maintain scalable, GPU-accelerated inference pipelines on cloud ML services and Kubernetes for large language models (LLMs) and generative AI workloads.
Enhance performance tuning using TensorRT/TensorRT-LLM, vLLM, Dynamo, and Triton Inference Server to improve GPU utilization and model efficiency.
Collaborate with multi-functional teams (engineering, product) and offer technical mentorship to cloud customers implementing AI inference at scale.
Build custom PoCs for solution that address customer’s critical business needs applying NVIDIA hardware and software technology
Partner with Sales Account Managers or Developer Relations Managers to identify and secure new business opportunities for NVIDIA products and solutions for ML/DL and other software solutions
Prepare and deliver technical content to customers including presentations about purpose-built solutions, workshops about NVIDIA products and solutions, etc.
Conduct regular technical customer meetings for project/product roadmap, feature discussions, and intro to new technologies.
Establish close technical ties to the customer to facilitate rapid resolution of customer issues.
Requirements
BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Statistics, Physics, or other Engineering fields or equivalent experience.
3+ Years in Solutions Architecture with a proven track record of moving AI inference from POC to production in cloud computing environments including AWS, GCP, or Azure
3+ years of hands-on experience with Deep Learning frameworks such as PyTorch and TensorFlow
Excellent knowledge of the theory and practice of LLM and DL inference
Strong fundamentals in programming, optimizations, and software design, especially in Python
Experience with containerization and orchestration technologies like Docker and Kubernetes, monitoring, and observability solutions for AI deployments
Payer Solutions Architect coordinating rollout of State and Payer solutions for homecare technology platform. Ensuring customer engagement and timely project implementations while managing multiple stakeholder needs.
Solutions Architect supporting IoT application architecture and optimization as a technical partner. Building long - term trust with clients through technical expertise and problem - solving.
Cloud Solutions Architect responsible for cloud computing initiatives at Cayuse. Designing and implementing cloud infrastructures and architectures that are scalable and cost - effective.
Solutions Architect leading technical onboarding and integrations for customers at OpenAsset. Utilizing Python and JavaScript expertise for workflow automation and system integrations.
Pre - Sales Solution Architect at FINBOURNE, collaborating with Sales Team to design solutions in financial services. Running demos, workshops, and creating custom solutions using Python and SQL.
Solution Architect in Deloitte, leading public sector digital transformation. Focusing on Identity and Access Management and overseeing Discovery phases.
Director of AI Solution Architecture at PwC leading the design and delivery of enterprise technology solutions. Fostering collaboration and building executive - level client relationships in complex projects.
Vice President, Solution Architect leading architecture for digital servicing applications. Overseeing development and design solutions for scalability, performance, and security in technical environments.
TRIRIGA Subject Matter Expert consulting for federal government agency providing analysis and guidance on software modernization initiatives. Collaborating with stakeholders in a hybrid environment.