Principal Engineer at NVIDIA architecting next-generation diagnostic systems for Cloud Service Providers. Leading technical strategy and mentoring engineering teams for scalable infrastructures.
Responsibilities
Define technical strategy and development of NVIDIA’s Data Center diagnostic systems, orchestrating large-scale stress testing for CPUs, GPUs, networking, memory, and high-speed interconnects.
Mentor and grow engineering teams, providing technical leadership and encouraging a culture of innovation and excellence.
Drive the root-cause analysis of systemic failures that intersect multiple hardware and software domains.
Partner with CSPs to diagnose and address scalability challenges within their unique data center infrastructures.
Requirements
Bachelor's degree in Computer Science/Engineering, Electrical Engineering, or a related field (or equivalent experience).
15+ years of system software experience working on highly resilient distributed systems with programming experience in C++ or Python.
Deep systems knowledge of x86/ARM architectures, Linux OS internals, firmware (UEFI/BIOS), Redfish, HMC, BMC protocols and platform security.
Consistent track record demonstrating technical leadership leading project teams and setting technical direction.
Expertise in software testing methodologies with an automation-led, AI-first approach to ensuring software quality.
Benefits
equity
benefits
Job title
Principal System Software Engineer – Data Center MODS
Lead Software Engineer at GM Financial overseeing software development and team collaboration in AI technology. Engage in multi - developer projects and continuous improvement practices.
Principal Engineer leading software development for Wholesale business, focusing on network activation and provisioning systems. Collaborating with cross - functional teams to ensure high - quality deliverables.
Principal Engineer at Verizon leading design, development, and support of Wholesale suite of applications. Collaborating with resellers and internal teams to ensure customer experience and system compliance.
Engineering Technologist III/Senior Engineering Technologist providing technical expertise and leadership at Duke Energy. Involving complex problem - solving and ensuring business goals are met in a technical environment.
Lead Engineer developing advanced automation solutions for Duke Energy's Power Grid Operations. Managing projects and providing leadership in automation control within the Power Grid Operations Distribution system.
Senior Software Engineer at FundApps delivering high - impact software projects for compliance in financial services. Collaborating with team members to provide best - in - class solutions and drive business improvements.
Senior Tech Manager responsible for unifying digital experiences across Rabobank's platforms. Leading teams to ensure high - quality capabilities and customer engagement.
Tech Lead for Monitoring & Observability at Rabobank leading internal teams and offshore members. Ensuring technical and HR responsibilities for diverse engineering projects in a hybrid role.
Software Engineer focusing on network automation and infrastructure scalability in a tech company. Seeking an expert with solid networking fundamentals and experience in building automated solutions.
Software Engineer Intern delivering NetApp enterprise class software products. Collaborating with senior engineers to tackle data challenges and improve tiering solutions.