Staff Software Engineer leading architecture and delivery of cloud-native AI platform for Cloudera. Optimizing AI stack and ensuring seamless integration for enterprises.
Responsibilities
Design and implement elegant, scalable application services (Go/Node.js) that wrap AI capabilities for enterprise use.
Lead the deployment of inference servers (vLLM, Triton) using KServe, KubeRay, or Knative to ensure serverless-style scaling for AI workloads.
Build internal tooling, SDKs, and "AI Gateways" that enhance team agility and simplify the integration of Foundation Models (Llama, GPT) into product features.
Architect robust Retrieval-Augmented Generation (RAG) pipelines and prompt management services that integrate seamlessly with vector databases and enterprise data sources.
Partner with UI engineers, UX designers, and Product Management to ensure the AI platform is not just powerful, but highly usable for internal developers.
Ensure AI workloads are secure, multi-tenant, and optimized for GPU resource scheduling (MIG, fractional GPUs) within Kubernetes.
Requirements
Bachelor’s degree with 6+ years of software engineering experience (or equivalent Masters/PhD tenure), with at least 2+ years focused on AI/ML systems.
Expert proficiency in Python (for AI ecosystem) and strong competence in a systems language like Go or Rust/C++ (for high-performance serving layers).
Deep understanding of LLM deployment challenges and runtimes (e.g., vLLM, ONNX, TorchServe, Triton).
Familiarity with quantization techniques (AWQ, GPTQ) to optimize model size/speed.
Experience building complex workflows using tools like LangChain or LlamaIndex, and deploying them on containerized infrastructure (Docker/Kubernetes).
Ability to navigate the rapidly changing AI landscape, filtering hype from practical engineering solutions, and driving technical alignment across teams.
Lead software engineering projects as a technical expert within Truist's financial services, focusing on complex solutions. Collaborate with multiple stakeholders to design and implement innovative systems.
Senior Software Developer focused on XR applications at Light & Wonder. Developing production - ready Unity applications for Meta Quest and optimizing XR experiences.
Software Engineer in PNC's Technology organization focusing on API testing and Selenium automation. Involves designing software solutions and collaborating with cross - functional teams.
Software Developer Internship at HP, applying classroom learnings to real - world challenges. Join a diverse team focused on technology innovation and development.
Software Engineer developing tools to enhance developer productivity at Nordstrom. Collaborating with teams to design and implement DevOps solutions in a hybrid environment.
Principal Engineer optimizing standard cell libraries for Intel's foundry customers. Collaborating with teams to enhance performance, power and area on leading edge process nodes.
Lead Software Engineer at Icertis, proficient in .NET, C#, Azure, and client - side technologies. Delivering high - quality, scalable solutions and guiding a team of developers.
Senior Software Development Engineer developing cutting - edge software features for Intel's communication libraries. Collaborating with diverse teams to optimize performance in HPC/AI systems.
Principal Engineer designing mixed - signal IPs for Microchip Technology. Collaborating with SoC architects and managing IP intake processes for advanced analog solutions.