Lead Data Scientist architecting and deploying scalable machine learning systems for real-time applications. Collaborate with cross-functional teams to deliver high-performance AI solutions in Bengaluru, India.
Responsibilities
Lead the architecture, development, and deployment of scalable machine learning systems for real-time inference.
Optimize inference pipelines using high-performance frameworks like vLLM, Groq, ONNX Runtime, Triton Inference Server, and TensorRT.
Design and implement agentic AI systems utilizing frameworks such as LangChain, AutoGPT, and ReAct.
Fine-tune, integrate, and deploy foundation models including GPT, LLaMA, Claude, Mistral, Falcon.
Develop and maintain robust MLOps workflows for the model lifecycle.
Collaborate with DevOps teams for scalable serving infrastructure.
Implement retrieval-augmented generation (RAG) pipelines with vector databases.
Build observability systems for LLMs to track performance metrics.
Work cross-functionally with various teams to deliver production-grade AI systems.
Stay updated on emerging AI trends and contribute to relevant initiatives.
Requirements
Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, or related fields.
Strong proficiency in Python and ML frameworks such as PyTorch, TensorFlow, and Hugging Face Transformers.
Deep knowledge of NLP, transformer-based architectures, and generative AI models.
Hands-on experience with scalable LLM inference optimization using tools like vLLM, Groq, Triton Inference Server, TensorRT, or ONNX Runtime.
Proven ability to serve AI models to concurrent users with low latency and high throughput.
Experience in deploying ML systems on cloud platforms (AWS, GCP, Azure).
Expertise in containerization (Docker), orchestration (Kubernetes), and CI/CD pipelines.
Familiarity with vector search technologies (FAISS, Pinecone, Weaviate) and RAG implementations.
Experience with agent-based AI frameworks, autonomous workflows, and prompt chaining (preferred).
Knowledge of fine-tuning methods like LoRA, PEFT, RLHF (preferred).
Benefits
Flexible work arrangements (remote/PAN India options)
Internship for AI in document processing at ArianeGroup, focusing on natural language processing and data analysis tasks in a collaborative environment.
Senior Data Scientist focused on Generative AI and LLM at Manulife. Develop and implement machine learning models to solve business problems and mentor peers.
Sr. Advanced Data Scientist leveraging advanced analytics and data science at Honeywell. Developing solutions for business growth and operational efficiency in the Atlanta office.
Data Scientist at Capital One leveraging technology to improve fraud prevention and customer safety. Collaborating with cross - functional teams to deliver industry - leading fraud defenses.
Data Scientist delivering insights for product and operations teams in Customer Support at Etsy. Using behavioral analysis to drive product development and strategy within a collaborative environment.
Data Scientist developing predictive models and enhancing investment strategies with AI at MDOTM. Collaborating in a dynamic research team to drive data - driven insights and innovative solutions.
Lead Data Scientist at Vizient developing automated analytics and advanced data science solutions. Collaborating with teams to improve clinical, operational, and economic outcomes.
Data Scientist developing analytic solutions and analyzing healthcare datasets for client decision - making. Collaborating with teams to build scalable analytics products and communicate insights.
Senior Data Scientist at Pinterest applying GenAI to build analytics solutions and data models. Collaborating across teams to improve data integration and pipeline management.