Senior Data Scientist developing advanced ML and Generative AI applications at Emerson. Leading projects in AI solutioning, deployment, and cloud-based environments.
Responsibilities
Design, develop, and deploy advanced ML, DL and Generative AI models (LLMs, Transformers, Diffusion, GANs, VAEs) for NLP, multimodal, forecasting, recommendation, and intelligent automation use cases.
Lead data-driven GenAI solutioning, including problem framing, feature engineering, dataset curation and statistical validation for enterprise AI applications.
Perform exploratory data analysis (EDA), bias detection, data quality checks, and advanced feature engineering on structured and unstructured data (text, image, tabular).
Monitor and manage data drift, model drift, hallucination risks and performance degradation, and define retraining and recalibration strategies.
Engineer end-to-end RAG pipelines and multi-agent workflows using LangChain, Copilot, MCP.
Build and optimize RAG pipelines, semantic search, and contextual reasoning workflows using embeddings, chunking strategies, hybrid retrieval, and reranking techniques.
Integrate vector DBs (PostreSQL, Pinecone, Weaviate, Chroma, FAISS, Redis) and graph DBs (Neo4j, Postgres/pgvector) for semantic retrieval and contextual reasoning.
Optimize foundation models (GPT, LLaMA, Mistral, Falcon) via prompt engineering, RLHF, LoRA, quantization, and hyperparameter tuning.
Build scalable AI solutions using Azure AI/ML (preferred) with containerized deployments (Docker, Kubernetes).
Apply MLOps, LLMOps best practices: CI/CD, model versioning, drift detection, observability, and lifecycle management with MLflow, Kubeflow, Airflow, and monitoring tools.
Develop secure AI pipelines and APIs with Python, FastAPI/Flask, RBAC, OAuth2, JWT, and encryption standards.
Conduct model optimization: prompt engineering, hyperparameter tuning, cross-validation, and performance monitoring.
Use tools like Azure Machine Learning, OpenAI API, HuggingFace, or custom PyTorch/TensorFlow-based models.
Implement AI safety, bias mitigation, interpretability (SHAP, LIME, and compliance guardrails (GDPR, HIPAA, ISO).
Collaborate with cross-functional teams to deliver enterprise-grade copilots, assistants, and reusable AI components.
Document AI design, model workflows, and deployment pipelines for audit readiness and knowledge sharing.
Requirements
Bachelor's or Master's degree in Computer Science, Data Science, Statistics, Mathematics, or a related field over 7+ years.
Proven experience as a Data Scientist Developer or in a similar role and proficiency in Python
Experience with AI on Azure (must), including Azure OpenAI, Azure ML, and related services.
Deep hands-on experience in Python, CUDA, SQL, proficient with TensorFlow, PyTorch, Keras.
Familiarity with security, bias mitigation, and responsible AI frameworks.
Experience with MLOps practices and tools for deploying, tracking, and updating models.
Excellent problem-solving, communication, and team collaboration skills.
Preferred Qualifications: Certifications in AI/ML from Microsoft, AWS or Coursera/edX, Exposure to enterprise use cases in industries such as manufacturing, finance and other, Experience with AutoML, LLMOps, and performance benchmarking tools, Understanding of semantic search, knowledge graphs, and contextual recommendation engines, Hands on MLOps experience, with an appreciation of the end-to-end CI/CD process, Certified in Azure AI Fundamentals (AI-900), Azure AI Engineer Associate (AI-102), Azure Developer Associate, Experience with big data technologies
Senior Health Data Scientist leading complex data extraction and modeling for healthcare solutions at Inovalon. Collaborating with multidisciplinary teams to deliver data - driven insights.
Data Scientist developing machine learning solutions and delivering insights for operational decisions. Collaborating with stakeholders to apply analytical techniques and improve business outcomes.
Data Scientist responsible for modeling and analyzing credit risk at CAIXA Consórcio. Utilizing data - driven insights to support strategic decision - making in credit operations.
Data Scientist optimizing payments ecosystem for Preply, enhancing user experience through data - driven insights. Collaborating with teams to improve payment processes and fraud management.
Staff Data Scientist at Preply developing data strategies for product domains. Collaborating with executives to drive long - term strategy and experimentation frameworks.
Data Manager leading data strategy and governance for Global Payments Solutions at Bank of America. Managing data architecture aligning with business and regulatory needs while overseeing complex data ecosystems.
Data Scientist developing and implementing LLM - based agents and leveraging AI techniques to improve client value. Collaborating on project challenges in a dynamic, start - up environment at Gartner.
Data Scientist in AI SaaS integrating 100+ systems for a European unicorn - in - the - making. Ensure scalability, security, and performance in a high - growth environment.
Data Science Intern working on AI - driven recipe and hardware optimization problems in semiconductor processes. Developing machine learning models and collaborating with engineering teams for innovative solutions.
Senior Data Scientist at LexisNexis developing AI - driven solutions for legal analytics. Collaborating with teams to implement machine learning models and monitor performance metrics.