Lead Data Scientist developing NLP-driven solutions for large volumes of unstructured data. Join Neuron7.ai, an AI-first SaaS company pushing the boundaries of service intelligence.
Responsibilities
Lead the development and deployment of NLP-based solutions to process and analyze unstructured data at scale.
Design, train, and optimize machine learning models using libraries such as PyTorch, NLTK, and Scikit-learn.
Architect and deploy AI/ML products on cloud platforms like Azure, GCP, or AWS.
Collaborate with data engineering teams to ensure seamless integration of AI models into production systems.
Perform advanced SQL analytics to extract actionable insights from structured datasets.
Stay up to date with the latest advances in NLP and machine learning.
Mentor junior data scientists and foster a culture of technical excellence within the team.
Communicate complex technical concepts to non-technical stakeholders and customers.
Partner with customers to understand their needs and translate them into technical solutions.
Requirements
Minimum 8 years of experience in data science, with a focus on NLP and unstructured data processing.
Proven track record of launching NLP-driven products to live users.
Expertise in Python and widely used libraries such as PyTorch, NLTK, and Scikit-learn.
Experience with Transformer-based models (e.g., BERT, GPT).
Develop, train, and optimize ML and deep learning models (classification, regression, clustering, sequence modeling, embeddings).
Implement and fine-tune transformer-based models such as BERT, GPT-style LLMs, and domain-specific architectures.
Build and deploy RAG (Retrieval-Augmented Generation) pipelines, vector databases, embedding models, and prompt optimization workflows.
Strong experience with one or more cloud platforms (Azure, GCP, AWS) for hosting and deploying AI/ML products.
Design and implement NLP pipelines for text classification, information extraction, topic modeling, semantic search, summarization, and conversational AI applications.
Fine-tune pretrained LLMs and Hugging Face models for domain-specific tasks.
Develop custom tokenizers, embeddings, and text-processing architectures.
Familiarity with data engineering pipelines and best practices.
Proficiency in SQL for analytics and data manipulation.
Build, evaluate, and deploy GenAI models for text generation, document processing, knowledge retrieval, and agent-based automation.
Integrate LLMs into production systems using APIs, LangChain, LlamaIndex, or custom frameworks.
Design safety, evaluation, and monitoring processes for GenAI deployments.
Excellent problem-solving skills and ability to work with large-scale datasets.
Strong interpersonal and communication skills, with the ability to mentor team members and interact with customers effectively.
Work with large-scale datasets using Python, SQL, Spark, Databricks, or cloud data platforms.
Build ETL/ELT pipelines, feature stores, and model-serving infrastructure.
Deploy ML models into production environments using Docker, Kubernetes, and CI/CD pipelines.
Implement monitoring, observability, and retraining workflows.
Mentor junior data scientists and provide technical oversight for AI/ML projects.
Collaborate with cross-functional teams to define model requirements and success metrics.
Own the full ML lifecycle from research to deployment and ongoing maintenance.