Data Scientist delivering data science projects for Samba TV. Working on knowledge graphs, audience modeling, and mentoring junior team members.
Responsibilities
Own end-to-end delivery of significant data science projects — from problem scoping and approach design through to production deployment, with a focus on knowledge graph and identity solutions
Make sound, independently reasoned decisions on methodology, model selection, and evaluation; document them clearly in technical solution documents covering problem statement, approach, metrics, and timeline
Lead solution design for your own initiatives; break down complex epics into well-scoped user stories with clear acceptance criteria, adopting DataOps and MLOps best practices throughout — experiment tracking, pipeline orchestration, model monitoring, and reproducibility
Build production-quality Python and PySpark code on Databricks — well-tested, documented, and reusable — and implement advanced ML and AI-powered workflows including entity resolution, probabilistic record linkage, embedding-based matching, semantic similarity, and LLM-augmented pipelines
Develop and maintain reusable tools, libraries, and documentation that improve team efficiency and technical standards; conduct code reviews with constructive, specific feedback that raises the bar
Mentor junior data scientists on technical execution, code quality, and career development; lead internal talks or workshops on knowledge graphs, identity, or ML topics
Collaborate cross-functionally with product, engineering, and operations — translate business requirements into technical specifications, partner with data engineering on scalable pipeline design, and participate in cross-functional design reviews and working groups
Requirements
Bachelor's degree in Statistics, Data Science, Computer Science, Mathematics, or a related quantitative field required; Master's strongly preferred
3–5 years of hands-on data science experience with demonstrated ability to own and deliver complex, multi-sprint projects independently
Advanced Python with production-quality code, testing, and documentation; strong SQL and PySpark for billion-row datasets
Experience with Databricks workflows, Delta Lake, and job orchestration; working knowledge of cloud platforms (AWS or GCP)
Solid command of core ML — regression, classification, clustering, model evaluation, and experimental design — applied to complex, high-volume data
Proficiency with MLOps practices: experiment tracking, pipeline orchestration (Airflow), and reproducible model deployment
Exposure to modern AI methodologies: RAG systems, LLM-augmented models, vector databases, and semantic search
Strong communicator — able to translate technical work into clear documentation, user stories, and cross-functional conversations
Demonstrated ability to mentor junior data scientists and contribute to team standards
Data Scientist responsible for analyzing data to resolve business challenges at Fligoo. Working hybrid in Córdoba, applying AI solutions for various industries.
Data Scientist unlocking insights from complex data to solve mission-critical challenges at Booz Allen. Collaborating with data engineers and mission stakeholders for national security.
Data Scientist driving analytics and machine learning solutions for Gap Inc. Collaborating across teams to build and deploy data capabilities and influence business outcomes.
Principal Data Scientist leading advanced marketing analytics, optimizing investments at Walmart. Collaborating with multiple teams to drive customer engagement and growth strategies.
Analytics Lead in manufacturing driving data and analytics product features to enhance performance in a sustainability-focused company. Collaborating across teams to optimize technological solutions.
Associate Data Manager responsible for clinical data review and query management at Pfizer. Ensuring data integrity and collaboration within the team for operational excellence.
Director of Data Science leading technical data science initiatives for advertising products at Mastercard. Overseeing ML strategy, model deployment, and team collaboration in the Commerce Media platform.
Principal Data Scientist at Fidelity driving AI/ML innovations and solutions for financial growth. Collaborating cross-functionally to design and deploy advanced analytics and AI technology.
Senior Data Scientist at Enklare owning end-to-end ML models from data to production. Working across data engineering, data science and backend systems for financial impact.