Data Scientist delivering data science projects for media companies at Samba TV in Warsaw. Collaborating with engineering and mentoring junior data scientists on the team.
Responsibilities
Own end-to-end delivery of significant data science projects — from problem scoping and approach design through to production deployment, with a focus on knowledge graph and identity solutions
Make sound, independently-reasoned decisions on methodology, model selection, and evaluation; document them clearly in technical solution documents covering problem statement, approach, metrics, and timeline
Build production-quality Python and PySpark code on Databricks — well-tested, documented, and reusable — and implement advanced ML and AI-powered workflows including entity resolution, probabilistic record linkage, embedding-based matching, semantic similarity, and LLM-augmented pipelines
Lead solution design for your own initiatives; break down complex epics into well-scoped user stories with clear acceptance criteria, adopting DataOps and MLOps best practices throughout (experiment tracking, pipeline orchestration, model monitoring, reproducibility)
Develop and maintain reusable tools, libraries, and documentation that improve team efficiency and technical standards; conduct code reviews with constructive, specific feedback that raises the bar
Mentor junior data scientists on technical execution, code quality, and career development; lead internal talks or workshops on knowledge graphs, identity, or ML topics
Collaborate cross-functionally with product, engineering, and operations — translate business requirements into technical specifications and partner with data engineering on scalable pipeline design; participate in cross-functional design reviews and working groups
Requirements
Bachelor's degree required in Statistics, Data Science, Computer Science, Mathematics or related field; Master's preferred
5-7 years of hands-on data science experience with 1-2+ years in a direct people-management or team-lead role with demonstrated ability to develop, retain, and hire data scientists
Solid command of core statistics and ML - hypothesis testing, probability, regression, classification, clustering, model evaluation, and experimental design
Strong Python (pandas, NumPy, PySpark, scikit-learn) and SQL; Databricks or similar platform experience essential
Familiarity with MLOps practices: experiment tracking, pipeline orchestration (Airflow), reproducible model deployment
Detail-oriented and proactive in anticipating delivery risks
Comfortable running Agile ceremonies and maintaining consistent sprint cadence across a distributed team
Strong communicator - able to give direct, constructive feedback and advocate for your team to key stakeholders.
Benefits
Samba TV is an equal opportunity employer.
We celebrate diversity and are committed to creating an inclusive environment for all employees.
We strive to empower connection with one another, reflect the communities we serve, and tackle meaningful projects that make a real impact.
Senior Health Data Scientist leading complex data extraction and modeling for healthcare solutions at Inovalon. Collaborating with multidisciplinary teams to deliver data - driven insights.
Data Scientist developing machine learning solutions and delivering insights for operational decisions. Collaborating with stakeholders to apply analytical techniques and improve business outcomes.
Data Scientist responsible for modeling and analyzing credit risk at CAIXA Consórcio. Utilizing data - driven insights to support strategic decision - making in credit operations.
Data Scientist optimizing payments ecosystem for Preply, enhancing user experience through data - driven insights. Collaborating with teams to improve payment processes and fraud management.
Staff Data Scientist at Preply developing data strategies for product domains. Collaborating with executives to drive long - term strategy and experimentation frameworks.
Data Manager leading data strategy and governance for Global Payments Solutions at Bank of America. Managing data architecture aligning with business and regulatory needs while overseeing complex data ecosystems.
Data Scientist developing and implementing LLM - based agents and leveraging AI techniques to improve client value. Collaborating on project challenges in a dynamic, start - up environment at Gartner.
Data Scientist in AI SaaS integrating 100+ systems for a European unicorn - in - the - making. Ensure scalability, security, and performance in a high - growth environment.
Data Science Intern working on AI - driven recipe and hardware optimization problems in semiconductor processes. Developing machine learning models and collaborating with engineering teams for innovative solutions.
Senior Data Scientist at LexisNexis developing AI - driven solutions for legal analytics. Collaborating with teams to implement machine learning models and monitor performance metrics.