Data Scientist working on enhancing AI agent performance metrics and experiments. Analyzing data and collaborating with cross-functional teams to drive product improvements.
Responsibilities
Design and analyze experiments to measure agent improvements—from model changes to UX variations—with statistical rigor and practical tradeoffs.
Define success metrics that connect agent trace data (prompts, responses, code changes, execution outcomes) to user outcomes like successful deploys, retention, and revenue.
Build the semantic layer for agent data in partnership with data engineering—defining the tables, metrics, and models that enable self-serve analysis across the AI team.
Surface insights from trace analysis that identify failure modes, successful patterns, and opportunities to improve agent effectiveness.
Partner with AI engineering, product, and leadership to translate data into roadmap decisions; you'll have a seat at the table for critical agent strategy discussions.
Create dashboards and reporting that surface agent performance metrics (task completion, latency, quality scores, user satisfaction) for the AI team and executives.
Requirements
5+ years of experience in data science, analytics, or a quantitative role with a focus on product, growth, or experimentation.
Deep experimentation expertise: A/B testing, experiment design, power analysis, handling skewed data, interpreting results beyond p-values.
Strong SQL skills and experience designing data models for high-volume event data; experience with dbt or similar transformation tools.
Proficiency in Python and data science libraries (pandas, scipy, statsmodels, etc.).
Ability to translate ambiguous questions into structured analysis and communicate findings clearly to both technical and non-technical stakeholders.
Bias toward action: you ship insights that influence decisions, not just dashboards.
Data Lead at Bifrost Studios building core data and analytics systems for new ventures. Collaborating with founders to streamline operations and establish scalable infrastructures.
Data Science Engineer building the software and data infrastructure for media advertising at Medialab. A one-year university placement role focusing on AI and automation in data science.
Data Scientist for climate action progress tracking at C40. Analyzing climate-related data and managing data warehouses for performance measurement across global cities.
Product Data Scientist responsible for co-building an agentic AI framework. Collaborating with product teams while working within a hybrid model in Warsaw, Poland.
Staff Data Scientist at Clio developing scalable ML systems and strategic experimentation frameworks. Leading high-impact modeling projects and mentoring junior team members.
Manager, Data Science and AI delivering actionable insights and AI-powered analytics tools for Pfizer’s Commercial organization. Leading execution of AI/ML models and facilitating communication of data-driven insights.
Data Scientist at Votorantim Cimentos focusing on Sales & Marketing data initiatives and model development. Collaborating with stakeholders to deliver effective data solutions.
Senior Data Scientist at INSZO applying data-driven solutions for improved patient outcomes in healthcare. Working on complex challenges using large datasets and innovative algorithms.
Data Scientist leveraging expertise in statistics, machine learning, and AI at Simplot. Contributing to diverse, challenging projects to create business value.
Data Scientist analyzing data for informed decision-making at PayPal. Collaborating cross-functionally to enhance process quality and effectiveness in payment solutions.