Senior Data Engineer designing and scaling data pipelines for a go-to-market platform. Build and own systems that create a high-quality first-party contact dataset.
Responsibilities
Build the enrichment platform: Design and scale pipelines that process 100M+ contact records, integrating with large-scale contact data vendors.
Own data quality: Build deduplication, entity resolution, and record matching systems that merge contacts from multiple sources into a single high-quality record.
Optimize vendor economics: Create waterfall enrichment logic that maximizes coverage and freshness while minimizing per-record cost across a portfolio of data vendors.
Ship freshness infrastructure: Build systems that detect job changes, flag stale records, and trigger re-enrichment to keep the dataset current.
Instrument and measure: Create quality scoring, coverage dashboards, and accuracy metrics that give the business visibility into dataset health.
Requirements
5+ years of data engineering experience, including 2+ years working with contact data, enrichment systems, or data infrastructure.
Deep experience with data vendor APIs and the economics of contact data (coverage rates, accuracy tradeoffs, cost per record).
Expert-level SQL and data modeling skills, with experience designing schemas for large-scale entity datasets.
Track record of building production data pipelines using modern tooling (dbt, Airflow, Dagster, Spark).
Experience with entity resolution, deduplication, and fuzzy matching at scale.
Strong understanding of data warehousing (Snowflake, ClickHouse, BigQuery) and performance optimization.
Business-minded: you understand that data quality directly impacts revenue and can articulate tradeoffs in terms the GTM team understands.
Benefits
Comprehensive benefits, including medical, dental, vision, and 401(k) options.