Principal Data Engineer responsible for architecting scalable data pipelines and building high-quality data foundations. Collaborating closely with experts to ensure data readiness for advanced analytics.
Responsibilities
Architect and implement scalable data pipelines for batch and real-time ingestion and processing.
Build sophisticated transformations for attribute extraction, normalisation, and entity resolution.
Develop knowledge infrastructure, including metadata layers, product graphs, and ontologies.
Collaborate with domain experts to define taxonomies and classification schemas.
Enforce data contracts and validation rules to ensure consistency and lineage across the organisation.
Promote engineering best practices around testing, documentation, and observability for data workflows.
Requirements
Mastery of Apache Beam, Dataflow, and Pub/Sub in a cloud-native environment.
Expert knowledge of SQL, dbt, BigQuery, and distributed computing frameworks.
Extensive experience building production-grade pipelines for large-scale data.
Strong analytical thinking and the ability to collaborate across engineering, ML, and product teams.
Proven track record of independently researching and applying new data technologies.
Nice to have: Background in building agentic systems or personalised recommendation engines.
Experience with various data formats including Avro and Protobuf.
Google Cloud Professional Data Engineer certification.
Benefits
10 days PTO + 17 days paid public holidays
Pension
Law 19032 (Social Security)
Family allowance
National Employment Fund
Accident Insurance
Life Insurance
Work from Home Allowance
Private Medical Insurance
Birthday leave
10 paid learning days per year
Bonusly 100 points per month to recognise colleagues
Senior Data Engineer at Goodwin enhancing data platforms and fostering data - driven culture across teams. Collaborating with IT and Finance on technology solutions and data governance practices.
Director, Data Platform Design and Strategy at MedImpact leading data platform and AI innovations to enhance healthcare services. Overseeing enterprise projects and managing teams to meet strategic goals.
Data Engineer delivering AI - and data - driven solutions for Honeywell’s industrial customers. Architecting and implementing scalable data pipelines and platforms focused on IoT and real - time data processing.
Data Engineering Associate focusing on data quality control and management for distribution platform. Collaborates on large scale data projects to ensure data accuracy and availability for users.
Data Architect managing enterprise data platform built on Microsoft Fabric at Johnstone Supply. Leading architectural standards and collaborating with business and IT leaders for strategic data - driven insights.
Data Engineer at Studyportals responsible for data pipelines and infrastructure. Join a team ensuring accurate and trustworthy data for analytics and business decisions.
AI/ML Engineer designing and refining prompts and workflows using large language models. Responsible for developing data pipelines and delivering scalable AI solutions in a hybrid work environment.
AWS Data Architect at Fractal designing and operationalizing AWS data solutions at enterprise scale. Collaborating with clients and mentoring engineers in best practices.
Senior Data Engineer driving data - driven success at Pacific Life. Collaborating with a team to build scalable and secure data solutions in Newport Beach, CA or Charlotte, NC.
Data Architect managing Commercial Data architecture initiatives for Valmet's sales and service team. Leading AI - driven data integrity and quality efforts in a global context.