Principal Data Engineer at Deputy creating a scalable data platform for frontline workers' management. Leading architecture design, technical practices, and mentoring a skilled data team.
Responsibilities
Architect and Evolve Our Core Data Platform: You will own the technical vision and roadmap for our data platform, steering its evolution on our modern cloud stack and ensuring it meets the demands of a rapidly scaling business.
Own the Architecture: Design, implement, and refine a robust data lakehouse architecture (e.g., Medallion) using Databricks and Delta Lake to ensure data reliability and performance.
Build Scalable Ingestion Frameworks: Develop and maintain resilient, reusable patterns for ingesting data from a diverse set of sources, including our systems, transactional databases, event streams, and third-party SaaS APIs.
Define Data Modelling Standards: Lead the implementation of our core data modelling principles (e.g., Kimball dimensional modelling) to produce curated, intuitive datasets for business intelligence and product analytics.
Implement Robust Governance: Use tools like Unity Catalog to establish a comprehensive data governance framework, covering data lineage, fine-grained access controls, and a user-friendly data catalogue.
Manage Platform Performance and Cost: Develop and implement strategies for monitoring, optimising, and forecasting our Databricks and cloud expenditure, ensuring the platform is both powerful and cost-effective.
Champion Engineering Excellence and Best Practice: You will be the driving force for maturing our data operations, embedding a culture of quality, automation, and reliability into everything we do.
Automate Everything with CI/CD: Implement and advocate for automated CI/CD pipelines (e.g., using GitHub Actions) for all data assets, including dbt models, infrastructure changes, and Databricks jobs.
Embed Git-Based Workflows: Champion a Git-first culture for all data transformation code, establishing clear processes for branching, code reviews, and version control.
Embed Automated Data Quality: Implement comprehensive, automated data quality testing at every stage of our pipelines using tools like dbt test, ensuring data is accurate and trustworthy.
Introduce Data Observability: Establish thorough monitoring, logging, and alerting for all data pipelines to proactively detect, diagnose, and resolve issues before they impact the business.
Be a Strategic Partner Across the Business: You will connect the technical capabilities of the data platform to Deputy's strategic objectives, acting as a key advisor to stakeholders across the organisation.
Translate Business Needs into Technical Solutions: Collaborate directly with leaders in Product, Engineering, Sales, finance and Customer Success to understand their challenges and design data solutions that enhance our product, improve customer outcomes, and drive business strategy.
Guide Data Best Practices: Advise analysts, data scientists, and other stakeholders on how to best leverage the data platform for impactful analysis and data-driven decision-making.
Act as the Technical Authority: Serve as the go-to expert on our data architecture, running workshops and design sessions to align technical direction with business needs.
Lead, Mentor, and Elevate Our Data Team: As a technical member of the team, you will be instrumental in upskilling your colleagues and shaping the future of the data function at Deputy.
Mentor and Coach: Actively mentor data analysts and engineers through pair programming, constructive code reviews, and technical guidance to grow their skills in Python, SQL, and data modelling.
Foster a Community of Practice: Lead initiatives like a 'data guild' to encourage knowledge sharing, explore new technologies, and collaboratively solve complex problems.
Shape the Team's Future: Partner with data leadership to define career progression pathways for data engineering and take a leading role in interviewing and hiring new team members.
Requirements
Mastery of data architecture principles, data modelling frameworks (e.g., dimensional modelling), and a strong understanding of data governance and security best practices.
A strong software engineering mindset, with significant experience implementing CI/CD for data, Git-based workflows, and automated data quality testing.
Exceptional communication and stakeholder management skills, with a proven ability to translate complex technical concepts for non-technical audiences and influence business decisions.
A genuine passion for leadership and mentorship, with a track record of elevating the technical skills of those around you.
Tech Stack: Dbt, Databricks, Unity Catalog, Terraform, AWS: Redshift, Dynamo db, API gateway, Cloud Watch, Lambda, Streaming with Kenisis/Firehose, Glue, Bedrock, Stitch & Fivetran, Languages required include advanced SQL, python
Benefits
Enjoy a flexible remote-first work policy (with a work-from-home stipend to set you up for success!)
Own A piece of Deputy via our Employee Share Ownership Plan (ESOP)
Take paid parental leave to support you and your family
Stay protected with Group Salary Continuance Insurance
Access support through our Employee Assistance Program
Enjoy additional leave days — including study assistance, celebration days and volunteering
Join our global working groups focused on collaboration, belonging and connection
Get creative at our annual Hackathons
Take advantage of our novated leasing for electric vehicles, internet reimbursement and more!
As a Principal Data Architect at Solstice, lead the design and implementation of data architecture solutions. Ensure data integrity, security, and accessibility to meet strategic organizational goals.
Data Platform Specialist overseeing data workflows and enhancing data quality for Stackgini's AI - driven IT solutions. Collaborating with teams to drive improvements and stakeholder support.
Data Engineer designing data pipelines in Python for a major railway industry client. Collaborate with Data Scientists and ensure code quality with agile methodologies.
Senior Data Engineer responsible for building and optimizing data pipelines for banking analytics initiatives. Collaborating with data teams to ensure data quality and readiness for enterprise use.
Senior Data Engineer developing scalable data solutions on Databricks for analytics and operational workloads. Collaborating with cross - functional teams to modernize the data ecosystem.
Data Engineer focused on analytics and data pipeline development for network optimisation. Collaborating with teams to deliver high - quality data solutions with Python and SQL.
Senior Product Manager defining platform capabilities for Data Cloud in Salesforce. Collaborating with R&D teams while shaping product strategy for Data 360 integration.
Senior Data Engineer at Goodwin enhancing data platforms and fostering data - driven culture across teams. Collaborating with IT and Finance on technology solutions and data governance practices.
Director, Data Platform Design and Strategy at MedImpact leading data platform and AI innovations to enhance healthcare services. Overseeing enterprise projects and managing teams to meet strategic goals.
Data Engineer delivering AI - and data - driven solutions for Honeywell’s industrial customers. Architecting and implementing scalable data pipelines and platforms focused on IoT and real - time data processing.