Data Engineer responsible for designing and maintaining data pipelines at SOPHiA GENETICS. Collaborating with stakeholders to deliver actionable insights through robust data models and workflows.
Responsibilities
Designing, building, and maintaining the data pipelines and transformations that power analytics and decision-making across SOPHiA GENETICS.
Working closely with stakeholders from different functions to understand their use cases, translate their needs into robust data models and workflows, and ensure that data is reliable, well-structured, and ready for downstream analysis and visualization.
Building scalable data solutions, thinking critically about what the data means, how it is produced, and how it can be used to generate actionable insights.
Design and maintain scalable, reliable data pipelines and ETL processes that integrate seamlessly and perform efficiently across the platform.
Optimize data workflows, queries, and resource utilization to maximize performance and cost-efficiency as data volume and complexity grow.
Develop and implement new data features and transformations that translate stakeholder use cases into production-ready solutions.
Automate deployment of data jobs, CI/CD pipelines, and infrastructure configurations to ensure consistency and reproducibility.
Implement monitoring and observability across data quality, pipeline health, and performance to detect issues early and ensure reliability.
Requirements
2-5 years of experience working within Data Engineer (distributed data, data lakes, microservice-oriented architectures, and APIs)
BA/MA in Computer Science or Engineering or equivalent professional experience
Expertise with Python ETLs in a data processing environment, ideally Databricks
Expertise with distributed big data architectures (schemas, transfers, storage, partitioning, performance monitoring and optimization)
Solid knowledge of modern scalable database and data lake technologies, especially Spark & SQL, but also including Parquet & Delta tables.
Experience with containerization and orchestration technologies, as well as basic DevOps processes and tooling
Experience with software engineering best-practices, Agile, CI/CD, Unit & integration testing
Experience with multimodal data spanning of digital healthcare, clinical, radiomics and genomics (is a plus)
As a public organisation facing ongoing commercial growth, you will bring a success-orientated and solutions-focused mindset that embraces team collaborations, change, growth and inclusion.
As an international organisation, English is our primary business language and you will need to bring full fluency in English. As part of your recruitment journey, you should expect to meet English-only speakers, so for best chances of success, you should include your CV in English. Non-English CVs will be rejected at application stage.
Benefits
Opportunity to work on cutting-edge research projects with an immediate global impact
A flexible, friendly and international working environment with a collaborative atmosphere
An exciting company mission that brings together science and technology to directly impact the lives of patients with life threatening illness
A fast-growing company with plenty of opportunity for personal growth and development
A hard technical challenge to solve with exciting modern technology - cloud computing, Big Data, DevOps, machine learning
Forfait-Jour working types
Health benefits for you and your family covered by 80% employer contributions
Life Insurance and pensions contribution
SWILE meal vouchers and home office allowances
25 Days Vacation
Additional voluntary benefits including sports allowance, language courses, bank partnerships and transportation.
Director of Data Engineering leading data architecture and analytics at Petfolk. Overseeing data infrastructure and managing a data team to drive AI and business intelligence solutions.
Senior Data Engineer managing end - to - end data pipelines with Google Cloud Platform. Collaborating closely with product teams to deliver scalable data solutions in a hybrid setting.
GCP Data Engineer designing, building, and optimising data solutions on Google Cloud Platform. Collaborating with clients to solve complex data challenges and enhance AI capabilities.
Data Engineer developing scalable data solutions across multi - cloud environments for clients. Mentoring junior engineers while ensuring data quality and promoting best practices within the team.
Consultant Data Engineer for modern data transformations at Intelligen. Work on dbt and Snowflake projects for enterprise clients to optimize data pipelines.
Data Engineer designing and delivering modern data solutions across multi - cloud environments for clients in Australia. Collaborating and mentoring while contributing to meaningful projects in a high - performing team.
Principal Product Manager leading GEICO's Customer Data Platform development and strategy. Collaborating with cross - functional stakeholders to improve customer engagement through data - driven solutions.
Senior Product Manager at GEICO managing the Customer Data Platform, collaborating across teams for data - driven solutions. Evolving customer engagement using innovative data strategies.
Data Engineering Lead at Absa enabling analytics and AI through scalable data platforms. Overseeing engineering teams to deliver high - quality, trusted data solutions.