Senior Data Engineer joining Financial Crime team to build data pipelines for fraud detection. Working with complex datasets in Databricks and collaborating with cross-functional teams.
Responsibilities
Build and optimize data ingestion pipelines using Python and PySpark to collect and transform data from multiple sources (transactions, KYC, AML, authentication, devices, logs, etc.).
**Proficiency in SQL (PostGres preferred) **
**Design and maintain data model that support Financial Crime/Fraud detection, profiling, and entity resolution. **
**Implement data quality checks and ensure data reliability across environments.
**Collaborate closely with Data Scientists, Analysts, Compliance, Operations and our Product/Feature teams to operationalize models and rules. **
Utilize jobs, workflows, APIs and streaming to manage large-scale data processing workloads.
Integrate with external systems (e.g. sanctions, ID&V, biometrics and authentication systems) to enrich risk and identity data.
Support **automation and monitoring** of ETL processes to improve operational efficiency.
Requirements
Bachelor’s degree.
**5+ years of experience **
**Strong skills in Python, PySpark, Scala and Advanced SQL (preferably PostGres) **
**Hands-on experience with Databricks, Snowflake, Fabric or similar **
**Experience working with structured and unstructured data in a production environment. **
**Experience with Agentic AI, MLFlow, ML models, Eval **
**Secure Coding practices – testing/QA **
**Comfortable with cloud-based data platforms (preferably AWS). **
**Good communication skills in English — able to collaborate with cross-functional teams in an international environment. **
**Proficiency in working with Text, Delta, Parquet, JSON, CSV, and XML data formats. **
**Working knowledge of Spark structured streaming. **
**AWS infrastructure experience, specifically working with S3. **
**Solid understanding of git-based version control, DevOps, and CI/CD. **
**Experience of working on Atlassian stack a plus. **
**Knowledge of common web API frameworks and web services. **
Strong teamwork, relationship, and client management skills, and the ability to influence peers and senior management to accomplish team goals.
Willingness to embrace modern technology, best practice, and ways of work.
**Nice to Have: **
Experience in **Financial Crime/AML, KYC, **or** fraud detection** systems.
Familiarity with **Entity Resolution frameworks** (e.g., Quantexa, Sensing, open source Entity Resolution tools).
Experience with **data streaming frameworks** (Kafka, Spark Streaming, MQ).
Benefits
Be part of a **mission-driven** team tackling real-world financial crime problems.
Work with **modern data tech stack** with Agentic AI and advanced ML.
**Hybrid working model **with flexible hours.
International and collaborative culture — working with colleagues across **Vietnam, Singapore, Philippines and South Africa**.
Competitive salary, performance bonuses, and learning support.
Senior Data Engineer at Keyrus leading the design, development, and delivery of scalable data platforms. Collaborating with teams to translate requirements into production - grade solutions and mentoring engineers.
Senior Data Engineer for global payments platform designing ETL pipelines and data models. Collaborating across teams to tackle complex data challenges in an innovative fintech environment.
Data Warehouse Modelling Engineer designing and maintaining data models using Data Vault 2.0 for iGaming industry. Collaborating with stakeholders and optimizing data models in a hybrid work environment.
Senior Data Engineer driving impactful data solutions for the climate logistics startup HIVED's core data platform. Collaborating with cross - functional squads to enhance analytics and delivery.
Data Engineer developing and maintaining CRE forecasting infrastructure for Cushman & Wakefield. Collaborates with senior economists and technical teams to ensure high - quality data solutions.
Data Engineer at PwC, engaging with Azure cloud services to enhance data handling and integrity. Responsibilities include pipeline optimizations, documentation, and collaboration with stakeholders.
Data Engineer Manager at PwC focusing on building data infrastructure and solutions. Leading data engineering projects to transform raw data into actionable insights and drive business growth.