About the role

  • Data Engineer designing and building production data pipelines for AI and ML workloads at Capgemini Engineering. Focus on end-to-end data lifecycle management and AWS infrastructure.

Responsibilities

  • Design, build, and maintain research and production data pipelines spanning edge devices, cloud services, and centralized platforms
  • Own the full data lifecycle including collection, ingestion, processing, obfuscation, versioning, access, retention, and retirement
  • Develop resilient ingestion pipelines that handle device variability and connectivity challenges
  • Support secure data transfer from field environments to cloud storage
  • Collaborate with operations teams to improve data coverage, observability, and reliability
  • Implement privacy-preserving transformations and obfuscation pipelines
  • Build automated data cleaning and validation processes
  • Establish data lineage, retention policies, and access controls to ensure compliance and traceability
  • Provide scalable data services for training, evaluation, and research experimentation
  • Support continuous data refresh and retraining workflows
  • Build and optimize pipelines using AWS services such as S3, EC2, SageMaker, Lambda, Glue, and Step Functions

Requirements

  • Bachelor’s or master’s degree in computer science, data engineering, software engineering, or a related field
  • 2-3+ years of experience building production data pipelines and data platforms for AI or ML systems
  • Strong proficiency in Python, C++, and distributed data processing frameworks
  • Hands-on experience with AWS services including S3, EC2, SageMaker, and Glue
  • Experience designing data systems that support large-scale ML training and experimentation
  • Knowledge of data governance, access control, and lifecycle management
  • Experience working with ML, data science, operations, and cloud engineering teams

Benefits

  • Health insurance from the first day
  • Christmas holidays from 25 December to 31 December
  • Cooperation with the Superhumans Center and Veteran Hub
  • Psychological counseling provided by Veteran Hub

Job title

Senior Data Engineer

Experience level

Senior

Salary

Not specified

Degree requirement

Bachelor's Degree
