Data Engineer responsible for developing ETL pipelines and ensuring data solutions in AWS data lakes/warehouses. Collaborating with teams and focusing on data quality, compliance, and performance optimization.
Responsibilities
Design, develop, and maintain ETL pipelines using AWS Glue, Glue Studio, and Glue Catalog.
Ingest, transform, and load large datasets from structured and unstructured sources into AWS data lakes/warehouses.
Work with S3, Redshift, Athena, Lambda, and Step Functions for data storage, query, and orchestration.
Build and optimize PySpark/Scala scripts within AWS Glue for complex transformations.
Implement data quality checks, lineage, and monitoring across pipelines.
Collaborate with business analysts, data scientists, and product teams to deliver reliable data solutions.
Ensure compliance with data security, governance, and regulatory requirements (BFSI preferred).
Troubleshoot production issues and optimize pipeline performance.
Requirements
9+ years of experience in Data Engineering, with at least 5+ years on AWS cloud data services.
Data Engineer managing customer datasets to enhance industrial research and development. Responsible for ETL pipelines and data ingestion for the Uncountable Web Platform.
Data Engineer designing and maintaining scalable data solutions on Databricks for clinical trials. Collaborating with teams to overcome data challenges and ensure the smooth logistics of clinical supplies.
Senior Manager leading a team of database engineers to manage CCC's data platform. Overseeing mission - critical applications and collaborating with cross - functional teams in a hybrid environment.
As a Principal Data Architect at Solstice, lead the design and implementation of data architecture solutions. Ensure data integrity, security, and accessibility to meet strategic organizational goals.
Data Platform Specialist overseeing data workflows and enhancing data quality for Stackgini's AI - driven IT solutions. Collaborating with teams to drive improvements and stakeholder support.
Data Engineer designing data pipelines in Python for a major railway industry client. Collaborate with Data Scientists and ensure code quality with agile methodologies.
Senior Data Engineer responsible for building and optimizing data pipelines for banking analytics initiatives. Collaborating with data teams to ensure data quality and readiness for enterprise use.
Senior Data Engineer developing scalable data solutions on Databricks for analytics and operational workloads. Collaborating with cross - functional teams to modernize the data ecosystem.
Data Engineer focused on analytics and data pipeline development for network optimisation. Collaborating with teams to deliver high - quality data solutions with Python and SQL.
Senior Product Manager defining platform capabilities for Data Cloud in Salesforce. Collaborating with R&D teams while shaping product strategy for Data 360 integration.