Data Engineer building solutions on AWS for high-performance data processing. Leading initiatives in data architecture and analytics for operational support.
Responsibilities
Design, develop, and maintain data pipelines using AWS Glue, AWS Lambda, Airflow, and complementary AWS services
Implement and optimize ETL/ELT processes for ingesting, transforming, and provisioning structured and unstructured data
Build distributed architectures using EC2, S3, IAM, CloudWatch, VPC, and other AWS components
Ensure data quality, integrity, and governance throughout the data lifecycle
Monitor systems, optimize performance, and reduce costs for data workloads
Collaborate with analytics, engineering, and product teams to advance data-driven solutions
Apply best practices for security, code versioning, and automation (CI/CD)
Requirements
Proven experience with AWS Glue (Jobs, Crawlers, Data Catalog)
Strong proficiency with Airflow for pipeline orchestration
Experience provisioning and operating EC2 instances
Advanced Python skills (including data manipulation libraries)
Experience with relational and non-relational databases (Redshift, RDS, DynamoDB, etc.)
Knowledge of S3, IAM, CloudWatch, SNS/SQS, and Lambda
Experience with Git and CI/CD tools
Familiarity with data modeling, Data Lake, and Data Warehouse concepts
Experience with Terraform or CloudFormation (preferred)
Knowledge of Redshift Spectrum, Athena, and Lake Formation (preferred)
Experience with big data environments (Spark, EMR, Databricks) (preferred)
Data Engineer designing data pipelines in Python for a major railway industry client. Collaborate with Data Scientists and ensure code quality with agile methodologies.
Senior Data Engineer responsible for building and optimizing data pipelines for banking analytics initiatives. Collaborating with data teams to ensure data quality and readiness for enterprise use.
Senior Data Engineer developing scalable data solutions on Databricks for analytics and operational workloads. Collaborating with cross - functional teams to modernize the data ecosystem.
Data Engineer focused on analytics and data pipeline development for network optimisation. Collaborating with teams to deliver high - quality data solutions with Python and SQL.
Senior Product Manager defining platform capabilities for Data Cloud in Salesforce. Collaborating with R&D teams while shaping product strategy for Data 360 integration.
Senior Data Engineer at Goodwin enhancing data platforms and fostering data - driven culture across teams. Collaborating with IT and Finance on technology solutions and data governance practices.
Director, Data Platform Design and Strategy at MedImpact leading data platform and AI innovations to enhance healthcare services. Overseeing enterprise projects and managing teams to meet strategic goals.
Data Engineer delivering AI - and data - driven solutions for Honeywell’s industrial customers. Architecting and implementing scalable data pipelines and platforms focused on IoT and real - time data processing.
Data Engineering Associate focusing on data quality control and management for distribution platform. Collaborates on large scale data projects to ensure data accuracy and availability for users.
Data Architect managing enterprise data platform built on Microsoft Fabric at Johnstone Supply. Leading architectural standards and collaborating with business and IT leaders for strategic data - driven insights.