AWS Data Engineer designing and deploying infrastructure using Terraform for a modern Data Lakehouse. Driving ETL processes, data ingestion, and analytics using AWS technologies.
Responsibilities
Design, develop, and deploy AWS infrastructure using Terraform, including S3, Glue, IAM, Lake Formation, and Athena resources
Develop and maintain AWS Glue ETL jobs (PySpark or Python shell) for data ingestion, transformation, and curation across raw → clean → curated layers
Integrate Airflow (Amazon MWAA or self-managed) for orchestrating Glue jobs, data pipelines, and dependencies
Build and maintain Glue Catalog, manage metadata, and align with Lake Formation security policies
Write complex SQL queries for data validation, transformation, and reporting logic, ensuring efficient query performance
Manage Terraform state files, backend setup (S3 + DynamoDB), and environment-based deployments
Implement data ingestion frameworks for batch and near real-time pipelines
Collaborate with Snowflake and BI teams for seamless data consumption
Contribute to high availability (multi-AZ) and disaster recovery (multi-region) strategies for core data components
Requirements
6 years of experience as a Data Engineer or Cloud Engineer
Strong expertise in AWS Services: S3, Glue, Glue Catalog, Lake Formation, IAM, Athena, CloudWatch, Lambda (preferred)
Hands-on proficiency in Terraform (HCL) for infrastructure automation
Experience with Airflow DAGs for orchestration of Glue, S3, and external data flows
Solid understanding of PySpark / Python for ETL scripting
Strong ability to write and optimize complex SQL (joins, window functions, CTEs, and analytical queries)
Familiarity with data lake formats (Iceberg, Parquet, Delta, etc.)
Experience with CI/CD pipelines (GitHub Actions, CodePipeline, or Jenkins)
Benefits
Flexible work
Healthcare including dental, vision, mental health, and well-being programs
Financial well-being programs such as 401(k) and Employee Share Ownership Plan
Paid time off and paid holidays
Paid parental leave
Family building benefits like adoption assistance, surrogacy, and cryopreservation
Social well-being benefits like subsidized back-up child/elder care and tutoring
Senior Data Engineer designing and optimizing data platforms for clients using Microsoft Azure, Microsoft Fabric, Power BI, and Databricks. Working closely with clients to deliver scalable solutions.
Data Engineer providing technical expertise on mission - critical NAVSUP OIS program. Work involves data architecture and database management in AWS GovCloud environments.
Senior Data Engineer focusing on data infrastructure for an AI - driven insurtech startup based in Nepal. Collaborating with teams to optimize data models and maintain data quality.
Senior Professional Consultant leading architecture and design for SAP BW and SAC solutions at Freudenberg. Collaborating with stakeholders and optimizing performance of data landscapes.
Senior Data Engineer designing and managing data architectures to transform large - scale data into insights for Humana. Involves leading technical discussions and implementing best data practices.
Data Engineer II at Early Warning Services developing data science tools and infrastructure. Collaborating on software enhancements and mentoring interns in a hybrid work environment.
Senior Data Architect responsible for optimizing data architecture and supporting data - driven business decisions at TruStage. Leading technical guidance for data architecture and cross - functional team collaboration.
Senior Data Architect developing data architecture plans at The Hartford, collaborating with internal teams to align data standards and practices. Leading complex solutions with a focus on operational effectiveness.
Senior Solution Architect defining architecture framework for SA‑CCR in regulatory risk. Collaborating with stakeholders to ensure compliance and efficient data governance.