Data Engineer at Vistra designing and maintaining data pipelines for analytics. Collaborating with teams and optimizing data integration using modern cloud technologies.
Responsibilities
Design and implement scalable ETL/ELT pipelines using AWS services including AWS Glue, Lambda, S3, and Step Functions
Build and optimize data integration processes connecting MySQL databases, APIs, and external data sources to analytical systems and data warehouses
Develop automated data quality monitoring, validation, and cleansing processes
Create and maintain data models, schemas, and documentation to support analytics teams
Implement real-time and batch data processing solutions using serverless architectures
Collaborate with development teams to integrate data collection points into Next.js applications and Node.js services
Build and maintain data analytics APIs and services
Monitor data pipeline performance, troubleshoot issues, and implement proactive alerting and logging mechanisms
Design and implement data backup, archival, and disaster recovery strategies
Work with data analysts and business stakeholders to understand reporting requirements
Requirements
Bachelor’s degree in Computer Science, Data Engineering, Mathematics, or a related technical field
4-6 years of hands-on data engineering experience with strong proficiency in Python for data processing, transformation, and pipeline development
Extensive experience with AWS data services including AWS Glue, Lambda, S3, Athena, Redshift, and Kinesis for building serverless data pipelines
Strong SQL skills and experience with MySQL database design, optimization, and administration including performance tuning and query optimization
Experience with data pipeline orchestration tools such as Apache Airflow, AWS Step Functions, or similar workflow management systems
Proficiency in data formats including JSON, CSV, Parquet, and Avro
Knowledge of data warehousing concepts, dimensional modeling, and analytics best practices for supporting business intelligence requirements
Experience with version control systems, CI/CD pipelines, and infrastructure as code practices for deploying and managing data infrastructure
AWS certifications such as AWS Certified Data Analytics Specialty or AWS Certified Solutions Architect
Experience with streaming data technologies including Apache Kafka, AWS Kinesis, or real-time data processing frameworks
Knowledge of machine learning workflows and experience building data pipelines that support ML model training and inference
Familiarity with business intelligence tools such as Tableau, Power BI, or AWS QuickSight for creating data visualizations and dashboards
Experience with containerization technologies like Docker and orchestration platforms for deploying data processing applications
Understanding of data governance, privacy regulations, and security best practices for handling sensitive data in cloud environments
Experience with NoSQL databases such as DynamoDB, MongoDB, or Elasticsearch for handling unstructured data and high-volume analytics workloads
Benefits
Flexible hybrid working arrangement
Birthday leave
Comprehensive medical insurance and dental coverage
Wellness allowance
Competitive annual leave entitlement
Internal mentorship program
Reimbursement of professional membership fees for certifications
Software Engineer at Warner Music Group developing an innovative Data Platform for the music industry. Collaborating with dynamic teams to enhance music data processing and delivery.
Data Engineer role specializing in Azure & Snowflake at InfoCentric. Leading design and delivery of enterprise-scale data platforms for large organizations.
Principal Data Architect at PointClickCare ensuring coherent and scalable data architecture. Driving unified data direction while collaborating with Engineering Architecture team for AI enablement.
Data Engineer Tech Lead developing data solutions at Carelon. Leading a cross-functional team to optimize data workflows and maintain data integrity.
Lead Data Engineer responsible for evolving Manna’s data infrastructure for drone delivery. Overseeing data architecture and analytics while building scalable data pipelines.
Data Engineer designing, implementing, and optimizing data pipelines for DeepLight AI. Collaborating closely with a multidisciplinary team to analyze large-scale data.
Data Engineer designing and maintaining scalable ETL pipelines at Satori Analytics. Collaborating with teams to deliver high-quality analytics solutions across various industries.
Data Architect responsible for defining enterprise data architecture on AWS and Databricks Lakehouse platforms. Enabling scalable data lakes and enterprise analytics for financial services organizations.
Data Platform Operations Support leading data engineering strategy across projects for EXL. Driving innovation and optimization while collaborating with various teams in the organization.
Manager II leading data engineering projects at Navy Federal Credit Union. Overseeing data governance and quality initiatives while managing engineering teams in a hybrid work environment.