Senior Data Engineer responsible for building and maintaining scalable data ingestion systems for healthcare data. Optimizing data pipelines and collaborating with analytics engineers in a fast-growing environment.
Responsibilities
The Senior Data Engineer is responsible for building and maintaining scalable data ingestion infrastructure and operational systems.
Develop and optimize scalable data ingestion pipelines from platform sources (RDS, DynamoDB) into Snowflake.
Building event-driven pipelines using Kinesis, Airbyte, or other open-source frameworks to handle high-volume healthcare data.
Implementing and maintaining a staging-layer architecture that supports the broader medallion (staging → intermediate → marts) structure.
Creating configuration-driven, containerized toolsets (Docker/Kubernetes) to ensure data solutions are portable and maintainable.
Ensuring data reliability by building comprehensive monitoring, alerting, and automated testing for all ingestion processes.
Collaborating with analytics engineers to streamline the flow of data for dbt transformation.
Applying software engineering best practices, including modular design and test-driven development, to all data infrastructure.
Refactoring existing ingestion processes to improve performance, cost-efficiency, and scalability.
Mentoring mid-level and junior engineers through code reviews and sharing best practices in data operations.
Requirements
4-6+ years of professional experience in data engineering with a focus on data ingestion and infrastructure.
Proficiency in Python and SQL, with a track record of building production-grade data pipelines.
Strong experience with ingestion tools such as Kinesis, Airbyte, Kafka, or similar frameworks.
Hands-on experience with Snowflake and moving data from operational databases (RDS, DynamoDB) to cloud data warehouses.
Solid understanding of AWS services (S3, Lambda, Step Functions, RDS).
Experience with containerization (Docker) and deploying maintainable systems.
Knowledge of ELT patterns, specifically supporting analytics engineering workflows and dbt.
Experience with CDC (Change Data Capture) and incremental processing methodologies.
Detail-oriented mindset regarding data privacy and compliance (HIPAA experience is a plus).
Strong communication skills, with the ability to collaborate effectively across data science and engineering teams.
Senior Data Engineer building and maintaining robust data pipelines for various data products at Beep Saúde. Collaborating within the team and leading data governance practices.
Software Developer in Test working on cloud - based data platform at Tecsys. Ensuring quality and reliability of data pipelines and transformations using automation frameworks.
Data Engineer responsible for designing, building, and optimizing data pipelines and architectures in a tech environment. Requires extensive experience with modern data warehousing and cloud platforms.
Lead Data Engineer role at Brillio focusing on AI & Data Engineering with expertise in Azure and MS Fabric. Collaborate within the Data Engineering team in Pune, Maharashtra, India.
Data Architect at Whiteshield designing scalable, secure data architectures for national and enterprise transformation programs. Architecting modern data platforms to support analytics, AI and operational use cases.
Data Engineer managing scalable data ecosystems for actionable business intelligence and cross - functional stakeholder collaboration. Optimizing ETL/ELT pipelines and ensuring data integrity and security.
Data Engineer specializing in data architecture and solutions for a banking environment, driving value for customers through innovative engineering practices and technologies in data management.
Technical Lead for data engineering and reporting in healthcare technology at Dedalus. Shaping innovative software solutions and leading cross - functional technical teams in Australia.
Senior ML Data Engineer working on data pipeline curation for Mobileye's autonomous vehicle dataset. Collaborating across teams to enhance ML engineering and vision model applications.
Data Engineer managing customer datasets to enhance industrial research and development. Responsible for ETL pipelines and data ingestion for the Uncountable Web Platform.