Data Engineer in Veepee's Data Factory, working on data ingestion pipelines and improving data quality. Collaborative environment using Kubernetes, Python, Java, and modern data architectures.
Responsibilities
The work‑study student will contribute to stabilizing and industrializing the data ingestion platform to ensure a solid foundation for all Data use cases at Veepee.
Activity 1: Stabilization of the ingestion stack
Add unit tests to existing pipelines
Improve test coverage
Identify fragile areas
Activity 2: Continuous improvement of the Agate platform
Refactor technical components
Participate in performance optimization
Contribute to error handling and retry mechanisms
Activity 3: Data quality & reliability
Implement automated checks
Contribute to quality metrics
Participate in technical monitoring
Activity 4: Documentation & industrialization
Write technical READMEs
Formalize best practices
Assist in standardizing new pipelines
Activity 5: Contribution to strategic projects
Contribute to the dual run (BigQuery ↔ new stack)
Support Data Governance / Data Science / Analytics teams
Requirements
🎓 Education
Bachelor's (BSc) to Master's (MSc) level in Computer Science or Data, or an engineering school curriculum