Data Engineer Intern supporting Intrepid's Martech solutions for ecommerce brands while gaining hands-on experience with data systems. Collaborate with experienced team members to enhance data collection and integration processes.
Responsibilities
Support the Tech team in building and improving Intrepid Martech , our core marketing and data solution for brands.
Work closely with experienced Data Engineers, BI specialists, and Product team members to gain hands-on experience with real production data systems.
Help collect, process, and integrate data from multiple external sources into our data platforms, while learning best practices in data architecture, data quality, system reliability, and monitoring.
Assisting in the design and development of crawler systems and data pipelines to collect data from e-commerce platforms, including login-required sources and open APIs.
Supporting the maintenance and continuous improvement of existing data pipelines, scraping workflows, and ingestion processes.
Testing, validating, and checking data from APIs and crawling processes to ensure accuracy, consistency, and data quality.
Assisting in monitoring data flows, identifying issues, and troubleshooting basic problems under the guidance of senior engineers.
Working closely with team leaders and following technical standards and guidelines to deliver assigned tasks effectively.
Documenting technical processes and contributing to internal knowledge sharing within the team.
Requirements
Careful, diligent, and responsible, with good teamwork and communication skills.
Currently pursuing or recently graduated with a degree in Information Technology, Software Engineering, Computer Science, or Data Science.
Basic understanding of backend development , data pipelines , and Linux environments.
Solid foundation in SQL/NoSQL , including writing queries, understanding table schemas, and basic query and table optimization.
Familiarity with data orchestration and scheduling tools such as Airflow (concepts, DAGs, task dependencies).
Basic knowledge of HTTP, HTML, JavaScript, and networking concepts.
Basic understanding of distributed systems and modern data infrastructure components , including: Message queues and streaming platforms such as RabbitMQ and Kafka.
In-memory data stores and caching mechanisms such as Redis.
Search and analytics engines such as Elasticsearch.
Familiarity with Golang is preferred; exposure to Node.js , Python , and web scraping or browser automation tools (e.g. Puppeteer) is a plus.
Basic experience with Linux command-line tools.
Familiarity with version control systems such as Git.
Exposure to CI/CD concepts and pipelines (e.g. GitLab CI/CD) is a plus.
Willingness and ability to learn cloud-based and containerized technologies , such as Kubernetes (K8s) , Cloud Functions , Cloud Storage , and BigQuery.
Basic awareness of AI-assisted development tools and AI agents , with an understanding of how to use them responsibly and effectively to support development, debugging, and learning.
Ability to review and evaluate AI-generated outputs , ensuring correctness, code quality, and alignment with existing systems before applying them.
Strong attention to code quality and maintainability , avoiding unnecessary complexity, redundant code, or “noise” in the codebase.
Eager to learn and work in an Agile development environment.
Interest in data collection, crawling/scraping, browser automation, and reverse engineering is an advantage.
Good written and spoken English communication skills.
Passion for ecommerce, data, and technology-driven solutions is a plus.
Benefits
The opportunity to contribute to the operating system for digital commerce in South East Asia , covering key platforms and functionalities across Middleware, Martech, Data & Analytics, and more.
Hands-on exposure to real-world SaaS platforms and large-scale data systems built on modern, scalable infrastructure.
The chance to work within a mature, enterprise-grade Tech team that has been operating since 2017, with well-established processes and best practices—while still maintaining a non-hierarchical, entrepreneurial, and collaborative culture.
Close collaboration with business stakeholders, ensuring your technical work has real-world impact and visibility.
Good ideas are encouraged, ownership is valued, and initiative is rewarded.
You are empowered to shape both your role and your growth.
You will have access to structured training through Intrepid Academy , coaching, and real-world project experience that accelerates your professional growth.
We offer a competitive internship allowance and a supportive environment that values your contributions and development.
Senior Manager leading a team of database engineers to manage CCC's data platform. Overseeing mission - critical applications and collaborating with cross - functional teams in a hybrid environment.
As a Principal Data Architect at Solstice, lead the design and implementation of data architecture solutions. Ensure data integrity, security, and accessibility to meet strategic organizational goals.
Data Platform Specialist overseeing data workflows and enhancing data quality for Stackgini's AI - driven IT solutions. Collaborating with teams to drive improvements and stakeholder support.
Data Engineer designing data pipelines in Python for a major railway industry client. Collaborate with Data Scientists and ensure code quality with agile methodologies.
Senior Data Engineer responsible for building and optimizing data pipelines for banking analytics initiatives. Collaborating with data teams to ensure data quality and readiness for enterprise use.
Senior Data Engineer developing scalable data solutions on Databricks for analytics and operational workloads. Collaborating with cross - functional teams to modernize the data ecosystem.
Data Engineer focused on analytics and data pipeline development for network optimisation. Collaborating with teams to deliver high - quality data solutions with Python and SQL.
Senior Product Manager defining platform capabilities for Data Cloud in Salesforce. Collaborating with R&D teams while shaping product strategy for Data 360 integration.
Senior Data Engineer at Goodwin enhancing data platforms and fostering data - driven culture across teams. Collaborating with IT and Finance on technology solutions and data governance practices.
Director, Data Platform Design and Strategy at MedImpact leading data platform and AI innovations to enhance healthcare services. Overseeing enterprise projects and managing teams to meet strategic goals.