Data Engineer Intern for Cognira involved in deploying and managing data pipelines and architecting a file-ingestion API. Opportunity to gain experience in data science and engineering processes.
Responsibilities
Design and implement a modular ETL pipeline on Databricks and enable parameterized, YAML-driven deployments using Databricks Asset Bundles.
Implement Spark performance optimizations and CI/CD to promote pipelines across environments.
Build a programmatic deployment and management layer for Databricks using the Databricks REST API to create/configure clusters, jobs, and notebooks dynamically and securely.
Architect and implement a secure, scalable file-ingestion API that provides validation, auto-renaming, manifest generation, and reliable transfer to cloud storage (with full traceability).
Requirements
Excellent academic record in Computer Science, Engineering, or a related field.
Problem-solving is your jam, and you're all about critical thinking.
You're not afraid to roll up your sleeves and get stuff done, even when you're working on your own with minimal supervision.
You can juggle multiple projects like a pro.
Challenges don't scare you; in fact, you love diving into them.
You can communicate like a champ, whether it's writing reports or presenting in a room full of people.
You're curious, and you love picking up new skills & technologies.
You're a team player, always up for sharing your ideas and best practices.
Benefits
Great company culture.
"Learn and Share" sessions.
You'll get support from your mentors.
Social events and after-work gatherings.
A flexible and fun work environment.
Casual dress code.
You'll work with a cool team!
We respect your ideas, and we're all about trying new things.