Transforming business strategies into high-performance solutions using Databricks. Leading data engineering, advanced analytics, and generative AI initiatives in a scalable architecture.
Responsibilities
Lakehouse Architecture: Design and evolve the enterprise platform on Databricks (Delta Lake), establishing ingestion patterns (batch/streaming), storage, and consumption standards.
Governance & Security: Implement governance frameworks using Unity Catalog, ensuring data quality, lineage, security (RBAC/ABAC), and compliance with the LGPD (Brazilian Data Protection Law).
AI & ML Support: Design infrastructure for the Machine Learning lifecycle (MLOps) and GenAI initiatives, supporting everything from feature engineering to LLM deployment.
Data Engineering: Define technical standards for complex pipelines, integrating critical systems such as SAP (S/4HANA) and legacy databases.
Technical Leadership: Act as an advisor for executive decisions and guide engineering teams in applying best practices for versioning and resilience.
Requirements
Proven track record as a Data Architect in complex environments.
Strong command of the Databricks ecosystem (Unity Catalog, Delta Lake, Jobs, Workflows).
Advanced experience with Azure Cloud.
Expertise in data modeling and high-performance SQL.
Clear understanding of scalable pipelines and governance/security patterns.
Ability to translate complex business needs into clear technical diagrams.
Comfortable collaborating across Security, Infrastructure, and Business teams.
Pragmatic innovation mindset: focused on continuous improvement with an emphasis on delivering organizational value.
Preferred qualifications:
Knowledge of MLOps (MLflow, monitoring, and retraining).
Experience with Data Mesh and domain-oriented architectures.
Experience in the Energy or Oil & Gas sectors.
Familiarity with SAP technologies (S/4HANA, Datasphere).
Benefits
Health and dental insurance
Meal and grocery allowance
Childcare allowance
Extended parental leave
Partnerships with gyms and health & wellness professionals via Wellhub (Gympass) / TotalPass
Profit Sharing (PLR)
Life insurance
Continuous learning platform (CI&T University)
Discount club
Free online platform dedicated to promoting physical and mental health and wellbeing
Data Engineer responsible for designing and maintaining enterprise data warehouse for various projects. Collaborating with stakeholders to ensure efficient data flow and integration.
Data Engineer developing and evolving BI platform for Thomson Reuters in India. Building architectural solutions and collaborating in all development lifecycle aspects.
Data Engineer responsible for building and maintaining AWS Lakehouse infrastructure for trade contractors at Remarcable. Focused on clean data architecture and AI/ML data infrastructure.
Data Engineer/Analyst maintaining and improving data infrastructure for Braiins. Collaborating with technical and business teams to ensure reliable data flows and insights.
Medior Data Engineer handling Azure migrations for a major urban mobility client. Focused on data pipeline development and ensuring platform reliability with cutting - edge technologies.
Developing ML and computer vision solutions for cutting - edge autonomous vehicle dataset pipeline at Mobileye. Collaborating across teams for data curation and advanced perception algorithms.
Data Migration Lead in a hybrid role managing data migration for a major transformation programme in the media sector. Collaborating with various teams to ensure data integrity and successful migration.
Consultant ML & DataOps at Smile integrating data science projects for major clients. Designing MLOps solutions and enhancing data governance in a collaborative environment.
Data Engineer developing and maintaining data pipelines for Coolbet’s analytical services. Working within an Agile framework to ensure data reliability and efficiency.