Data Analyst focusing on SQL, developing and maintaining data pipelines using PySpark. Collaborating on data modeling and supporting data warehouses in a dynamic environment.
Responsibilities
We are looking for a qualified Mid-level Data Analyst with a strong foundation in SQL-based development.
This role will focus on building and maintaining data pipelines using PySpark, with SQL as the primary coding language.
The candidate should also have a solid understanding of data modeling frameworks (such as Kimball dimensional modeling) and experience supporting data warehouses and data marts.
Develop and maintain batch data pipelines using PySpark (SQL-focused).
Write and optimize complex SQL queries to support business logic and reporting needs.
Independently understand requirements and translate them into code.
Transform and integrate data from multiple sources into Iceberg tables and Snowflake.
Contribute to the development of data marts and curated datasets for business consumption.
Collaborate with business analysts to understand data needs.
Monitor and manage data jobs running on AWS EMR orchestrated by Airflow, leveraging S3, Glue, and other AWS services.
Ensure data quality, consistency, and performance across the pipeline.
Requirements
Fluent in English
Minimum of 3 years of hands-on experience in data engineering writing complex SQL queries
Proven experience in SQL, including joins, aggregations, window functions, and performance tuning
Hands-on experience with PySpark, particularly Spark SQL
Familiarity with AWS data services (e.g., EMR, S3, Glue)
Understanding of data modeling frameworks, including the Kimball methodology
Experience working with Snowflake or similar cloud data warehouses
Knowledge of Apache Iceberg or similar table formats (e.g., Delta Lake, Hudi)
Benefits
Company-subsidized health plan for the employee.
Option to add dependents to the health plan with payroll discount.
Dental assistance (optional).
Option to add dependents to dental assistance with payroll discount.
Meal or food allowance.
Transportation voucher (optional).
Impact & Care - Personal Guidance Program offering confidential emotional support and counseling (psychological, legal, financial, social, and pet-related) at no cost to the employee and legal dependents.
Gympass - Wellhub (Access to 700+ gyms across Brazil with plans starting at R$29.90, deducted from payroll).
Option to add dependents to Gympass - Wellhub (up to 3 dependents — paid via credit card).
Access to Udemy through our intranet.
Partnerships with major consumer brands.
SESC membership plan for employee and dependents.
Discount agreements with educational institutions (undergraduate and postgraduate) and language/certification schools.
Senior Manager for Payments Data Analytics at FIS analyzing revenue opportunities and presenting insights. Collaborating with technology and data teams to enhance financial services.
Data Analytics Advisor responsible for delivering high - quality analytics and reporting for Cigna's customer experience team. Collaborating with operational leaders and stakeholders to inform data - driven decisions.
Student Assistant at Novo Nordisk maintaining data tools and analyzing performance across global manufacturing sites. Seeking MSc students with an analytical mindset and data - driven insights.
Data Analyst supporting the Business Intelligence & Data Insights team at CBIZ. Involves data analysis, quality assurance, and dashboard development using Alteryx and Power BI.
Consultant Data Analyst handling reporting automation projects at Primexis. Working on data - driven financial decisions with modern techniques and tools.
Data Analysis Intern focusing on data analysis, visualization, and processing at NexThreat LLC. Engaging in insights and trends through HUBZone Internship Program.
Data Analyst supporting development and deployment of analytics at Intermountain Health. Collaborating with business and clinical leaders to improve performance and patient care.
Network Optimisation Data Analyst supporting Australia Post’s network strategy. Delivering data - driven insights and assisting in network modelling for operational improvement initiatives.
Lead Securities Quantitative Analytics Specialist developing Asset Liability Management models for Wells Fargo's Investment Portfolio. Collaborating with business stakeholders and quant teams to deliver innovative solutions.