Data Engineer optimizing data pipelines and cloud solutions for GFT Poland. The role involves performance tuning, ETL pipelines, and data model development, and is based across multiple locations in Poland.
Responsibilities
Your responsibilities will include performance tuning and optimization of existing solutions, building and maintaining ETL pipelines, as well as testing and documenting current data flows
You will also be involved in implementing tools and processes to support data-related projects and promoting the best development standards across the team
Design, build, test and deploy Cloud and on-premise data models and transformations in Cloud Native or dedicated toolset
Optimize data views for specific visualization use cases, using schema design, partitions, indexes, down-sampling, archiving, etc. to manage trade-offs such as performance versus flexibility
Review, refine, interpret and implement business and technical requirements
Contribute to ongoing productivity and prioritization by refining User Stories, Epics and Backlogs in Jira
Onboard new data sources; design, build, test and deploy Cloud data ingest, pipelines, warehouses and data models/products
Requirements
At least 4 years of commercial experience as a Data Engineer
Strong Python and PySpark skills
Strong hands-on experience with SQL and query optimization
Experience with ETL/ELT pipeline development, testing, and management
Strong experience with Hadoop
Understanding of key concepts around Data Warehousing, Data Lakes and Data Lakehouses
Experience with a Cloud data engineering toolset, preferably GCP
Experience with Java/Scala (Nice to have)
Benefits
Hybrid work in one of our locations: Lodz, Poznan, Krakow, Warszawa, Wroclaw (2 office days per week)
Benefit package that can be tailored to your personal needs (private medical coverage, sport & recreation package, lunch subsidy, life insurance, etc.)
Online training and certifications suited to your career path
Access to the e-learning platform Mindgram, a holistic mental health and wellbeing platform
Work From Anywhere (WFA) - the temporary option to work remotely outside of Poland for up to 140 days per year (including Italy, Spain, the UK, Germany, Portugal, and Bulgaria)