Senior Data Engineer designing complex data systems for Opea, focusing on cloud and big data technologies. Leading a team and optimizing data processing solutions.
Responsibilities
Design and implement complex, scalable data systems using technologies such as cloud computing, big data, streaming, machine learning and AI, defining strategies for data storage, processing and analysis. Incorporate Large Language Models (LLMs), Retrieval-Augmented Generation (RAG) and vector databases for intelligent data processing.
Lead and inspire the data engineering team by sharing knowledge, defining standards and best practices, and fostering the team's technical growth.
Identify and resolve complex problems related to data systems, architecture, performance and scalability, using monitoring tools, log analysis and code debugging.
Create and implement innovative solutions to optimize data systems, leveraging emerging technologies such as machine learning, data lakes, real-time data streaming and big data analytics.
Communicate technical solutions clearly and concisely to stakeholders, managers, developers and other teams, influencing strategic decisions and helping align the data strategy with company objectives.
Requirements
Deep proficiency in Python and data processing tools such as Apache Spark, Kafka, Flink or equivalents.
Knowledge of LLMs, fine-tuning AI models and using RAG to improve search and information retrieval. Proficiency with Amazon Bedrock and/or Azure OpenAI services.
Experience with data system architecture, big data (Hadoop, Hive, HBase), streaming (Kafka, Kinesis) and databases (SQL, NoSQL).
Advanced knowledge of data modeling, data processing and data visualization tools such as SQL, NoSQL, Tableau and Power BI.
Advanced experience building Data Lakes and Data Warehouses using tools like AWS Athena, Amazon Redshift, Amazon S3, AWS Glue, Airbyte, dbt and PostgreSQL.
Deep understanding of machine learning concepts and applying ML techniques within data systems.
Bachelor's degree in Computer Science, Statistics, Mathematics or a related field.
Technical English for reading, writing and professional communication.
Data Platform Specialist overseeing data workflows and enhancing data quality for Stackgini's AI-driven IT solutions. Collaborating with teams to drive improvements and stakeholder support.
Data Engineer designing data pipelines in Python for a major railway industry client. Collaborate with Data Scientists and ensure code quality with agile methodologies.
Senior Data Engineer responsible for building and optimizing data pipelines for banking analytics initiatives. Collaborating with data teams to ensure data quality and readiness for enterprise use.
Senior Data Engineer developing scalable data solutions on Databricks for analytics and operational workloads. Collaborating with cross-functional teams to modernize the data ecosystem.
Data Engineer focused on analytics and data pipeline development for network optimisation. Collaborating with teams to deliver high-quality data solutions with Python and SQL.
Senior Product Manager defining platform capabilities for Data Cloud in Salesforce. Collaborating with R&D teams while shaping product strategy for Data 360 integration.
Senior Data Engineer at Goodwin enhancing data platforms and fostering a data-driven culture across teams. Collaborating with IT and Finance on technology solutions and data governance practices.
Director, Data Platform Design and Strategy at MedImpact leading data platform and AI innovations to enhance healthcare services. Overseeing enterprise projects and managing teams to meet strategic goals.
Data Engineer delivering AI- and data-driven solutions for Honeywell’s industrial customers. Architecting and implementing scalable data pipelines and platforms focused on IoT and real-time data processing.
Data Engineering Associate focusing on data quality control and management for a distribution platform. Collaborating on large-scale data projects to ensure data accuracy and availability for users.