Associate Data Engineer designing, building, and maintaining data solutions at Manulife. Collaborating with Data Scientists and Engineers to enhance data workflows and model deployment.
Responsibilities
Designs, builds, and maintains reliable, efficient and scalable data infrastructure for data collection, storage, transformation, and analysis.
Implements data orchestration pipelines, data sourcing, cleansing, augmentation, and quality control processes.
Works with business and technology collaborators to grasp current and future data infrastructure needs.
Designs, builds and maintains scalable data solutions including data pipelines, data models, and applications for efficient and reliable data workflow; including those specifically tailored for machine learning workflows.
Builds, implements, and upholds current and upcoming data platforms such as data warehouses, repositories for structured and unstructured data.
Collaborates with Data Scientists and Engineers to create features and pre-process data for ML models and move data analysis models into production.
Designs and develops analytical tools, algorithms, data landscape modernization roadmaps, and programs to support Data Engineering activities like writing scripts and automating tasks.
Applies a variety of data interchange formats to ensure data requirements are met and continuously monitors data integrity across the organization.
Integrates machine learning algorithms into current production systems and workflows, taking into account compatibility with other systems, data sources, and APIs.
Builds and advocates for efficient utilization of data querying APIs to ensure seamless access to organizational data sources.
Evaluates, integrates, and manages tools and frameworks within the data engineering ecosystem, ensuring compatibility and efficiency in model development and deployment.
Designs and promotes data versioning and lineage tracking, including transparency and traceability for data used in ML model training and inference.
Requirements
Knowledge of database systems, data lakes, and NoSQL databases
Knowledge of data warehouse concepts and architectures (e.g., Synapse)
Familiarity with data quality and data modelling tools
Proficiency in using version control systems like Git for managing codebase
Experience with Cloud native data services such as PySpark, Scala, Azure Data Factory and Databricks
Practical experience with big data processing frameworks and techniques such as HDFS, MapReduce, Storage formats (Avro, Parquet), Stream processing
Experience with integrating to back-end/legacy environments
Knowledge of AI model deployment in production environments
Experience handling real-time data for AI Applications
Ability to build and deploy Data Ops and ML Ops Pipelines in Cloud-native environments
Benefits
health, dental, mental health, vision insurance
short- and long-term disability
life and AD&D insurance coverage
adoption/surrogacy and wellness benefits
employee/family assistance plans
retirement savings plans (including pension and employer matching contributions)
customizable benefits
paid time off including holidays, vacation, personal days, sick days
Technical Lead for data engineering and reporting in healthcare technology at Dedalus. Shaping innovative software solutions and leading cross - functional technical teams in Australia.
Senior ML Data Engineer working on data pipeline curation for Mobileye's autonomous vehicle dataset. Collaborating across teams to enhance ML engineering and vision model applications.
Data Engineer managing customer datasets to enhance industrial research and development. Responsible for ETL pipelines and data ingestion for the Uncountable Web Platform.
Data Engineer designing and maintaining scalable data solutions on Databricks for clinical trials. Collaborating with teams to overcome data challenges and ensure the smooth logistics of clinical supplies.
Senior Manager leading a team of database engineers to manage CCC's data platform. Overseeing mission - critical applications and collaborating with cross - functional teams in a hybrid environment.
As a Principal Data Architect at Solstice, lead the design and implementation of data architecture solutions. Ensure data integrity, security, and accessibility to meet strategic organizational goals.
Data Platform Specialist overseeing data workflows and enhancing data quality for Stackgini's AI - driven IT solutions. Collaborating with teams to drive improvements and stakeholder support.
Data Engineer designing data pipelines in Python for a major railway industry client. Collaborate with Data Scientists and ensure code quality with agile methodologies.
Senior Data Engineer responsible for building and optimizing data pipelines for banking analytics initiatives. Collaborating with data teams to ensure data quality and readiness for enterprise use.
Senior Data Engineer developing scalable data solutions on Databricks for analytics and operational workloads. Collaborating with cross - functional teams to modernize the data ecosystem.