Data Engineer developing sustainable data assets for machine learning and analytics solutions. Collaborating with teams and using modern technologies in a hybrid work setting.
Responsibilities
Participate in developing high quality, scalable software modules for next generation analytics solution suite
Engage in activities with cross-functional IT unit stakeholders (e.g., database, operations, telecommunications, technical support, etc.)
Formulates logical statements of business problems and devises, tests and implements efficient, cost-effective application program solutions
Identify and validate internal and external data sources for availability and quality
Work with SMEs to describe and understand data lineage and suitability for a use case
Create data assets and build data pipelines that align to modern software development principles for further analytical consumption
Perform data analysis to ensure quality of data assets
Perform preliminary exploratory analysis to evaluate nulls, duplicates and other issues with data sources
Assist in developing code that enables real-time solutions to be ingested into front-end systems and platforms
Produce code artifacts and documentation using GitHub for reproducible results and hand-off to other data science teams.
Requirements
2+ years of relevant experience recommended
Bachelor’s degree in Computer Science, Engineering, IT, Management Information Systems, or a related discipline
Experience in Python and SQL
Experience in ingesting data from a variety of structures including relational databases, Hadoop/Spark, cloud data sources, XML, JSON
Experience in ETL concerning metadata management and data validation
Experience in Unix and Git
Experience in Automation tools (Autosys, Cron, Airflow, etc.)
Exposure to AWS or GCP services a plus
Experience with Cloud data warehouses, automation, and data pipelines (i.e. Snowflake, Redshift) a plus
Experience with ELT tools (i.e. DBT, Talend) a plus
Able to communicate effectively with both technical and non-technical teams
Able to translate complex technical topics into business solutions and strategies
Candidate must be authorized to work in the US without company sponsorship.
Benefits
Other rewards may include short-term or annual bonuses
Data Engineer II leading development and delivery of data pipelines for Syneos Health. Collaborating with teams to optimize data processing and integrate solutions into production environments.
Lead Data Engineer overseeing data operations and analytics engineering teams for OneOncology. Focused on operational excellence in data platform and model reliability for cancer care improvement.
Senior AWS Software Data Engineer at Boeing focusing on AWS Data services to support digital analytics capabilities. Collaborating with cross - functional teams to design, develop, and maintain software data solutions.
Senior Data Engineer designing and improving software for business capabilities at Barclays. Collaborating with teams to build a data and intelligence platform for Equity Derivatives.
Senior AI & Data Engineer developing and implementing AI solutions in collaboration with clients and teams. Working on projects involving generative AI, predictive analytics, and data mastery.
Consultant driving IA business growth in Deloitte's Artificial Intelligence & Data team. Delivering innovative solutions using data analytics and automation technologies.
Data Engineer responsible for managing data architecture and pipelines at Snappi, a neobank. Collaborating with teams to enable data processing and analysis in innovative banking solutions.
Data Engineer at Destinus developing the data platform to support production and analytics needs. Involves migrating Excel sources to Lakehouse and integrating ERP systems in a hybrid role.
Senior Data Engineer developing solutions within the Global Specialty portfolio at an insurance company. Engaging with diverse business partners to ensure high quality data reporting.
Data Engineer at UBDS Group focusing on designing and optimizing modern data platforms. Collaborating in a multidisciplinary team to develop reliable data assets for analytics and operational use cases.