Data Engineer responsible for developing data solutions and integrating systems for advanced analytics at Lilly. Focusing on data pipelines and solutions ensuring data quality and compliance.
Responsibilities
Engage with business stakeholders to design, develop, and maintain the data pipelines and data solutions that ensure the availability and quality of data sets and actionable insights for the Foundry
Includes data capture, integration, acquisition, contextualization, and harmonization, leading to the delivery of data-as-a-product and reusable data domains and products
Focus on integrating IT/OT systems with cloud data lakehouse architecture (AWS/Azure) to enable advanced analytics and AI/ML capabilities while ensuring data integrity and compliance with relevant regulatory standards and best practices
Work closely with the Data Architect and Data Scientists
Collaborate with business and IT groups beyond the data sphere, understanding the enterprise infrastructure and the many source systems
Requirements
Bachelor’s degree in Computer Science, Data Science, Engineering or related field or equivalent work experience
At least 3 years of experience in several of the following disciplines: statistical methods, data modeling, ETL/ELT, ontology development, semantic graph construction and linked data, relational schema design
1-3 years of experience designing large scale data models for functional, operational, and analytical environments (Conceptual, Logical, Physical & Dimensional)
Demonstrated SQL and data modeling proficiency
Experience with data modeling tools such as, ER*Studio and Erwin or TOAD
Experience with cloud platforms (e.g., AWS, Azure)
Experience with AI/ML/LLM Concepts and tools and building agentic AI solution sets
Experience with data integration such as data streaming, Industrial IOT, using MQTT, AQMP, Kafka and related protocols
Understanding of modern data architecture, data lakehouse, data warehousing and/or big data concepts
Experience with security models and development on large data sets
Experience with multiple database solutions (e.g. Postgres, Redshift, Aurora, Athena, Graph DB like Neptune, No SQL like DynamoDB, MongoDB) and formal database designs (3NF, Dimensional Models)
Experience with Agile Development, CI/CD, Github, Automation platforms
Demonstrated ability to analyze large, complex data domains and craft practical solutions for subsequent data exploitation via analytics
Ability to review and provide practical recommendations on design patterns, performance considerations & optimization, database versions, and database deployment strategies
Knowledgeable in data functions such as Data Governance, Master Data Management, Business Intelligence
Prior work experience working in pharma or other GMP setting
Solid knowledge of Computer System Validation process
Demonstrated ability to analyze, anticipate, and resolve complex issues through sound problem-solving skills
Demonstrated learning agility and curiosity
Desire and ability to communicate using a variety of methods in diverse forums.
Benefits
eligibility to participate in a company-sponsored 401(k)
pension
vacation benefits
eligibility for medical, dental, vision and prescription drug benefits
flexible benefits (e.g., healthcare and/or dependent day care flexible spending accounts)
life insurance and death benefits
certain time off and leave of absence benefits
well-being benefits (e.g., employee assistance program, fitness benefits, and employee clubs and activities)
Data Engineer Consultant building and maintaining enriched data infrastructure for analytical thinking at Northwest Permanente. Involves data collection, cleansing, and transformation for business intelligence.
Vice President - Business, Data Architect role at TD Securities focusing on business data architecture and analytics capabilities. Collaborate with stakeholders to define and govern data models and ensure alignment with strategy.
Staff Data Engineer at Headspace building privacy - first data platforms for mental health support. Leading data engineering strategies and mentoring team members to enhance data - driven decision making.
Senior Data Engineer building and implementing data pipelines at Headspace. Collaborating with analytics and data science teams to enhance personalized mental health support.
Data Engineering Intern working on data pipelines and infrastructure in fast - growing fintech. Collaborating with data engineers, learning best practices and developing data solutions.
Senior Software Engineer building and maintaining data infrastructure for Gusto. Collaborating with Data Science and Business Intelligence teams to achieve their goals.
Data Engineer building and maintaining scalable data pipelines for AI Search Infrastructure at You.com. Collaborating across teams to ensure data quality and enable AI capabilities.
Data Engineer developing and managing technology - based data solutions for clients in different industries in Greece. Participating in software development lifecycle within Agile team setting.
Data Architect leading design and governance of high - quality data architectures for clients. Collaborating with engineering teams and stakeholders to transform business challenges into scalable data solutions.
Data Engineer supporting vehicle buying and selling solutions through integration pipelines. Collaborating with teams to build digital vehicle platforms and optimize data processes in São Paulo.