Manager in Data Science for Oncology, supporting data workflows and engineering requirements in R&D. Collaborate with technical teams to deliver AI-ready data applications for innovative medicine.
Responsibilities
Serve as both a people leader and a hands-on contributor, designing, developing and maintaining data pipelines for acquiring, managing and storing Oncology R&D data from diverse sources (e.g. biomarker labs, real-world data sources, pre-clinical applications).
Work closely with Data Science and Oncology R&D partners to understand, document and prioritize business requirements. Translate these business needs into high-quality data products.
Work closely with other technical leaders, such as Ontology and Knowledge Graph Engineers, to design and deliver future-proof, AI-ready data systems aligned with Oncology R&D business needs.
Develop Oncology R&D-specific data repositories by implementing standard enterprise-level data models and creating new data models as needed.
Leverage cloud-based technology platforms to accomplish goals, such as building and maintaining data repositories using AWS S3.
Create and optimize data flows for structured and unstructured data using technologies such as Python, R, SQL, AWS services and other relevant tools.
Implement quality and performance standards and measure KPIs to determine accuracy and consistency.
Leverage and implement data versioning and lineage tracking to support data traceability and compliance, and maintain documentation for data architectures and workflows.
In adherence to internal standards, implement software development best practices such as code versioning and DevOps.
Requirements
Advanced degree (Master’s or equivalent) in Computer Science, Engineering, Life Sciences, or other relevant field is strongly preferred. (Bachelor’s Degree with experience equivalency may be considered.)
5+ years of experience in data engineering, including data modeling and database design, preferably in the healthcare industry.
2+ years of experience managing a technical team responsible for delivering data systems, preferably in the healthcare industry.
Proficiency in data engineering tools such as Python, R and SQL for data processing, as well as cloud architecture (e.g. AWS services such as Redshift, FSx, Glue and Lambda).
Experience with unstructured database technologies (e.g. NoSQL) as well as other database types (e.g. Graph).
Strong skills in analysis, problem-solving, organizational change, project delivery, and managing external vendors.
Proven record leading improvement initiatives with multi-disciplinary and remote partners.
Demonstrated stakeholder management capabilities, including requirements gathering, business analysis and planning.
Must have the capacity to translate discussions into user requirements and project plans.
Ability to manage numerous projects simultaneously, prioritize work, and exhibit the organizational skills and flexibility needed to deliver maximum business value.
Willingness to conduct periodic travel (<15% of time) to conferences and internal meetings.
Benefits
Medical, dental, vision, life insurance, short- and long-term disability, business accident insurance, and group legal insurance
401(k)
Vacation – up to 120 hours per calendar year
Sick time – up to 40 hours per calendar year
Holiday pay, including Floating Holidays – up to 13 days per calendar year
Work, Personal and Family Time – up to 40 hours per calendar year
Data Lead at Bifrost Studios building core data and analytics systems for new ventures. Collaborating with founders to streamline operations and establish scalable infrastructures.
Data Science Engineer building the software and data infrastructure for media advertising at Medialab. A 1-year university placement role focusing on AI and automation in data science.
Data Scientist for climate action progress tracking at C40. Analyzing climate-related data and managing data warehouses for performance measurement across global cities.
Product Data Scientist responsible for co-building an Agentic AI framework. Collaborating with product teams while working within a hybrid model in Warsaw, Poland.
Staff Data Scientist at Clio developing scalable ML systems and strategic experimentation frameworks. Leading high-impact modeling projects and mentoring junior team members.
Manager, Data Science and AI delivering actionable insights and AI-powered analytics tools for Pfizer’s Commercial organization. Leading execution of AI/ML models and facilitating communication of data-driven insights.
Data Scientist at Votorantim Cimentos focusing on Sales & Marketing data initiatives and model development. Collaborating with stakeholders to deliver effective data solutions.
Senior Data Scientist at INSZO applying data-driven solutions for improved patient outcomes in healthcare. Working on complex challenges using large datasets and innovative algorithms.
Data Scientist leveraging expertise in statistics, machine learning, and AI at Simplot. Contributing to diverse, challenging projects to create business value.
Data Scientist analyzing data for informed decision-making at PayPal. Collaborating cross-functionally to enhance process quality and effectiveness in payment solutions.