Data Science Engineer working within Early Research to build infrastructure for biological data analysis. Combining data engineering, data science, and machine learning to support scientific discovery and research teams.
Responsibilities
Build and maintain scalable ETL pipelines for experimental, biological, and pharmacological data
Integrate data from multiple sources such as GeneData, LabVantage, CROs, and cloud platforms into unified analytical environments
Automate recurring workflows to ensure data quality, accessibility, and reproducibility
Collaborate with data scientists to implement and operationalize AI and machine learning models into production-ready pipelines on cloud and HPC environments
Monitor and maintain deployed models to ensure performance, scalability, and traceability
Develop scripts, APIs, and workflows to connect laboratory systems such as GeneData Biologics and LabVantage with central analytics environments
Act as the technical link between Discovery, IT, and Data Science teams
Translate scientific and technical needs into robust, scalable solutions
Provide responsive technical support to scientists to ensure data accessibility and workflow reliability
Requirements
MSc in Computer Science, Data Science, Computational Biology, or a related field
Three to five years of relevant experience in data engineering or scientific data environments
Proficiency in Python and version control systems such as Git
Experience with AWS services including S3, Lambda, ECS, Glue, and Step Functions
Strong understanding of data integration, workflow automation, and pipeline design
Excellent communication skills with the ability to explain technical solutions clearly to scientific and technical colleagues
Proactive problem solver who can adapt quickly to changing priorities
Data Scientist responsible for modeling and analyzing credit risk at CAIXA Consórcio. Utilizing data-driven insights to support strategic decision-making in credit operations.
Data Scientist optimizing payments ecosystem for Preply, enhancing user experience through data-driven insights. Collaborating with teams to improve payment processes and fraud management.
Staff Data Scientist at Preply developing data strategies for product domains. Collaborating with executives to drive long-term strategy and experimentation frameworks.
Data Manager leading data strategy and governance for Global Payments Solutions at Bank of America. Managing data architecture aligned with business and regulatory needs while overseeing complex data ecosystems.
Data Scientist developing and implementing LLM-based agents and leveraging AI techniques to improve client value. Collaborating on project challenges in a dynamic, start-up environment at Gartner.
Data Scientist in AI SaaS integrating 100+ systems for a European unicorn-in-the-making. Ensure scalability, security, and performance in a high-growth environment.
Data Science Intern working on AI-driven recipe and hardware optimization problems in semiconductor processes. Developing machine learning models and collaborating with engineering teams for innovative solutions.
Senior Data Scientist at LexisNexis developing AI-driven solutions for legal analytics. Collaborating with teams to implement machine learning models and monitor performance metrics.
Product Analyst for a leading media tech company managing SEO-friendly content and commercial campaigns. Collaborate with teams for digital content production and user engagement analysis.
Data Scientist at Capital One leveraging machine learning models for credit underwriting decisions. Partnering with data scientists, engineers, and product managers using advanced technologies.