Data Scientist internship at SogetiLabs focused on developing innovative AI solutions for diverse sectors. You will collaborate with a research team and contribute to technologically advanced projects.
Responsibilities
You will join the SogetiLabs R&D team composed of researchers and AI experts working across a range of industries.
You will work within one of our research teams and contribute to the development of innovative AI solutions applied to concrete problems in various sectors.
Development of NLP and LLM systems
Design of machine translation modules to build a sign language interpreter.
Study and exploration of the state of the art in NLP and LLMs, proposing new approaches.
Fine-tuning of models (SFT, RL, policy optimization, evolutionary algorithms).
Multimodal text/vision alignment and continuous performance improvement.
Generation, AI agents, and reasoning
Development of GenAI modules for content analysis or generation (summaries, reports, web navigation, recommendations, ...).
Implementation of RAG architectures and hallucination-mitigation mechanisms (coherence checks, citations).
Participation in the design of AI agents (Flowise, n8n, ...).
Scientific research
Analysis of the scientific literature (NLP, multimodality, accessibility, ...).
Testing, validation and critical analysis of developed models.
Contribution to scientific publications, internal reports and technical presentations.
Requirements
Final year of a Master's degree in AI, Computer Science, NLP, Data Science, or Applied Mathematics
Available for a 1-year work-study (alternance) placement
Advanced proficiency in Python and knowledge of NLP and LLMs
Familiarity with at least one of the following tools: PyTorch, TensorFlow, Jax, LangChain, n8n, Flowise
Fundamentals in multimodality and language processing
You are autonomous, scientifically curious, and a team player
Good oral and written English (minimum B2)
Benefits
Continuous learning: benefit from training paths including bootcamps, certifications (Azure, Databricks, Scrum...) and immersive programs such as the GenAI Campus.
Leading on emerging technologies: as the Group’s "technologist" arm, our mission is to explore and test new technologies to identify their potential and find business use cases.
Quality of life at work: enjoy work–life balance, the possibility to work remotely (in France and internationally), and health and wellbeing services (support line, dedicated platform...).
Inclusive environment: join engaged networks such as Women@Capgemini, Parents@Capgemini, OUTfront or CapAbility, and work within an EDGE+ certified environment recognized by the Bloomberg Gender Equality Index.
Happy Trainees: our commitment to young talent is recognized in the HappyTrainees ranking — interns and work-study students here don’t just come to learn, they come to thrive!