Data Scientist internship focusing on developing innovative AI solutions for diverse sectors at SogetiLabs. Collaborating with a research team and contributing to technologically advanced projects.
Responsibilities
You will join the SogetiLabs R&D team composed of researchers and AI experts working across a range of industries.
You will work within one of our research teams and contribute to the development of innovative AI solutions applied to concrete problems in various sectors.
Development of NLP and LLM systems.
Design of machine translation modules to build a sign language interpreter.
Study and exploration of the state of the art in NLP and LLMs, proposing new approaches.
Fine-tuning of models (SFT, RL, policy optimization, evolutionary algorithms).
Multimodal text/vision alignment and continuous performance improvement.
Generation, AI agents, and reasoning.
Development of GenAI modules for content analysis or generation (summaries, reports, web navigation, recommendations, ...).
Implementation of RAG architectures and hallucination-mitigation mechanisms (coherence checks, citations).
Participation in the design of AI agents (Flowise, n8n, ...).
Scientific research.
Analysis of the scientific literature (NLP, multimodality, accessibility, ...).
Testing, validation and critical analysis of developed models.
Contribution to scientific publications, internal reports and technical presentations.
Requirements
Final year of a Master's degree in AI, Computer Science, NLP, Data Science, or Applied Mathematics
Available for a 1-year work-study (alternance) placement
Advanced proficiency in Python and knowledge of NLP and LLMs
Familiarity with at least one of the following tools: PyTorch, TensorFlow, JAX, LangChain, n8n, Flowise
Fundamentals in multimodality and language processing
You are autonomous, scientifically curious, and a team player
Good oral and written English (minimum B2)
Benefits
Continuous learning: benefit from training paths including bootcamps, certifications (Azure, Databricks, Scrum...) and immersive programs such as the GenAI Campus.
Leading on emerging technologies: as the Group’s "technologist" arm, our mission is to explore and test new technologies to identify their potential and find business use cases.
Quality of life at work: enjoy work–life balance, the possibility to work remotely (in France and internationally), and health and wellbeing services (support line, dedicated platform...).
Inclusive environment: join engaged networks such as Women@Capgemini, Parents@Capgemini, OUTfront or CapAbility, and work within an EDGE+ certified environment recognized by the Bloomberg Gender Equality Index.
Happy Trainees: our commitment to young talent is recognized in the HappyTrainees ranking. Interns and work-study students here don’t just come to learn, they come to thrive!