Associate Data Scientist leading development and implementation of advanced data engineering solutions for Generative AI at Capgemini. Collaborating on data architectures and driving innovation in machine learning capabilities.
Responsibilities
Lead the development and implementation of advanced data engineering solutions to support the deployment and optimization of Generative AI models
Design robust, scalable, and innovative data architectures that align with the unique requirements of Generative AI (GenAI) applications
Own architectural design and planning, advanced data pipelines, model integration and optimization, scalability and performance, and research and innovation in support of production generative AI systems
Build and maintain data engineering solutions on cloud platforms using hyperscaler services
Develop production-grade cloud (AWS/Azure/GCP) infrastructure that supports the deployment of ML applications, including drift monitoring
Design, develop, and maintain data pipelines to efficiently collect, process, and load data from various sources into data storage systems (e.g., data warehouses, data lakes)
Requirements
Bachelor's degree in computer science, data engineering, or a related field with 5+ years of experience (Master's preferred)
Proven experience in data engineering, MLOps, ETL, and database management
Strong understanding of fundamental data science concepts in NLP, including selection and understanding of embedding models
Experience with cloud platforms (AWS, Azure, or GCP)
Proficiency in Azure, Python, Java, or Scala
Experience with data warehousing platforms (e.g., Databricks, Amazon Redshift, Snowflake) and big data technologies (e.g., Hadoop, Spark)
Experience with highly scalable data stores, data lakes, data warehouses, lakehouses, and unstructured datasets
Benefits
Paid time off based on employee grade (A-F), as defined by policy. Vacation: 12-25 days, depending on grade
Company-paid holidays
Personal days
Sick leave
Medical, dental, and vision coverage (or provincial healthcare coordination in Canada)
Retirement savings plans (e.g., 401(k) in the U.S., RRSP in Canada)
Life and disability insurance
Employee assistance programs
Other benefits as provided by local policy and eligibility
Analytics Lead managing analytical strategy and translating data into insights at a privacy startup. Collaborating across teams to influence company-level decisions rooted in data.
Data Manager responsible for designing enterprise data management framework for Marco Capital. Engaging stakeholders and overseeing data quality and compliance in a regulated insurance environment.
Data Manager at La Fosse Academy responsible for managing apprenticeship data processes and compliance. Ensuring accuracy and providing insights for effective decision-making in a growing programme.
Data Scientist developing AI-powered solutions for Everflow's SaaS marketing platform. Shaping AI strategy and collaborating with teams in a fast-growing environment.
Director of Data Science at Sam's Club focusing on advanced data science solutions. Leading team collaboration for machine learning and data-driven decision-making.
Senior Data Scientist leading AI/ML initiatives for Marketplace Payments at Walmart. Focused on payments risk analytics and financial decision-making for sellers.
Data Scientist at Booz Allen unlocking secrets in data sets for global challenges. Collaborating with clients to turn data into actionable insights and leading algorithm development.
Senior Data Scientist at Booz Allen working with complex data sets to solve global challenges. Leading data analysis projects and guiding teams to inform decision-making for clients.
Senior Data Scientist at Zappi predicting consumer response to ads and product concepts. Leading the development of custom models and data collection processes for market research.
Senior Data Scientist developing predictive models for consumer response to marketing campaigns. Shaping data collection and analysis to validate models in advertising, product innovation, and brand tracking.