Junior Data Engineer developing scalable data pipelines and ETL processes for Capgemini. Collaborating with data scientists and analysts to maintain data integrity and compliance.
Responsibilities
Design, develop, and maintain scalable data pipelines and ETL processes using Databricks
Design and develop Python scripts for data transformation, automation, and integration tasks
Develop and optimize SQL queries for data extraction, transformation, and loading
Collaborate with data scientists, analysts, and business stakeholders
Ensure data integrity, security, and compliance with organizational standards
Participate in code reviews and contribute to best practices in data engineering
Requirements
3-5 years of professional experience in data engineering or related roles
Strong proficiency in Databricks (including Spark-based data processing)
Strong programming skills in Python
Advanced knowledge of SQL for querying and data modeling
Familiarity with the Azure cloud and Azure Data Factory (ADF)
Understanding of ETL frameworks, data governance, and performance tuning
Knowledge of CI/CD practices and version control (Git)
Exposure to BI tools (Power BI, Tableau) for data visualization
Benefits
Flexible work
Healthcare including dental, vision, mental health, and well-being programs
Financial well-being programs such as 401(k) and Employee Share Ownership Plan
Paid time off and paid holidays
Paid parental leave
Family building benefits like adoption assistance, surrogacy, and cryopreservation
Social well-being benefits like subsidized back-up child/elder care and tutoring
Data Engineer developing sustainable data assets for machine learning and analytics solutions. Collaborating with teams and using modern technologies in a hybrid work setting.
Data Engineer role focusing on integrating SAP data sources and maintaining data platforms using Microsoft Fabric. Collaborating with teams to build scalable data solutions and ensure data quality.
Data Engineer Developer focused on data collection and analysis at Dev4Side Software. Collaborating on diverse projects with Italian and European partners.
Senior Manager Data Engineering spearheading cloud-native solutions and operational excellence at Reckitt. Championing data engineering practices and mentoring teams to drive impactful business outcomes.
Senior Data Engineer delivering creative solutions for market-leading clients in life sciences and retail. Supporting teams with data initiatives and mentoring engineering teams for project success.
Early career technologist working on high impact projects at Fidelity Investments. Collaborating with experts to develop automation solutions using modern technologies.
Data Engineer at Octopus Energy improving data systems and applications for the Energy Markets team. Responsible for critical data applications, databases, and pipeline solutions.
Data Engineer supporting ingestion of mission-critical data into big data environment at IAMUS Consulting. Leading department with extensive experience in software and data engineering.
Data Engineer developing data solutions for Travelers' analytics landscape. Building pipelines and supporting AI and business intelligence initiatives.
Data Scientist III leveraging sophisticated analytics and data science solutions for business improvement at Truist. Leading projects and providing actionable insights in a collaborative environment.