Senior Data Engineer at Capgemini designing and optimizing scalable data architectures on Databricks and GCP. Collaborating across teams to transform business needs into reliable technical solutions.
Responsibilities
Design, develop and optimize scalable data architectures on Databricks and/or Google Cloud Platform (GCP).
Implement and productionize robust data pipelines (batch & streaming).
Define and maintain engineering standards: best practices, security, governance, data quality.
Collaborate with Data Science, Product and Architecture teams to translate business requirements into reliable technical solutions.
Ensure performance, availability and resilience of data platforms and processing.
Participate in the migration, modernization or redesign of existing data environments as they move to Databricks or GCP.
Provide technical mentorship to junior Data Engineers and contribute to the team's skill development.
Document solutions and ensure continuous improvement of technical processes.
Requirements
Proven experience (5+ years) as a Data Engineer, ideally in advanced cloud environments.
Proficiency with Databricks technologies (Spark, Delta Lake, Unity Catalog, MLflow) or GCP services (BigQuery, Dataflow, Dataproc, Pub/Sub…).
Strong skills in Python, SQL and distributed frameworks (Spark, Beam…).
Expertise in designing data architectures: Data Lake, Lakehouse, modern Data Warehouse.
Knowledge of CI/CD best practices, versioning and automation (Git, Jenkins, Cloud Build…).
Experience with DevOps/MLOps environments: Docker, orchestration (Airflow, Cloud Composer…), monitoring.
Solid foundations in data security and governance.
Ability to analyze and solve complex problems and propose reliable technical solutions.
Good interpersonal skills, collaborative mindset, autonomy and attention to quality.
Benefits
Quality of work-life: remote work options within Morocco and internationally, autonomy in organizing your daily work, and hybrid assignments according to your preferences.
Continuous learning: access to technology-specific training and certifications, personalized support and a structured onboarding path.
Varied, high-impact projects: work with large accounts across diverse sectors with stimulating business and technological challenges.
Expert ecosystem: benefit from personalized technical support and active integration into our professional communities.