AWS Glue Data Engineer at DeepLight AI responsible for data ingestion and pipeline performance optimisation. Collaborate with teams to build scalable solutions in a hybrid work environment.
Responsibilities
***Your responsibilities as the AWS Glue Data Engineer will include:***
**Data Ingestion Development**
Building and implementing AWS Glue jobs for Bronze layer ingestion using defined standards and templates.
Implementing correct loading methods based on source requirements (CDC, full load, delta, snapshot).
Designing and executing historical loading mechanisms to bring legacy data into the Lakehouse.
**Performance Optimisation**
Optimising Glue job performance (DPU allocation, parallelisation, partitioning) according to best practices.
Collaborating with platform teams to ensure tooling and optimisation alignment.
**Migration & Automation**
Aggressively migrating source tables to the Bronze layer, initially using manual approaches with standards/templates, later leveraging AI-enabled acceleration.
Ensuring jobs are version-controlled and production deployment is automated via Git and Terraform.
**Governance & Monitoring**
Implementing source system connectivity into CDP in collaboration with source system owners.
Ensuring jobs comply with data contracts and are properly monitored.
Preparing documentation and handover to operational support teams.
**Collaboration**
Working closely with Data Architect for ingestion patterns and standards.
Coordinating with Data Assurance Lead to apply quality checks across all jobs.
Partnering with platform engineers for tooling and optimisation.
Requirements
***You will have experience in:***
AWS Glue, PySpark, and ETL pipeline development;
substantial knowledge of Lakehouse architecture and Medallion design principles;
familiarity with CDC, delta loads, and historical data ingestion strategies; and
5+ years' experience in data engineering roles, with hands-on experience in AWS Glue.
***You should also have knowledge of:***
AWS services: Glue, S3, Athena, Lambda;
Git, Terraform for CI/CD automation;
data quality frameworks (e.g., Soda Core);
identifying ways to automate your own work and repetitive tasks;
working in a fast-paced environment and delivering against aggressive migration targets;
collaborating and communicating with stakeholders at different levels; and
working with Jira and agile ways of working.
Benefits
**Benefits & Growth Opportunities:**
· Competitive salary and performance bonuses
· Comprehensive health insurance
· Professional development and certification support
· Opportunity to work on cutting-edge AI projects
· Flexible working arrangements
· Career advancement opportunities in a rapidly growing AI company