AWS Glue Data Engineer at DeepLight AI responsible for data ingestion and pipeline performance optimisation. Collaborate with teams to build scalable solutions in a hybrid work environment.
Responsibilities
***Your responsibilities as the AWS Glue Data Engineer will include:***
**Data Ingestion Development**
Building and implementing AWS Glue jobs for Bronze layer ingestion using defined standards and templates.
Implementing correct loading methods based on source requirements (CDC, full load, delta, snapshot).
Designing and executing historical loading mechanisms to bring legacy data into the Lakehouse.
**Performance Optimisation**
Optimising Glue job performance (DPU allocation, parallelization, partitioning) according to best practices.
Collaborating with platform teams to ensure tooling and optimization alignment.
**Migration & Automation**
Aggressively migrating source tables to Bronze layer, initially using manual approaches with standards/templates, later leveraging AI-enabled acceleration.
Ensuring jobs are version-controlled and production deployment is automated via Git and Terraform.
**Governance & Monitoring**
Implementing source system connectivity into CDP in collaboration with source system owners.
Ensuring jobs comply with data contracts and are properly monitored.
Preparing documentation and handover to operational support teams.
**Collaboration**
Working closely with Data Architect for ingestion patterns and standards.
Coordinating with Data Assurance Lead to apply quality checks across all jobs.
Partnering with platform engineers for tooling and optimisation.
Requirements
***You will have experience in:***
AWS Glue, PySpark, and ETL pipeline development;
substantial knowledge of Lakehouse architecture and Medallion design principles;
familiarity with CDC, delta loads, and historical data ingestion strategies; and;
5+ years experience in data engineering roles, with hands-on experience in AWS Glue.
***You should also have knowledge of:***
AWS services: Glue, S3, Athena, Lambda;
Git, Terraform for CI/CD automation;
data quality frameworks (e.g., Soda Core);
identifying ways to automate their work / repetitive tasks;
working in a fast-paced environment and deliver aggressive migration targets;
collaborating and communication with different stakeholder levels; and;
working with Jira and agile way of working.
Benefits
**Benefits & Growth Opportunities:**
· Competitive salary and performance bonuses
· Comprehensive health insurance
· Professional development and certification support
· Opportunity to work on cutting-edge AI projects
· Flexible working arrangements
· Career advancement opportunities in a rapidly growing AI company
Data Engineer building modern Data Lake architecture on AWS and implementing scalable ETL/ELT pipelines. Collaborating across teams for analytics and reporting on gaming platforms.
Chief Data Engineer leading Scania’s Commercial Data Engineering team for growing sustainable transport solutions. Focused on data products and pipelines for BI, analytics, and AI.
Data Engineer designing and building scalable ETL/ELT pipelines for enterprise - grade analytics solutions. Collaborating with product teams to deliver high - quality, secure, and discoverable data.
Entry - Level Data Engineer at GM, focusing on building large scale data platforms in cloud environments. Collaborating with data engineers and scientists while migrating systems to cloud solutions.
Data Engineer responsible for data integrations with AWS technology stack for Adobe's Digital Experience. Collaborating with multiple teams to conceptualize solutions and improve data ecosystem.
People Data Architect designing and managing people data analytics for Gen, delivering actionable insights for HR. Collaborating across teams to enhance data - driven decision - making.
Data Engineer role focused on shaping future connectivity for customers at Vodafone. Involves solving complex challenges in a diverse and inclusive environment.
VP, Senior Data Engineer responsible for designing and developing cloud data solutions for insider risk in Information Security at SMBC. Collaborating with multiple teams to enhance cybersecurity data platform.
Data Engineer responsible for architecting, developing, and maintaining Allegiant’s enterprise data infrastructure. Overseeing transition to cloud hosted data warehouse and developing next - generation data tools.
Senior Data Engineer developing Azure - based data solutions for clients in the Data & AI department. Collaborating with architects and consultants to enhance automated decision making.