Data Engineer enhancing data infrastructure reliability and efficiency for Love, Bonito. Involves pipeline optimization, data quality assurance, and robust operations.
Responsibilities
Update documentation of existing data pipelines to establish clear operational standards and knowledge transfer
Refactor legacy code to improve scalability, performance and maintainability
Optimise aggregation layers and transformation logic for reporting workloads
Reduce pipeline runtimes through strategic improvements in query optimisation, partitioning and resource allocation (see the first sketch after this list)
Review and enhance data models supporting analytics, reporting and machine learning use cases
Design and implement efficient data structures that balance query performance with storage costs
Collaborate with analytics and data science teams to ensure data models meet downstream requirements
Implement comprehensive data quality checks and validation frameworks within pipelines (see the second sketch after this list)
Develop unit tests for data transformations to ensure correctness and prevent regressions
Monitor data integrity throughout the pipeline lifecycle
Establish and maintain data quality metrics and alerting mechanisms
Troubleshoot pipeline failures and conduct root cause analysis
Design and deploy preventative measures to minimize future incidents
Maintain pipeline uptime targets of 99%
Ensure SLA adherence across critical data workflows
Maintain data security, governance and compliance standards across all data assets
Implement access controls and data lineage tracking
Ensure adherence to regulatory requirements and internal data policies
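To illustrate the partitioning and query-optimisation work named above, here is a minimal PySpark sketch. The bucket paths and column names (event_ts, event_date) are hypothetical placeholders, not details from Love, Bonito's actual pipelines.

```python
# Minimal sketch, assuming a PySpark batch job; paths and columns are hypothetical.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("partitioning-sketch").getOrCreate()

events = spark.read.parquet("s3://example-bucket/raw/events/")  # hypothetical path

# Write the aggregation layer partitioned by date so reporting queries that
# filter on event_date only scan the partitions they need.
(events
    .withColumn("event_date", F.to_date("event_ts"))
    .write
    .partitionBy("event_date")
    .mode("overwrite")
    .parquet("s3://example-bucket/curated/events/"))

# Downstream reads filter on the partition column; Spark prunes partitions
# instead of scanning the full table, which is where the runtime win comes from.
daily = (spark.read.parquet("s3://example-bucket/curated/events/")
         .where(F.col("event_date") == "2024-01-01"))
```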
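And for the data quality checks and transformation unit tests above, a minimal sketch assuming PySpark and pytest; the transformation, column names, and check are illustrative only, not the posting's actual framework.

```python
# Minimal sketch of an in-pipeline quality check and a unit test for a
# transformation; column names are hypothetical.
import pytest
from pyspark.sql import SparkSession, DataFrame
from pyspark.sql import functions as F

def add_order_value(df: DataFrame) -> DataFrame:
    """Example transformation under test: derive order_value."""
    return df.withColumn("order_value", F.col("quantity") * F.col("unit_price"))

def check_not_null(df: DataFrame, column: str) -> None:
    """Fail the pipeline run early if a required column contains nulls."""
    null_rows = df.filter(F.col(column).isNull()).count()
    if null_rows > 0:
        raise ValueError(f"data quality check failed: {null_rows} null(s) in {column}")

@pytest.fixture(scope="session")
def spark():
    # Local Spark session is enough for fast, isolated transformation tests.
    return SparkSession.builder.master("local[1]").appName("tests").getOrCreate()

def test_add_order_value(spark):
    df = spark.createDataFrame([(2, 5.0)], ["quantity", "unit_price"])
    result = add_order_value(df).collect()[0]
    assert result["order_value"] == 10.0
```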
Requirements
Bachelor's degree in Computer Science, Engineering, or a related field
2-4 years of data engineering experience
Proven track record of building and maintaining production data pipelines at scale
Strong problem-solving skills with focus on systematic root cause analysis
Excellent communication skills for technical documentation and cross-functional collaboration
Core Stack
Data Warehousing: Experience with cloud data warehouses (Redshift, BigQuery) or lakehouse architectures
AWS Cloud Services: Working knowledge of S3, IAM, EC2 and related services
Programming: Strong Python skills with a focus on PySpark, plus SQL and Scala for ETL
DevOps & Engineering Practices
Containerisation (Docker, Kubernetes)
CI/CD pipelines and version control (Git, GitHub Actions)
Preferred Qualifications
Experience optimising aggregation layers for high-volume reporting workloads
Familiarity with Delta Lake and lakehouse design patterns
Background in distributed computing and Spark performance tuning (see the sketch after this list)
Infrastructure as code and automated deployment practices
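As one concrete instance of the Spark performance tuning mentioned above, a minimal sketch of a broadcast join; all table and column names are hypothetical.

```python
# Minimal sketch of a common Spark tuning technique, the broadcast join;
# table and column names are hypothetical, not from this posting.
from pyspark.sql import SparkSession
from pyspark.sql.functions import broadcast

spark = SparkSession.builder.appName("tuning-sketch").getOrCreate()

orders = spark.read.parquet("s3://example-bucket/curated/orders/")    # large fact table
countries = spark.read.parquet("s3://example-bucket/dim/countries/")  # small dimension

# Broadcasting the small dimension table to every executor avoids shuffling
# the large fact table, which is often the dominant cost of the join.
enriched = orders.join(broadcast(countries), on="country_code", how="left")
```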
Data Engineer building modern Data Lake architecture on AWS and implementing scalable ETL/ELT pipelines. Collaborating across teams for analytics and reporting on gaming platforms.
Chief Data Engineer leading Scania’s Commercial Data Engineering team in growing sustainable transport solutions. Focused on data products and pipelines for BI, analytics, and AI.
Data Engineer designing and building scalable ETL/ELT pipelines for enterprise-grade analytics solutions. Collaborating with product teams to deliver high-quality, secure, and discoverable data.
Entry-Level Data Engineer at GM, focusing on building large-scale data platforms in cloud environments. Collaborating with data engineers and scientists while migrating systems to cloud solutions.
Data Engineer responsible for data integrations with the AWS technology stack for Adobe's Digital Experience. Collaborating with multiple teams to conceptualize solutions and improve the data ecosystem.
People Data Architect designing and managing people data analytics for Gen, delivering actionable insights for HR. Collaborating across teams to enhance data-driven decision-making.
Data Engineer role focused on shaping future connectivity for customers at Vodafone. Involves solving complex challenges in a diverse and inclusive environment.
VP, Senior Data Engineer responsible for designing and developing cloud data solutions for insider risk in Information Security at SMBC. Collaborating with multiple teams to enhance cybersecurity data platform.
Data Engineer responsible for architecting, developing, and maintaining Allegiant’s enterprise data infrastructure. Overseeing the transition to a cloud-hosted data warehouse and developing next-generation data tools.
Senior Data Engineer developing Azure-based data solutions for clients in the Data & AI department. Collaborating with architects and consultants to enhance automated decision-making.