Principal Data Engineer responsible for architecting scalable data pipelines and building high-quality data foundations. Collaborating closely with experts to ensure data readiness for advanced analytics.
Responsibilities
Architect and implement scalable data pipelines for batch and real-time ingestion and processing.
Build sophisticated transformations for attribute extraction, normalisation, and entity resolution.
Develop knowledge infrastructure, including metadata layers, product graphs, and ontologies.
Collaborate with domain experts to define taxonomies and classification schemas.
Enforce data contracts and validation rules to ensure consistency and lineage across the organisation.
Promote engineering best practices around testing, documentation, and observability for data workflows.
Requirements
Mastery of Apache Beam, Dataflow, and Pub/Sub in a cloud-native environment.
Expert knowledge of SQL, dbt, BigQuery, and distributed computing frameworks.
Extensive experience building production-grade pipelines for large-scale data.
Strong analytical thinking and the ability to collaborate across engineering, ML, and product teams.
Proven track record of independently researching and applying new data technologies.
Nice to have: Background in building agentic systems or personalised recommendation engines.
Experience with various data formats including Avro and Protobuf.
Google Cloud Professional Data Engineer certification.
Benefits
10 days PTO + 17 days paid public holidays
Pension
Law 19032 (Social Security)
Family allowance
National Employment Fund
Accident Insurance
Life Insurance
Work from Home Allowance
Private Medical Insurance
Birthday leave
10 paid learning days per year
Bonusly 100 points per month to recognise colleagues
Software Engineer at Warner Music Group developing an innovative Data Platform for the music industry. Collaborating with dynamic teams to enhance music data processing and delivery.
Data Engineer role specializing in Azure & Snowflake at InfoCentric. Leading design and delivery of enterprise - scale data platforms for large organizations.
Principal Data Architect at PointClickCare ensuring coherent and scalable data architecture. Driving unified data direction while collaborating with Engineering Architecture team for AI enablement.
Data Engineer Tech Lead developing data solutions at Carelon. Leading a cross - functional team to optimize data workflows and maintain data integrity.
Lead Data Engineer responsible for evolving Manna’s data infrastructure for drone delivery. Overseeing data architecture and analytics while building scalable data pipelines.
Data Engineer designing, implementing, and optimizing data pipelines for DeepLight AI. Collaborating closely with a multidisciplinary team to analyze large - scale data.
Data Engineer designing and maintaining scalable ETL pipelines at Satori Analytics. Collaborating with teams to deliver high - quality analytics solutions across various industries.
Data Architect responsible for defining enterprise data architecture on AWS and Databricks Lakehouse platforms. Enabling scalable data lakes and enterprise analytics for financial services organizations.
Data Platform Operations Support leading data engineering strategy across projects for EXL. Driving innovation and optimization while collaborating with various teams in the organization.
Manager II leading data engineering projects at Navy Federal Credit Union. Overseeing data governance and quality initiatives while managing engineering teams in a hybrid work environment.