Senior Data Engineer designing and scaling data pipelines for a go-to-market platform. Build and own systems that create a high-quality first-party contact dataset.
Responsibilities
Build the enrichment platform - Design and scale pipelines that process 100M+ contact records, integrating with large-scale contact data vendors.
Own data quality - Build deduplication, entity resolution, and record matching systems that merge contacts from multiple sources into a single high-quality record.
Optimize vendor economics - Create waterfall enrichment logic that maximizes coverage and freshness while minimizing per-record cost across a portfolio of data vendors.
Ship freshness infrastructure - Build systems that detect job changes, flag stale records, and trigger re-enrichment to keep the dataset current.
Instrument and measure - Create quality scoring, coverage dashboards, and accuracy metrics that give the business visibility into dataset health.
Requirements
5+ years of data engineering experience, including 2+ years working with contact data, enrichment systems, or data infrastructure.
Deep experience with data vendor APIs and the economics of contact data (coverage rates, accuracy tradeoffs, cost per record).
Expert-level SQL and data modeling skills, with experience designing schemas for large-scale entity datasets.
Track record building production data pipelines using modern tooling (dbt, Airflow, Dagster, Spark).
Experience with entity resolution, deduplication, and fuzzy matching at scale.
Strong understanding of data warehousing (Snowflake, ClickHouse, BigQuery) and performance optimization.
Business-minded: you understand that data quality directly impacts revenue and can articulate tradeoffs in terms the GTM team understands.
Benefits
Comprehensive benefits, including medical, dental, vision, and 401(k) options.
Data Engineer/Analyst maintaining and improving data infrastructure for Braiins. Collaborating with technical and business teams to ensure reliable data flows and insights.
Medior Data Engineer handling Azure migrations for a major urban mobility client. Focused on data pipeline development and ensuring platform reliability with cutting-edge technologies.
Developing ML and computer vision solutions for a cutting-edge autonomous vehicle dataset pipeline at Mobileye. Collaborating across teams for data curation and advanced perception algorithms.
Data Migration Lead in a hybrid role managing data migration for a major transformation programme in the media sector. Collaborating with various teams to ensure data integrity and successful migration.
Consultant ML & DataOps at Smile integrating data science projects for major clients. Designing MLOps solutions and enhancing data governance in a collaborative environment.
Data Engineer developing and maintaining data pipelines for Coolbet’s analytical services. Working within an Agile framework to ensure data reliability and efficiency.
API Data Engineer developing innovative data-driven solutions and advancing data architecture for AI Control Tower. Building and integrating APIs and data pipelines to support organizational needs.
Journeyman Data Architect supporting Leidos' enterprise data and analytics program for the Department of War. Collaborating on solutions for data architecture, cloud environments, and governance.
AWS Streaming Data Engineer developing software and systems in a fast, agile environment. Utilizing experience with real-time data ingestion and processing systems across distributed environments.
Senior Software Engineer developing backend services and data infrastructure for integrated products at Booz Allen. Collaborating with a small elite team to deliver reliable and scalable services.