Data Engineer responsible for ingestion pipelines and data quality on AI-driven marketing platform. Collaborating with data teams to ensure accuracy and performance of data systems.
Responsibilities
Reliability and completeness of our multi-provider ingestion pipeline: build failover logic, track daily completion rates per AI provider, and maintain the systems that ensure customers always get accurate, timely data
Data quality and consistency end to end: own the contracts between ingestion, processing, and downstream serving so that what customers see in the product is correct and trustworthy
Extraction pipeline performance: design and maintain async processing patterns that keep our ingestion throughput ahead of demand as we scale
Data readiness for new features: partner with engineering upstream so instrumentation, schema changes, and pipeline capacity are in place before features ship
Pipeline observability: build and maintain health monitoring across the full data stack so the team knows the state of the system in real time
Requirements
Built and maintained production data pipelines at meaningful scale, including experience with ingestion, ETL, and distributed job queue systems
Hands-on experience with ClickHouse or a comparable columnar store, including schema design, materialized views, and query optimization
Experience working with third-party data providers or scraping infrastructure at scale
Designed and owned reliability and observability systems: you have built the dashboards and alerting that tell you what is wrong before users do
Comfortable working close to the application layer: our stack is Rails and Postgres and this person will need to understand how backend systems feed the data platform
Benefits
Equity in a fast-growing startup
Competitive benefits package tailored to your location
Flexible time off policy
Parental Leave
A fun-loving and (just a bit) nerdy team that loves to move fast!
Senior Data Engineer at Goodwin enhancing data platforms and fostering data - driven culture across teams. Collaborating with IT and Finance on technology solutions and data governance practices.
Director, Data Platform Design and Strategy at MedImpact leading data platform and AI innovations to enhance healthcare services. Overseeing enterprise projects and managing teams to meet strategic goals.
Data Engineer delivering AI - and data - driven solutions for Honeywell’s industrial customers. Architecting and implementing scalable data pipelines and platforms focused on IoT and real - time data processing.
Data Engineering Associate focusing on data quality control and management for distribution platform. Collaborates on large scale data projects to ensure data accuracy and availability for users.
Data Architect managing enterprise data platform built on Microsoft Fabric at Johnstone Supply. Leading architectural standards and collaborating with business and IT leaders for strategic data - driven insights.
Data Engineer at Studyportals responsible for data pipelines and infrastructure. Join a team ensuring accurate and trustworthy data for analytics and business decisions.
AI/ML Engineer designing and refining prompts and workflows using large language models. Responsible for developing data pipelines and delivering scalable AI solutions in a hybrid work environment.
AWS Data Architect at Fractal designing and operationalizing AWS data solutions at enterprise scale. Collaborating with clients and mentoring engineers in best practices.
Senior Data Engineer driving data - driven success at Pacific Life. Collaborating with a team to build scalable and secure data solutions in Newport Beach, CA or Charlotte, NC.
Data Architect managing Commercial Data architecture initiatives for Valmet's sales and service team. Leading AI - driven data integrity and quality efforts in a global context.