Data Engineer building data pipelines for AI Safety company. Handling petabytes of logs and creating a clean, reliable data environment.
Responsibilities
Build and maintain a clean, stable data environment so all team members can access petabytes of traces, logs, and model outputs in the formats they need – without delays or manual extraction.
Develop and run internal data APIs, SDKs, and tools that help engineering, product, and research teams discover, query, and use data without touching infrastructure.
Monitor and improve data performance – from table layouts to query plans – so analytics and research workloads stay smooth as volumes grow.
Manage data access and governance, defining and enforcing permissions, access rules, and security policies.
Requirements
You’ve built or scaled a modern data stack – Snowflake, ClickHouse, event streaming – in a startup or similarly fast environment.
You’re strong in SQL and Python and comfortable working with large, messy datasets.
You communicate clearly and can work directly with engineers and researchers. Fluent English.
A big plus: experience with Metabase, Tableau, or similar tools for internal dashboards.
Benefits
20 days of paid vacation
Work from Paris (hybrid) + relocation package
Best medical insurance in France
All the hardware, tools, and services you need
Covered subscriptions for AI agents and IDEs
Team off-sites twice a year: we’ve recently been to the Alps and to Saint-Tropez
Azure Lead Data Engineer designing and developing ETL/ELT pipelines with Azure Data Factory and Snowflake. Collaborating with cross - functional teams in a cloud - native environment.
Principal Data Engineer leading Azure platform designs and implementations for enterprise solutions at UBDS Group. Mentoring teams and driving high engineering standards in hybrid environments.
Data Engineer designing and maintaining the data systems for Skiffra’s AI - native orchestration platform. Collaborating closely with product and engineering teams for data integration and system design.
Data Engineer at Kyndryl designing and maintaining data pipelines using AWS and Python. Optimizing ingestion, transformation workflows, and cloud solutions for large - scale data environments.
Data Architect responsible for the integrity and reliability of Patient Services data in Life Sciences. Ensuring analytics - ready data through strategic vendor collaboration and data stewardship.
Project & Data Engineer providing operational support and data management for utility service projects in the Greater Los Angeles area. Involves invoice processing, data accuracy, and system coordination.
Senior Data Engineer developing scalable data architectures and integrating data ecosystems at Porto Bank. Ensuring data quality and effective pipeline development for various business teams.
Data Engineering Advisor designing data flow management systems to support advanced analytics at Desjardins Group. Collaborate with teams to enhance data value and transformation.
Founding Staff Data Engineer building and leading data engineering team for AI - driven art valuation platform. Establishing architecture and standards for data systems and pipelines.
Senior Data Engineer responsible for developing, maintaining ETL processes and integrating data solutions. Collaborating with teams on data quality and cloud migration initiatives.