Data Engineer designing and maintaining the data systems for Skiffra’s AI-native orchestration platform. Collaborating closely with product and engineering teams for data integration and system design.
Responsibilities
Design and maintain a robust semantic layer that translates raw database schemas into high-context metadata, allowing LLMs and autonomous agents to reason across enterprise data
Integrate ERPs, operational tools, and legacy systems into a clean, unified internal data layer that powers Skiffra’s orchestration engine
Design the schemas and data contracts consumed by LLMs and workflow engines to ensure predictable, high-fidelity inputs from varied, often "messy" sources
Ensure every data point carries the necessary lineage and metadata for an LLM to understand its business significance
Architect the end-to-end ingestion and normalization pipelines for structured, semi-structured, and unstructured data, transforming fragmented enterprise fragments into a high-fidelity stream for AI agents
Implement the monitoring, observability, and automated data quality gates necessary to ensure our orchestration engine doesn't act on stale or corrupted enterprise context
Partner closely with product and engineering to translate complex operational needs into scalable data systems
Operate with speed and rigor in environments where reliability matters
Requirements
7+ years building production-grade data engineering or backend systems
Strong Python and SQL mastery
Experience with ETL/ELT pipelines and API-based integrations
Experience with cloud data infrastructure and streaming or event-driven systems
Ability to work independently and make sound technical tradeoffs
Proven track record building systems where data discovery and cataloging were core features
Ability to make sound technical tradeoffs and thrive in "zero-to-one" environments without a roadmap or perfect documentation
Benefits
health insurance
retirement plans
participation in the Company’s bonus and incentive programs
Google Data Architect creating and optimizing data solutions using GCP technologies. Collaborating with teams to enhance enterprise data architecture across business functions.
Azure Lead Data Engineer designing and developing ETL/ELT pipelines with Azure Data Factory and Snowflake. Collaborating with cross - functional teams in a cloud - native environment.
Principal Data Engineer leading Azure platform designs and implementations for enterprise solutions at UBDS Group. Mentoring teams and driving high engineering standards in hybrid environments.
Data Engineer at Kyndryl designing and maintaining data pipelines using AWS and Python. Optimizing ingestion, transformation workflows, and cloud solutions for large - scale data environments.
Data Architect responsible for the integrity and reliability of Patient Services data in Life Sciences. Ensuring analytics - ready data through strategic vendor collaboration and data stewardship.
Project & Data Engineer providing operational support and data management for utility service projects in the Greater Los Angeles area. Involves invoice processing, data accuracy, and system coordination.
Senior Data Engineer developing scalable data architectures and integrating data ecosystems at Porto Bank. Ensuring data quality and effective pipeline development for various business teams.
Data Engineering Advisor designing data flow management systems to support advanced analytics at Desjardins Group. Collaborate with teams to enhance data value and transformation.
Founding Staff Data Engineer building and leading data engineering team for AI - driven art valuation platform. Establishing architecture and standards for data systems and pipelines.