Data Engineer crafting data ingestion and transformation processes for an AI healthcare platform. Collaborating with teams to turn complex healthcare data into actionable insights.
Responsibilities
Own data ingestion, transformation, and curation across Bronze, Silver, and Gold layers of our Databricks-based Medallion Architecture.
Manage and optimize data pipelines using Airflow for orchestration and Airbyte (or similar tools) for multi-source ingestion.
Build and maintain connectors and workflows for APIs, EHR/EMR systems (FHIR), resident life, and IoT/monitoring data sources.
Implement batch and streaming pipelines supporting both analytics and near real-time use cases.
Develop and monitor data quality, validation, and profiling frameworks across ingestion points.
Support AI enablement efforts — preparing data for LLM-based insights, population health analytics, and predictive modeling use cases (e.g., fall risk, medication adherence, staffing optimization).
Collaborate closely with data science to enable curated datasets and semantic layers for Superset and AI query interfaces.
Partner with our infrastructure team to maintain infrastructure as code (Terraform) for data services, ensuring scalability and reproducibility.
Partner with security and compliance officers to move towards HIPAA and SOC 2 alignment for all data storage and processing.
Requirements
4–8 years of hands-on data engineering experience.
Strong proficiency with Airflow and Databricks (Spark, Delta Lake, SQL, Python).
Experience building scalable ingestion pipelines with Airbyte, Fivetran, or custom API connectors.
Solid understanding of Azure data ecosystem (Data Lake, Blob Storage, Key Vault, Functions, FHIR Server, etc.).
Experience implementing and maintaining ETL/ELT pipelines in a HIPAA or regulated environment.
Comfort with both SQL and Python for transformations, orchestration, and testing.
Strong grasp of data modeling, schema evolution, and versioned datasets.
Ability to operate independently and deliver results in a small, fast-moving team.
Experience with FHIR and healthcare data structures and interoperability standards.
Familiarity with vector databases (e.g., pgvector, Pinecone) or embedding pipelines for AI/LLM applications.
Experience with GitHub best practices for maintaining and sharing code.
Familiarity with Superset or other analytics tools for internal visualization.
Understanding of security best practices, including encryption, RBAC, and least-privilege design.
Benefits
Christmas Bonus: 30 days, to be paid in December.
Major Medical Expense Insurance: Coverage up to $20,000,000.00 MXN.
Minor Medical Insurance: VRIM membership with special discounts on doctor’s appointments and accident reimbursements.
Dental Insurance: Always smile with confidence!
Life Insurance: (Death and MXN Disability)
Vacation Days: 12 vacation days in accordance with Federal Labor Law, with prior approval from your manager.
Floating Holidays: 3 floating holidays in addition to the 7 official holidays in Mexico.