About the role

  • Data Engineer responsible for data infrastructure and pipelines to support drug discovery efforts. Collaborating with scientists and engineers to facilitate data-driven insights in an innovative biotech startup.

Responsibilities

  • Design and implement data pipelines that harmonize, validate, and version scientific data for downstream use in modeling and analysis
  • Develop tools and schemas for integrating heterogeneous data types (chemical, image-based, genomic, etc)
  • Build and maintain scalable data storage systems and APIs to make experimental and model-derived data accessible to scientists and machine learning teams
  • Collaborate with ML Scientists to prepare and curate datasets for training and evaluating predictive models
  • Partner with Software Engineers to surface clean, well-structured data to end users through our internal and customer-facing platforms
  • Establish and enforce best practices for data governance, reproducibility, and lineage tracking

Requirements

  • 4+ years of experience as a Data Engineer, ML Platform Engineer, or similar role
  • Proficiency building and maintaining data pipelines and ETL processes in python (e.g. using orchestration tools such as Dagster, Airflow, or Prefect)
  • Experience with cloud-based storage and compute (AWS S3, ECS, etc, or equivalent)
  • Outstanding written and oral communication skills
  • Interest in diving deep into the science of a drug discovery and the business of a growing startup
  • Nice to have: Experience managing and working with scientific data, particularly in chemistry

Benefits

  • Competitive salary and equity-based compensation
  • Comprehensive healthcare benefits (including dental and vision)
  • Opportunity to grow along with a rapidly scaling company

Job title

Data Engineer

Job type

Experience level

Mid levelSenior

Salary

Not specified

Degree requirement

Bachelor's Degree

Location requirements

Report this job

See something inaccurate? Let us know and we'll update the listing.

Report job