Data Engineer responsible for data infrastructure and pipelines to support drug discovery efforts. Collaborating with scientists and engineers to facilitate data-driven insights in an innovative biotech startup.
Responsibilities
Design and implement data pipelines that harmonize, validate, and version scientific data for downstream use in modeling and analysis
Develop tools and schemas for integrating heterogeneous data types (chemical, image-based, genomic, etc.)
Build and maintain scalable data storage systems and APIs to make experimental and model-derived data accessible to scientists and machine learning teams
Collaborate with ML Scientists to prepare and curate datasets for training and evaluating predictive models
Partner with Software Engineers to surface clean, well-structured data to end users through our internal and customer-facing platforms
Establish and enforce best practices for data governance, reproducibility, and lineage tracking
Requirements
4+ years of experience as a Data Engineer, ML Platform Engineer, or similar role
Proficiency building and maintaining data pipelines and ETL processes in Python (e.g. using orchestration tools such as Dagster, Airflow, or Prefect)
Experience with cloud-based storage and compute (e.g. AWS S3 and ECS, or equivalent)
Outstanding written and oral communication skills
Interest in diving deep into the science of drug discovery and the business of a growing startup
Nice to have: Experience managing and working with scientific data, particularly in chemistry
Benefits
Competitive salary and equity-based compensation
Comprehensive healthcare benefits (including dental and vision)
Opportunity to grow along with a rapidly scaling company
Manager leading coordination between teams for data engineering at DPR Construction. Supporting core markets and account management through data analytics and technical delivery.
Program Manager leading enterprise-wide data migration efforts for Boeing's transition to modern data platforms. Overseeing complex processes to ensure secure and effective migrations across multiple systems.
Senior Data Engineer responsible for developing data products for Disney's immersive digital experiences. Collaborating with teams to ensure data quality and operational efficiency in a fast-paced environment.
Senior Data Engineer crafting and developing data products for analytical insights at Zendesk. Collaborating in an Agile environment with a focus on data warehousing and process optimization.
Senior Data Engineer designing and building data warehouse solutions with Snowflake for a fintech company. Collaborating with cross-functional teams to facilitate data insights and analytics.
Data Engineer developing and maintaining data pipelines and applications at EvidenceCare. Collaborating across teams to generate actionable insights from healthcare data for better decision-making.
Data Engineer managing and expanding enterprise business intelligence and data platform. Focusing on Tableau development and administration with a strong engineering background.
Lead Data Engineer overseeing engineers and advancing the data platform at American Family Insurance. Creating tools and infrastructure to empower teams across the company.