Data Science Engineer working within Early Research to build infrastructure for biological data analysis. Combining data engineering, data science, and machine learning to support scientific discovery and research teams.
Responsibilities
Build and maintain scalable ETL pipelines for experimental, biological, and pharmacological data
Integrate data from multiple sources such as GeneData, LabVantage, CROs, and cloud platforms into unified analytical environments
Automate recurring workflows to ensure data quality, accessibility, and reproducibility
Collaborate with data scientists to implement and operationalize AI and machine learning models into production-ready pipelines on cloud and HPC environments
Monitor and maintain deployed models to ensure performance, scalability, and traceability
Develop scripts, APIs, and workflows to connect laboratory systems such as GeneData Biologics and LabVantage with central analytics environments
Act as the technical link between Discovery, IT, and Data Science teams
Translate scientific and technical needs into robust, scalable solutions
Provide responsive technical support to scientists to ensure data accessibility and workflow reliability.
Requirements
MSc in Computer Science, Data Science, Computational Biology, or a related field
Three to five years of relevant experience in data engineering or scientific data environments
Proficiency in Python and version control systems such as Git
Experience with AWS services including S3, Lambda, ECS, Glue, and Step Functions
Strong understanding of data integration, workflow automation, and pipeline design
Excellent communication skills with the ability to explain technical solutions clearly to scientific and technical colleagues
Proactive problem solver who can adapt quickly to changing priorities.
Data Scientist analyzing claims data, optimizing processes and collaborating across departments at Allianz Spain. Utilizing statistical techniques and developing predictive models for operational efficiency.
Data Scientist enhancing merchandising and inventory performance using machine learning techniques at Nordstrom. Collaborating with teams to develop data - driven solutions for better customer experiences.
Clinical Data Manager supporting research projects and managing patient databases at Mass General Brigham. Interact with patients and maintain data integrity in clinical research.
Technical Lead in Data Science guiding analytical systems design and team mentorship for a finance technology company. Collaborating across engineering and research to drive innovation and data excellence.
Technical Lead in Data Science at Voleon, driving the design and implementation of analytical systems. Mentoring a growing team of data scientists while ensuring methodological rigor in finance applications.
Data Scientist developing analytical solutions for customer data transformation and supporting critical business decisions. Focus on quality and integrity of data through statistical models and analysis.
Junior Data Scientist developing demand forecasting models for Nestlé’s Supply Chain team. Collaborating with analysts and planners to improve forecasting tools and processes.
HCM Data and Process Analytics Lead at ABB building business intelligence for HCM solutions. Analyzing data to drive improvements in performance and data quality across the organization.
Data Scientist owning and delivering production - grade data pipelines at Simbe Robotics. Collaborating with Product Management, Engineering, and Commercial teams to surface insights from retail data.
Data and Analytics Manager defining strategies for a digital transportation company connecting freight and truckers in Brazil. Leading initiatives to maximize data value and promote a culture of data - driven decision - making.