Data Engineer building and maintaining data infrastructure for AI life science products. Collaborating with cross-functional teams to deliver impactful solutions while participating in Agile processes.
Responsibilities
Develop, maintain and update ETL catalog.
Build, maintain and update repeatable and trackable data pipelines.
Collect, analyze and organize raw data in collections of datasets.
Design databases and data stores for big datasets with performance considerations in mind.
Collaborate with engineers and data scientists to develop and productize prototypes.
Participate in architectural decisions regarding our data architecture.
Participate in our Agile processes like planning and daily stand-up meetings.
Requirements
3+ years of professional experience as a data engineer or in a similar role.
Proven experience with Relational and NoSQL data stores.
Proven experience with data models, data mining, and segmentation techniques.
Experience with big data tools, specifically Spark (preferably with Databricks).
Experience with data pipeline / workflow orchestration tools (Azure DataFactory, Prefect).
Experience with cloud compute frameworks like Azure Batch.
Experience with Python.
Experience with SQL query authoring.
Experience with Git and collaborative development workflows.
Experience with data manipulation and transformation using Pandas and/or Polars.
Experience working with cross-functional teams in agile environments.
Excellent verbal and written communication in English.
Ability to provide clear and concise step-by-step technical help, verbally and in writing.
Ability to interact effectively with audiences of varying technical backgrounds & seniority.
BS/MS degree in Computer Science, Information Technology or a related field (nice to have).
Familiarity with Atlassian stack of tools(Jira, Jira service management & Confluence)(nice to have).
Familiarity with microservices-based architectures (nice to have).
Familiarity with bioinformatics or life sciences data (nice to have).
Benefits
Competitive compensation packages based on qualifications.
Flexible work schedule.
Professional and personal development opportunities.
Private life & health insurance.
Room to experiment, learn and have fun.
Peers with big smiles and fascinating ideas.
A multi-disciplinary, multinational team that values trust and autonomy.
Lead Data Engineer overseeing engineers and advancing the data platform at American Family Insurance. Creating tools and infrastructure to empower teams across the company.
Data Architect designing end - to - end Snowflake data solutions and collaborating with technical stakeholders at Emerson. Supporting the realization of Data and Digitalization Strategy.
Manager of Data Engineering leading data assets and infrastructure initiatives at CLA. Collaborating with teams to enforce data quality standards and drive integration efforts.
Data Engineer building modern Data Lake architecture on AWS and implementing scalable ETL/ELT pipelines. Collaborating across teams for analytics and reporting on gaming platforms.
Chief Data Engineer leading Scania’s Commercial Data Engineering team for growing sustainable transport solutions. Focused on data products and pipelines for BI, analytics, and AI.
Entry - Level Data Engineer at GM, focusing on building large scale data platforms in cloud environments. Collaborating with data engineers and scientists while migrating systems to cloud solutions.
Data Engineer designing and building scalable ETL/ELT pipelines for enterprise - grade analytics solutions. Collaborating with product teams to deliver high - quality, secure, and discoverable data.
Data Engineer responsible for data integrations with AWS technology stack for Adobe's Digital Experience. Collaborating with multiple teams to conceptualize solutions and improve data ecosystem.
People Data Architect designing and managing people data analytics for Gen, delivering actionable insights for HR. Collaborating across teams to enhance data - driven decision - making.