Data Engineer building and maintaining data infrastructure for AI life science products. Collaborating with cross-functional teams to deliver impactful solutions while participating in Agile processes.
Responsibilities
Develop, maintain and update ETL catalog.
Build, maintain and update repeatable and trackable data pipelines.
Collect, analyze and organize raw data in collections of datasets.
Design databases and data stores for big datasets with performance considerations in mind.
Collaborate with engineers and data scientists to develop and productize prototypes.
Participate in architectural decisions regarding our data architecture.
Participate in our Agile processes like planning and daily stand-up meetings.
Requirements
3+ years of professional experience as a data engineer or in a similar role.
Proven experience with Relational and NoSQL data stores.
Proven experience with data models, data mining, and segmentation techniques.
Experience with big data tools, specifically Spark (preferably with Databricks).
Experience with data pipeline / workflow orchestration tools (Azure DataFactory, Prefect).
Experience with cloud compute frameworks like Azure Batch.
Experience with Python.
Experience with SQL query authoring.
Experience with Git and collaborative development workflows.
Experience with data manipulation and transformation using Pandas and/or Polars.
Experience working with cross-functional teams in agile environments.
Excellent verbal and written communication in English.
Ability to provide clear and concise step-by-step technical help, verbally and in writing.
Ability to interact effectively with audiences of varying technical backgrounds & seniority.
BS/MS degree in Computer Science, Information Technology or a related field (nice to have).
Familiarity with Atlassian stack of tools(Jira, Jira service management & Confluence)(nice to have).
Familiarity with microservices-based architectures (nice to have).
Familiarity with bioinformatics or life sciences data (nice to have).
Benefits
Competitive compensation packages based on qualifications.
Flexible work schedule.
Professional and personal development opportunities.
Private life & health insurance.
Room to experiment, learn and have fun.
Peers with big smiles and fascinating ideas.
A multi-disciplinary, multinational team that values trust and autonomy.
Cloud Data Engineer implementing tailored solutions for Volkswagen Group data processing. Building ETL/ELT pipelines while collaborating with technical experts.
Data Engineer designing and optimizing data pipelines using Databricks and Google Cloud Platform. Collaborating with analysts and scientists to deliver high - quality data products.
Data Engineer responsible for building scalable data infrastructure that supports data - driven decisions. Collaborating with team to maintain systems and unlock data value for organizations.
Associate Data Engineer supporting privacy engineering controls and executing privacy impact assessments in a financial services company. Collaborating across business units to ensure alignment with privacy regulations.
Data Engineer at CVS Health optimizing data pipelines and analytical models. Driving data - driven decisions with healthcare data for improved business outcomes.
Senior Data Engineer at CVS Health developing robust data pipelines for healthcare data. Collaborating with teams to provide actionable insights and integrate them with consumer touchpoints.
Senior Data Engineer supporting AI - enabled financial compliance initiative with data pipelines and ingestion processes. Collaborating with diverse teams in a mission - critical regulated environment.
Data Architect leading the definition and construction of cloud data architecture for Kyndryl. Participating in significant technological modernization initiatives, focusing on Google Cloud Platform.
Senior Data Engineer driving data intelligence requirements and scalable data solutions for a global consulting firm. Collaborating across functions to enhance Microsoft architecture and analytics capabilities.
Experienced AI Engineer designing and building production - grade agentic AI systems using generative AI and large language models. Collaborating with data engineers, data scientists in a tech - driven company.