Data Engineer responsible for designing and maintaining data pipelines at SOPHiA GENETICS. Collaborating with stakeholders to deliver actionable insights through robust data models and workflows.
Responsibilities
Designing, building, and maintaining the data pipelines and transformations that power analytics and decision-making across SOPHiA GENETICS.
Working closely with stakeholders from different functions to understand their use cases, translate their needs into robust data models and workflows, and ensure that data is reliable, well-structured, and ready for downstream analysis and visualization.
Building scalable data solutions, thinking critically about what the data means, how it is produced, and how it can be used to generate actionable insights.
Design and maintain scalable, reliable data pipelines and ETL processes that integrate seamlessly and perform efficiently across the platform.
Optimize data workflows, queries, and resource utilization to maximize performance and cost-efficiency as data volume and complexity grow.
Develop and implement new data features and transformations that translate stakeholder use cases into production-ready solutions.
Automate deployment of data jobs, CI/CD pipelines, and infrastructure configurations to ensure consistency and reproducibility.
Implement monitoring and observability across data quality, pipeline health, and performance to detect issues early and ensure reliability.
Requirements
2-5 years of experience working within Data Engineer (distributed data, data lakes, microservice-oriented architectures, and APIs)
BA/MA in Computer Science or Engineering or equivalent professional experience
Expertise with Python ETLs in a data processing environment, ideally Databricks
Expertise with distributed big data architectures (schemas, transfers, storage, partitioning, performance monitoring and optimization)
Solid knowledge of modern scalable database and data lake technologies, especially Spark & SQL, but also including Parquet & Delta tables.
Experience with containerization and orchestration technologies, as well as basic DevOps processes and tooling
Experience with software engineering best-practices, Agile, CI/CD, Unit & integration testing
Experience with multimodal data spanning of digital healthcare, clinical, radiomics and genomics (is a plus)
As a public organisation facing ongoing commercial growth, you will bring a success-orientated and solutions-focused mindset that embraces team collaborations, change, growth and inclusion.
As an international organisation, English is our primary business language and you will need to bring full fluency in English. As part of your recruitment journey, you should expect to meet English-only speakers, so for best chances of success, you should include your CV in English. Non-English CVs will be rejected at application stage.
Benefits
Opportunity to work on cutting-edge research projects with an immediate global impact
A flexible, friendly and international working environment with a collaborative atmosphere
An exciting company mission that brings together science and technology to directly impact the lives of patients with life threatening illness
A fast-growing company with plenty of opportunity for personal growth and development
A hard technical challenge to solve with exciting modern technology - cloud computing, Big Data, DevOps, machine learning
Forfait-Jour working types
Health benefits for you and your family covered by 80% employer contributions
Life Insurance and pensions contribution
SWILE meal vouchers and home office allowances
25 Days Vacation
Additional voluntary benefits including sports allowance, language courses, bank partnerships and transportation.
Data Engineer working on open - source Data Lakehouse and data pipeline development. Involves data integration and ensuring data quality in a hybrid work environment.
Data Engineer creating data pipelines in Databricks for a fast - growing digital banking platform. Responsible for ensuring data quality and optimising processes to support decision - making.
Data Engineer building scalable data pipelines and collaborating with teams at Ekimetrics. Involved in data quality, governance, and maintaining data integrity.
Senior Data Engineer developing data solutions and scalable systems at SimplePractice. Collaborating with teams to enhance analytics and decision - making for health and wellness clinicians.
Senior Data Engineer responsible for designing and implementing cloud - native data platforms for LPL Financial. Collaborating with stakeholders to enhance party reference data services and solutions.
Data Engineer in charge of designing and building data integration pipelines with Informatica and AWS technologies. Work collaboratively to deliver high - quality solutions in an agile environment.
Senior Software Engineer specializing in data engineering and infrastructure for cloud - native solutions at Cloudera. Leading technical direction and mentoring engineers in a high - impact role.
Senior Data Engineer at Sonatype responsible for building data pipelines and BI solutions. Collaborating with teams to design infrastructures empowering analytics and decision - making.
Lead Data Engineer responsible for driving data initiatives at Lennar, one of the nation's leading homebuilders. Manage projects, ensure scalability, and collaborate with stakeholders to meet organizational goals.