Data Engineer with expertise in Databricks, SQL, and Python for scalable data solutions. Focused on ETL/ELT pipeline development, auditability, data quality, and automated testing.
Responsibilities
Lead the design, development, and optimization of ETL/ELT pipelines using Databricks, Python, Spark, and Delta Lake.
Architect scalable data solutions using Medallion architecture (Bronze, Silver, Gold layers).
Design and implement data models and transformations using SQL and Python.
Build and maintain audit frameworks to ensure traceability, compliance, and data lineage.
Develop data quality monitoring and automated testing frameworks for pipeline reliability.
Perform data analysis to support operational data requests and user queries.
Collaborate with clinical data teams to analyse IRT/RTSM datasets.
Create and maintain dashboards and reports using BI tools (e.g., Superset Power BI, Tableau, Qlik, or similar).
Help manage CICD and automated code branching/deployment.
Ensure compliance with GxP, CDISC, and other regulatory standards.
Mentor junior engineers and promote engineering best practices.
Requirements
8–10 years of experience in data engineering, with leadership or team lead responsibilities.
Strong hands-on experience with Databricks, Apache Spark, and Delta Lake.
Advanced proficiency in SQL and Python for data transformation and automation.
Experience with ETL/ELT orchestration, job optimization, and performance tuning.
Proven experience designing and implementing audit, data quality, and testing frameworks.
Hands-on experience with IRT/RTSM clinical trial data systems.
Strong data analysis skills and ability to interpret complex datasets.
Experience with BI/reporting tools such as Power BI, Tableau, or Qlik.
Knowledge of clinical data standards (e.g., CDISC, SDTM, ADaM).
Experience with cloud platforms (Azure, AWS, or GCP) and CI/CD pipelines.
Data Engineer II leading development and delivery of data pipelines for Syneos Health. Collaborating with teams to optimize data processing and integrate solutions into production environments.
Lead Data Engineer overseeing data operations and analytics engineering teams for OneOncology. Focused on operational excellence in data platform and model reliability for cancer care improvement.
Senior AWS Software Data Engineer at Boeing focusing on AWS Data services to support digital analytics capabilities. Collaborating with cross - functional teams to design, develop, and maintain software data solutions.
Senior Data Engineer designing and improving software for business capabilities at Barclays. Collaborating with teams to build a data and intelligence platform for Equity Derivatives.
Senior AI & Data Engineer developing and implementing AI solutions in collaboration with clients and teams. Working on projects involving generative AI, predictive analytics, and data mastery.
Consultant driving IA business growth in Deloitte's Artificial Intelligence & Data team. Delivering innovative solutions using data analytics and automation technologies.
Data Engineer responsible for managing data architecture and pipelines at Snappi, a neobank. Collaborating with teams to enable data processing and analysis in innovative banking solutions.
Data Engineer at Destinus developing the data platform to support production and analytics needs. Involves migrating Excel sources to Lakehouse and integrating ERP systems in a hybrid role.
Senior Data Engineer developing solutions within the Global Specialty portfolio at an insurance company. Engaging with diverse business partners to ensure high quality data reporting.
Data Engineer at UBDS Group focusing on designing and optimizing modern data platforms. Collaborating in a multidisciplinary team to develop reliable data assets for analytics and operational use cases.