Lead Azure Data Engineer designing and optimizing data ecosystems on Microsoft Cloud. Responsible for building scalable data platforms and pipelines for analytics and reporting.
Responsibilities
Lead end-to-end development of scalable data pipelines and orchestration frameworks using Azure Data Factory (ADF), Azure Synapse Analytics, Azure Databricks, and Microsoft Fabric.
Build robust real-time and batch data pipelines, including integration with streaming sources (e.g., Event Hubs, Kafka) and structured streaming engines.
Design and implement Structured Streaming applications in Spark for near-real-time processing of streaming data.
Develop and maintain ETL/ELT pipelines and transformations leveraging Spark, PySpark, SQL, and fabric orchestration capabilities.
Architect and implement data solutions using Microsoft Fabric, including OneLake, Dataflows, warehouses, and Fabric capacity planning to support enterprise analytics.
Collaborate on data governance, cataloging, and asset organization using Unity Catalog within Databricks and Fabric environments.
Manage Microsoft Fabric capacity and resource utilization to optimize performance and cost efficiency for analytics workloads.
Design, deploy, and optimize Databricks dashboards and reporting artifacts for business stakeholders.
Apply best practices for data modelling, caching, file sizing, and performance tuning of Spark and Delta Lake jobs (e.g., Z-ORDER, broadcast joins, adaptive query execution).
Oversee governance, access controls, metadata management, and lineage using Unity Catalog.
Lead and mentor a team of data engineers, fostering best practices in development, operations, documentation, and quality.
Work with cross-functional teams (architecture, BI, data science, DevOps) to translate business requirements into scalable data solutions.
Partner with stakeholders to define data strategy, standards, and architectural roadmaps.
Establish and enforce standards for data quality, testing, monitoring, operational observability, and governance.
Implement secure, compliant data access and lineage frameworks across cloud data platforms.
Implement CI/CD pipelines, infrastructure-as-code for data platform artifacts, and automated testing frameworks for data jobs and workflows.
Requirements
10+ years of hands-on experience in data engineering on Azure with deep expertise in ADF, Synapse, Databricks, and Microsoft Fabric.
Proven experience with real-time data processing, streaming architectures, and Spark Structured Streaming.
Strong proficiency in Azure Data Factory, Spark (PySpark), SQL, Azure Synapse Analytics, Databricks Runtime, and cloud storage.
Solid knowledge of Unity Catalog for data governance, security, and access management.
Experience designing and managing Databricks Dashboards, performance optimization, cost controls, and data platform resource tuning.
Expertise in building scalable, fault-tolerant, and high-throughput batch & streaming data solutions.
Excellent leadership, cross-team collaboration, and communication skills.
Data Migration Specialist handling large - scale data migration from legacy to enterprise PLM platform. Analyzing data structures, developing strategies, and ensuring integrity across systems.
Director leading strategy, governance, and delivery of enterprise data platform at Phillips 66. Partnering with AI, Data Science, and business teams to enhance analytics and business systems.
Product Owner driving ERP data migration initiatives for BioNTech’s global landscape. Leading effective data management and ensuring compliance with regulatory standards in a fast - paced environment.
Data Engineer II leading development and delivery of data pipelines for Syneos Health. Collaborating with teams to optimize data processing and integrate solutions into production environments.
Lead Data Engineer overseeing data operations and analytics engineering teams for OneOncology. Focused on operational excellence in data platform and model reliability for cancer care improvement.
Senior AWS Software Data Engineer at Boeing focusing on AWS Data services to support digital analytics capabilities. Collaborating with cross - functional teams to design, develop, and maintain software data solutions.
Senior Data Engineer designing and improving software for business capabilities at Barclays. Collaborating with teams to build a data and intelligence platform for Equity Derivatives.
Senior AI & Data Engineer developing and implementing AI solutions in collaboration with clients and teams. Working on projects involving generative AI, predictive analytics, and data mastery.
Consultant driving IA business growth in Deloitte's Artificial Intelligence & Data team. Delivering innovative solutions using data analytics and automation technologies.
Data Engineer responsible for managing data architecture and pipelines at Snappi, a neobank. Collaborating with teams to enable data processing and analysis in innovative banking solutions.