Data Engineer optimizing data ingestion and transformation pipelines for seamless data flow. Collaborating with cross-functional teams using Databricks and other cloud services in a hybrid work setting.
Responsibilities
Ingest data from a variety of sources such as Azure SQL DB, Google Analytics, Google Play Store, Apple App Store, Salesforce, and others.
Develop and optimize ETL/ELT pipelines to transform data from CSV, JSON, SQL tables, and APIs into usable formats.
Work with REST APIs to pull data from various external sources and integrate it into our data ecosystem.
Design and implement efficient data transformation processes to cleanse, aggregate, and enrich data.
Apply industry best practices for data modeling to ensure scalability, performance, and data integrity.
Collaborate with data analysts and data scientists to provide clean, high-quality datasets for reporting and analysis.
Utilize Databricks for data processing, transformation, and orchestration tasks.
Manage and optimize Databricks clusters for performance, reliability, and cost-effectiveness.
Implement Databricks workflows to automate and streamline data pipelines.
Use Unity Catalog for data governance and metadata management, ensuring compliance and data access control.
Requirements
5+ years of hands-on experience in data engineering or a related field.
Proven experience with Databricks and Databricks workflows, including cluster management and data pipeline orchestration.
Strong experience in data ingestion from SQL databases (Azure SQL DB), APIs (Google Analytics, Google Play Store, Apple App Store, Salesforce), and file-based sources (CSV, JSON).
Proficiency in SQL for data manipulation and transformation.
Experience with Python or Scala for writing and managing data workflows.
Working knowledge of REST APIs for data integration.
Experience in data transformation using Apache Spark, Delta Lake, or similar technologies.
Knowledge of cloud platforms such as Azure, with a focus on Azure SQL DB.
Familiarity with Unity Catalog for metadata management and governance.
Understanding of data architecture, data pipelines, and the ETL/ELT process.
Experience in data modeling, optimizing queries, and working with large datasets.
Familiar with data governance, metadata management, and data access controls.
Knowledge of Apache Kafka or other real-time streaming technologies (optional).
Experience with Data Lake or Data Warehouse technologies (optional).
Familiarity with additional data transformation tools such as Apache Airflow or dbt (optional).
Understanding of machine learning workflows and data pipelines (optional).
Data Engineer II leading development and delivery of data pipelines for Syneos Health. Collaborating with teams to optimize data processing and integrate solutions into production environments.
Lead Data Engineer overseeing data operations and analytics engineering teams for OneOncology. Focused on operational excellence in data platform and model reliability for cancer care improvement.
Senior AWS Software Data Engineer at Boeing focusing on AWS Data services to support digital analytics capabilities. Collaborating with cross - functional teams to design, develop, and maintain software data solutions.
Senior Data Engineer designing and improving software for business capabilities at Barclays. Collaborating with teams to build a data and intelligence platform for Equity Derivatives.
Senior AI & Data Engineer developing and implementing AI solutions in collaboration with clients and teams. Working on projects involving generative AI, predictive analytics, and data mastery.
Consultant driving IA business growth in Deloitte's Artificial Intelligence & Data team. Delivering innovative solutions using data analytics and automation technologies.
Data Engineer responsible for managing data architecture and pipelines at Snappi, a neobank. Collaborating with teams to enable data processing and analysis in innovative banking solutions.
Data Engineer at Destinus developing the data platform to support production and analytics needs. Involves migrating Excel sources to Lakehouse and integrating ERP systems in a hybrid role.
Senior Data Engineer developing solutions within the Global Specialty portfolio at an insurance company. Engaging with diverse business partners to ensure high quality data reporting.
Data Engineer at UBDS Group focusing on designing and optimizing modern data platforms. Collaborating in a multidisciplinary team to develop reliable data assets for analytics and operational use cases.