Data Engineer creating data pipelines in Databricks for a fast-growing digital banking platform. Responsible for ensuring data quality and optimising processes to support decision-making.
Responsibilities
Design, develop, test, deploy and monitor data pipelines in Databricks on AWS from a wide variety of data sources.
Design, develop, test, deploy and monitor scalable code with PySpark and SQL in Databricks.
Identify opportunities to improve internal processes through code optimisation and automation.
Build data quality dashboards, lineage flows, and/or monitoring tools that utilise the data pipeline, providing active monitoring and actionable insight into overall data quality and data governance.
Assist in migrating data from legacy systems onto newly developed solutions.
Follow and lead best practices on all data security, retention, and privacy policies.
Requirements
Bachelor’s degree.
**3+ years’ experience building ETL/ELT pipelines.**
Proven competency in solution design, development, implementation, reporting and analysis.
Proficiency in **Apache Spark, Python, and SQL**.
Proficiency in working with **Text, Delta, Parquet, JSON, CSV, and XML data formats.**
Working knowledge of Spark structured streaming.
**AWS infrastructure experience, specifically working with S3.**
**Solid understanding of Git-based version control, DevOps, and CI/CD. Experience working with the Atlassian stack is a plus.**
**Knowledge of common web API frameworks and web services.**
Strong teamwork, relationship, and client management skills, and the ability to influence peers and senior management to accomplish team goals.
Willingness to embrace modern technology, best practice, and ways of working.
Data Engineer building scalable data pipelines and collaborating with teams at Ekimetrics. Involved in data quality, governance, and maintaining data integrity.
Senior Data Engineer developing data solutions and scalable systems at SimplePractice. Collaborating with teams to enhance analytics and decision-making for health and wellness clinicians.
Senior Data Engineer responsible for designing and implementing cloud-native data platforms for LPL Financial. Collaborating with stakeholders to enhance party reference data services and solutions.
Data Engineer in charge of designing and building data integration pipelines with Informatica and AWS technologies. Work collaboratively to deliver high-quality solutions in an agile environment.
Senior Software Engineer specializing in data engineering and infrastructure for cloud-native solutions at Cloudera. Leading technical direction and mentoring engineers in a high-impact role.
Senior Data Engineer at Sonatype responsible for building data pipelines and BI solutions. Collaborating with teams to design infrastructures empowering analytics and decision-making.
Lead Data Engineer responsible for driving data initiatives at Lennar, one of the nation's leading homebuilders. Manage projects, ensure scalability, and collaborate with stakeholders to meet organizational goals.
Financial Data Engineer Intern assisting with model integration and process automation at Transamerica. Focused on data engineering tasks with collaboration across IT and Finance teams.
Data Engineer managing reliable data pipelines and solutions using Microsoft Azure and Python. Collaborating across teams to meet business data needs in a scalable environment.