Senior Data Engineer developing ETL and data pipelines for Burlington’s digital transformation team. Collaborating with analytics and engineering teams to support insights from data analysis.
Responsibilities
Design and develop ETL processes to extract data from first- and third-party data sources, then stitch and wrangle it for advanced analytics, AI/ML, and GenAI uses
Guide and lead development of data pipelines to enable real-time, high-volume data consumption
Leverage data engineering best practices and tools in collaboration with analytics translators, data scientists, and cloud engineers
Assist with researching first- and third-party data sources to enable access and ensure relevant data is available to those who need it
Provide input to cloud engineers for the design and implementation of data management and/or architecture solutions
Provide ongoing operational support for ETL/ELT environments including development, test, and production
Design, implement, and deploy data loaders to load data into the cloud data platform
Assist in pulling, filtering, tagging, joining, parsing, and normalizing data sets for use
Mentor junior members of the team
Requirements
Bachelor’s or Master’s degree in Computer Science / Engineering, Informatics, or related areas
5+ years of experience designing and implementing large-scale data loading, manipulation, processing, analysis, and exploration solutions
Extensive experience developing SQL-based data processing
5+ years of experience with data architecture, data warehouses, lakehouses, data lakes, data marts, and data stores, with a focus on AI/ML techniques and LLMs
Experience with Snowflake and Oracle Databases as well as Azure and ADLS
Experience developing and provisioning RESTful APIs to enable real-time data consumption
Experience with source control tools such as GitHub and related CI/CD processes
Technical expertise in pulling and transforming data
Strong understanding of first- and third-party data
Experience with Agile development methodology
Knowledge of database normalization
Advanced SQL skills
Understanding of data management principles and processes
Passion for data, analytics and pushing business innovation
Strategic thinker who loves technology and innovation
Excellent communication skills
Experience with Informatica IDMC (preferred)
Experience with GenAI and Natural Language Processing (NLP) frameworks (preferred)
Experience with real-time and streaming technology (e.g., Azure Event Hubs, Azure Functions, Kafka, Spark Streaming) (preferred)
Unix / Linux Experience (preferred)
Experience with CI/CD practices (preferred)
Previous experience with DevOps (preferred)
Experience with Python / Java programming languages (preferred)
Experience with one or more enterprise schedulers (preferred)
Experience with AI-based testing, experimentation and prototyping (preferred)
Performance Data Engineer providing data modeling expertise and engineering a cloud-based Data Lakehouse platform. Support Federal agency ETL applications with ongoing development and maintenance responsibilities.
Manager for Data & AI team at Valorem Reply focused on modern data platforms with Microsoft technologies. Leading technical direction and collaborating with clients on data governance frameworks.
Manager of Data & AI team focused on building modern data platforms using Microsoft technologies. Collaborating with clients and team members for successful delivery and governance.
Senior Staff Data Engineer at DeepL working on enterprise-wide data engineering standards and cloud solutions. Leading technical initiatives and mentoring engineers to support data capabilities across the organization.
Data Engineer contributing to advanced analytics and machine learning solutions in aviation at Boeing. Collaborating within a data science team to produce industry-leading insights and build cloud-based tools.
Data Engineer designing and maintaining scalable data solutions in GCP and Snowflake environments. Collaborating with clients and stakeholders to ensure data quality and functionality.
Data Architect leading data architecture and engineering projects within GCP and Snowflake environments for an AI consulting company. Collaborating with clients to define data strategies and guide implementation teams.
Senior Data Engineer optimizing scalable data pipelines at Matrix for the Brazilian energy market. Involved in ETL processes and data architecture design in a collaborative team environment.
Implementation Engineer guiding customers in data architecture and integration solutions at MotherDuck. Collaborating with teams and leading technical discussions to ensure customer success.
Data Engineer/Senior Data Engineer developing scalable ETL/ELT pipelines and architecting data systems at Manulife. Collaborating with data professionals to ensure data quality and compliance.