Data Engineer focused on developing and sustaining Machine Learning solutions for ICBC's data needs. Collaborating with Data Scientists and Statistical Analysts on data-driven projects.
Responsibilities
Understanding Data Science, Machine Learning, Performance & Evaluative Analytics model requirements, working closely with Data Scientists & Statistical Analysts, supporting them with their data and Machine Learning operational needs.
Operationalizing Data Science Model into Machine Learning pipelines, applying coding optimization of the data science models, conducting model training and re-training, deploying the models and sustaining them in Production.
Responding to data requests, data discovery and data profiling to support various data science, evaluative and machine learning solutions and projects, reviewing and clarifying data requirements, ensuring the data artifacts are acceptable within policy and privacy protocols.
Providing subject matter & data expertise to the Strategic Analytics, Actuarial and Regulatory Affairs departs as well as ICBC divisional clients on data sources, reporting workflows, business process, and the appropriate tools with which to analyze their data.
Participating with corporate data user teams, developing data science model validation and test plans, performing user acceptance testing, and providing support to data scientists, evaluative & performance metrics analysts and sustainment of their end products.
Conducting analysis for moderate to complex strategic solutions and POCs, defining data fields and determining data availability, developing information layout, format and interactivity. Presenting findings and providing clarification.
Requirements
Proven work-based experience coding using Python Language and PySpark data framework will be required.
Experience working with ML libraries & frameworks including Scikit-Learn for traditional ML, TensorFlow and PyTorch for deep learning.
Proficiency in Data Science Stack such as NumPy, PySpark and Pandas for data manipulation.
Technical knowledge in cleaning, transforming and preparing un-curated data including handling of values and feature scaling.
Exposure to Machine Learning Operations (MLOps) supporting Model development, skills with Docker for containerization, API development and using cloud platforms.
Knowledge & experience with Machine Learning Algorithms and techniques
Experience or exposure to working with pre-trained models such as Large Language Models (LLM), using Retrieval-Augmented Generation (RAG) and working with HuggingFace pre-trained models
Experience with processing structured and unstructured data.
Intermediate to Advance experience of writing SQL Queries & working with NoSQL Databases
Knowledge of experiment tracking & Management using tools like MLFlow, Data Version Control (DVC), managing model versions, parameters and results.
Pipeline orchestration using Apache AirFlow to automate training, testing and deployment workflows.
Setting up automated pipelines for Continuous integration and continuous deployment (CI/CD) using GitLab.
Excellent interpersonal, verbal and written communication skills to work with customers.
Strong data quality management process understanding, data analysis and data profiling.
Ability to apply critical thinking skills to troubleshoot and perform root cause analysis on technical problems and Machine Learning model deployments.
Understanding of Agile Methodologies.
Experience with reporting and visualization tools, such as Tableau, Jupiter or other reporting tools would be an asset.
Senior Data Engineer designing and optimizing the core data layer for Degreed's upskilling platform. Collaborating with internal teams and clients to ensure access to reliable and performant data.
Big Data Engineer handling both internal and external stakeholders for data processing related to fraud and compliance at ING. Managing and transforming high volume data and working closely with project teams.
Big Data Engineer role developing state - of - the - art solutions for financial crime prevention at ING. Collaborating with teams to manage high volume data and deliver effective technical solutions.
Senior Data Engineer focusing on Databricks in a market - leading company. Designing data architectures and optimizing data workflows in a collaborative environment.
Senior Data Architect Lead at Leidos developing enterprise data and analytics solutions for the Department of War. Collaborating with teams to implement data strategies and governance frameworks.
Senior Data Engineer Lead at Leidos, enhancing enterprise data solutions for DoD organizations. Collaborating with teams to deliver scalable data analytics and AI capabilities.
Data Engineer building advanced technology solutions for clients. Organizing and making disparate data meaningful to impact missions in fraud detection, cancer research, and national intelligence.
Staff Data Engineer on Real World Evidence team driving large - scale data initiatives. Collaborating with cross - functional teams to optimize data pipelines and improve healthcare outcomes.
Data Engineer Trainee developing expertise in Python, SQL, and modern cloud technologies. Innovating in a dynamic team environment with a focus on practical application.
AWS Data Engineer designing and maintaining scalable cloud - based data platforms within Banking and Financial Services at EXL. Collaborating across teams to enable data - driven decision - making through cloud technologies.