Data Engineer transforming data into easily analyzable formats at Sprout Solutions. Designing systems and frameworks to support data science initiatives in the HR tech space.
Responsibilities
Transforming data into a format that can be easily analyzed
Designing and developing Sprout’s data system
Processing and extracting data features
Deploying the data science team’s machine learning models
Supporting software engineers, architects and data scientists on data initiatives
Ensuring optimal data delivery architecture is consistent throughout ongoing projects
Building, optimizing and maintaining conceptual, logical and physical database models
Developing database solutions to store and retrieve information
Assembling datasets that meet functional/non-functional business requirements
Building the infrastructure for optimal ETL from a wide variety of data sources
Monitoring data integrity and adopting appropriate tools
Improving system performance
Optimizing or re-designing data architecture to support Sprout’s next generation of products and data initiatives
Designing, developing, testing and deploying web service APIs
Working with Data Scientists to identify future needs and requirements
Deploying models and algorithms developed by the Data Science team
Requirements
Extensive knowledge on databases (SQL and/or NoSQL) and data engineering best practices
Expertise in SQL and other programming languages(e.g. Python, Java, Scala, shell scripting etc.)
Experience with data modeling (data warehouse, data lake) and designing data storage schemes
Familiarity with data engineering and ETL software tools, hadoop, spark, talend, SSAS, etc. is also helpful
Experience building and optimizing data pipelines, architecture and datasets
Experience in software development
Experience with Azure
A successful history of manipulating, processing and extracting value from large disconnected datasets
Familiarity with data visualization tools (e.g. PowerBI)
Familiarity with agile development as a project management methodology is a plus
Strong problem-solving and analytical skills
Must be self-motivated and comfortable supporting the data needs of multiple teams, systems and products
A good team player and willingness to learn
Strong innate desire and proven track record of continuous self-improvement (in learning, job expansion, extracurricular activities, etc.)
Data Engineer/Analyst maintaining and improving data infrastructure for Braiins. Collaborating with technical and business teams to ensure reliable data flows and insights.
Medior Data Engineer handling Azure migrations for a major urban mobility client. Focused on data pipeline development and ensuring platform reliability with cutting - edge technologies.
Developing ML and computer vision solutions for cutting - edge autonomous vehicle dataset pipeline at Mobileye. Collaborating across teams for data curation and advanced perception algorithms.
Data Migration Lead in a hybrid role managing data migration for a major transformation programme in the media sector. Collaborating with various teams to ensure data integrity and successful migration.
Consultant ML & DataOps at Smile integrating data science projects for major clients. Designing MLOps solutions and enhancing data governance in a collaborative environment.
Data Engineer developing and maintaining data pipelines for Coolbet’s analytical services. Working within an Agile framework to ensure data reliability and efficiency.
API Data Engineer developing innovative data - driven solutions and advancing data architecture for AI Control Tower. Building and integrating APIs and data pipelines to support organizational needs.
Journeyman Data Architect supporting Leidos' enterprise data and analytics program for the Department of War. Collaborating on solutions for data architecture, cloud environments, and governance.
Senior Software Engineer developing backend services and data infrastructure for integrated products at Booz Allen. Collaborating with a small elite team to deliver reliable and scalable services.
AWS Streaming Data Engineer developing software and systems in a fast, agile environment. Utilizing experience with real - time data ingestion and processing systems across distributed environments.