Data Engineer to build and support data pipelines and models for analytics and machine learning. Working with ADI's Product Data Science team to leverage extensive data and automate processes.
Responsibilities
Designing and implementing data pipelines to efficiently extract, transform, and load (ETL) from various data sources into a central repository or data warehouse
Deploy and maintain scalable production pipelines for AI models, ensuring seamless integration with existing systems, continuous performance monitoring, and iterative improvements based on real-world feedback
Working on transformations and contributing to building the dimensional model of ADI product data
Building and maintaining scalable and robust data infrastructure, including data lakes, and distributed computing systems that are equipped to handle large volumes of data
Implement, and build data solutions using Spark, Python, Databricks, and the AWS ecosystem (S3, Redshift, EMR, Athena, Glue).
Implementing data validation and cleaning processes to ensure data quality and monitoring data pipelines for errors or anomalies
Optimizing data storage and retrieval processes to enhance performance and reduce latency, particularly through techniques such as indexing, partitioning, and caching
Implementing data security and privacy measures, including data encryption, access controls, and compliance with data governance policies and regulations
Collaborating with data scientists and data analysts to understand metrics and data requirements and to provide them with access to the necessary data sets and data systems
Adapt, learn, grow, and teach to deliver world-class products and platforms
Bring a curious mind to current business problems and opportunities, expanding your understanding of all products in the ADI ecosystem
Requirements
Strong programming skills and in Python
Strong expertise in database and query languages like SQL, NoSQL
Knowledge of data engineer skills such as data management, data visualization, and familiarity with data architecture
Knowledge and hands-on experience in building scalable data platforms and reliable data pipelines using technologies such as Spark, Databricks, AWS Kinesis, and/or Kafka
Solid understanding of MLOps principles and model versioning
Experience working with large volumes of metadata and schemas
Hands-on experience in ETL/ELT and data integration
Data warehousing knowledge, including data modeling, data security, and data governance understanding
Understanding how the role complements others working with machine learning, data science, algorithms, business intelligence
Benefits
Health package
Insurance in case of serious illness, surgical intervention, professional illness, and insurance from the consequences of an accident
Flexible working hours
English classes during working hours
Employee referral bonus program
Corporate social events and team buildings
Food and drinks: Free use of coffee machines, free fruit and snacks
Data Engineer/Analyst maintaining and improving data infrastructure for Braiins. Collaborating with technical and business teams to ensure reliable data flows and insights.
Medior Data Engineer handling Azure migrations for a major urban mobility client. Focused on data pipeline development and ensuring platform reliability with cutting - edge technologies.
Developing ML and computer vision solutions for cutting - edge autonomous vehicle dataset pipeline at Mobileye. Collaborating across teams for data curation and advanced perception algorithms.
Data Migration Lead in a hybrid role managing data migration for a major transformation programme in the media sector. Collaborating with various teams to ensure data integrity and successful migration.
Consultant ML & DataOps at Smile integrating data science projects for major clients. Designing MLOps solutions and enhancing data governance in a collaborative environment.
Data Engineer developing and maintaining data pipelines for Coolbet’s analytical services. Working within an Agile framework to ensure data reliability and efficiency.
API Data Engineer developing innovative data - driven solutions and advancing data architecture for AI Control Tower. Building and integrating APIs and data pipelines to support organizational needs.
Journeyman Data Architect supporting Leidos' enterprise data and analytics program for the Department of War. Collaborating on solutions for data architecture, cloud environments, and governance.
Senior Software Engineer developing backend services and data infrastructure for integrated products at Booz Allen. Collaborating with a small elite team to deliver reliable and scalable services.
AWS Streaming Data Engineer developing software and systems in a fast, agile environment. Utilizing experience with real - time data ingestion and processing systems across distributed environments.