Senior Data Engineer designing complex data systems for Opea, focusing on cloud and big data technologies. Leading a team and optimizing data processing solutions.
Responsibilities
Design and implement complex, scalable data systems using technologies such as cloud computing, big data, streaming, machine learning and AI, defining strategies for data storage, processing and analysis. Incorporate Large Language Models (LLMs), Retrieval-Augmented Generation (RAG) and vector databases for intelligent data processing.
Lead and inspire the data engineering team by sharing knowledge, defining standards and best practices, and fostering the team's technical growth.
Identify and resolve complex problems related to data systems, architecture, performance and scalability, using monitoring tools, log analysis and code debugging.
Create and implement innovative solutions to optimize data systems, leveraging emerging technologies such as machine learning, data lakes, real-time data streaming and big data analytics.
Communicate technical solutions clearly and concisely to stakeholders, managers, developers and other teams, influencing strategic decisions and helping align the data strategy with company objectives.
Requirements
Deep proficiency in Python and data processing tools such as Apache Spark, Kafka, Flink or equivalents.
Knowledge of LLMs, fine-tuning AI models and using RAG to improve search and information retrieval. Proficiency with Amazon Bedrock and/or Azure OpenAI services.
Experience with data system architecture, big data (Hadoop, Hive, HBase), streaming (Kafka, Kinesis) and databases (SQL, NoSQL).
Advanced knowledge of data modeling, data processing and data visualization, using tools such as SQL and NoSQL databases, Tableau and Power BI.
Advanced experience building Data Lakes and Data Warehouses using tools like AWS Athena, Amazon Redshift, Amazon S3, AWS Glue, Airbyte, dbt and PostgreSQL.
Deep understanding of machine learning concepts and applying ML techniques within data systems.
Bachelor's degree in Computer Science, Statistics, Mathematics or a related field.
Technical English for reading, writing and professional communication.
Senior Software Engineer contributing to Workday's AI/MLOps cloud ops platform. Involves data ingestion, computation, and generation of curated data sets with modern technologies.
Data Engineer role at Citi designing and maintaining scalable data solutions. Seeking a skilled professional with extensive data engineering experience and expertise in various technologies.
Data Engineer responsible for designing and maintaining an enterprise data warehouse for various projects. Collaborating with stakeholders to ensure efficient data flow and integration.
Data Engineer developing and evolving the BI platform for Thomson Reuters in India. Building architectural solutions and collaborating across all aspects of the development lifecycle.
Data Engineer responsible for building and maintaining AWS Lakehouse infrastructure for trade contractors at Remarcable. Focused on clean data architecture and AI/ML data infrastructure.
Data Engineer/Analyst maintaining and improving data infrastructure for Braiins. Collaborating with technical and business teams to ensure reliable data flows and insights.
Medior Data Engineer handling Azure migrations for a major urban mobility client. Focused on data pipeline development and ensuring platform reliability with cutting-edge technologies.
Developing ML and computer vision solutions for a cutting-edge autonomous vehicle dataset pipeline at Mobileye. Collaborating across teams for data curation and advanced perception algorithms.
Data Migration Lead in a hybrid role managing data migration for a major transformation programme in the media sector. Collaborating with various teams to ensure data integrity and successful migration.