Senior Data Engineer designing complex data systems for Opea, focusing on cloud and big data technologies. Leading a team and optimizing data processing solutions.
Responsibilities
Design and implement complex, scalable data systems using technologies such as cloud computing, big data, streaming, machine learning and AI, defining strategies for data storage, processing and analysis. Incorporate Large Language Models (LLMs), Retrieval-Augmented Generation (RAG) and vector databases for intelligent data processing.
Lead and inspire the data engineering team by sharing knowledge, defining standards and best practices, and fostering the team's technical growth.
Identify and resolve complex problems related to data systems, architecture, performance and scalability, using monitoring tools, log analysis and code debugging.
Create and implement innovative solutions to optimize data systems, leveraging emerging technologies such as machine learning, data lakes, real-time data streaming and big data analytics.
Communicate technical solutions clearly and concisely to stakeholders, managers, developers and other teams, influencing strategic decisions and helping align the data strategy with company objectives.
Requirements
Deep proficiency in Python and data processing tools such as Apache Spark, Kafka, Flink or equivalents.
Knowledge of LLMs, fine-tuning AI models and using RAG to improve search and information retrieval. Proficiency with Amazon Bedrock and/or Azure OpenAI services.
Experience with data system architecture, big data (Hadoop, Hive, HBase), streaming (Kafka, Kinesis) and databases (SQL, NoSQL).
Advanced knowledge of data modeling, data processing and data visualization tools such as SQL, NoSQL, Tableau and Power BI.
Advanced experience building Data Lakes and Data Warehouses using tools like AWS Athena, Amazon Redshift, Amazon S3, AWS Glue, Airbyte, dbt and PostgreSQL.
Deep understanding of machine learning concepts and applying ML techniques within data systems.
Bachelor's degree in Computer Science, Statistics, Mathematics or a related field.
Technical English for reading, writing and professional communication.
Data Governance Engineer in Fintech developing a formal cyber data governance framework. Collaborating with cyber security, analytics, and platform engineering teams on metadata and lineage capabilities.
Junior Data Engineer role at Allegro, focusing on developing ETL/ELT pipelines and processing large datasets. Collaborate with cross-functional teams for data quality and reporting.
Data Engineer at Concept Reply developing innovative data-driven solutions in IoT. Collaborating with teams to unlock the potential of data and cloud computing.
Data Engineer creating and managing data pipelines for critical data solutions at S&P Global. Collaborating on enterprise-scale data processing in a supportive, innovative environment.
Data Engineer supporting and evolving a data environment during a cloud migration. Maintain and optimize existing databases while designing modern data solutions through cross-functional collaboration.
Senior Data Engineer responsible for data pipeline projects at Suprema Gaming. Focus on batch and streaming data solutions while collaborating with business teams.
Senior data leader managing the enterprise data architecture at Breakthru Beverage. Leading high-performing teams in data engineering and defining modern data strategies.
Data Engineer at Equinix implementing data architecture solutions for scalability and analytics. Collaborating with teams to design data pipelines and maintain data models for business objectives.
Data Warehouse Architect developing and optimizing robust data warehouse environments on SAP BW/4HANA. Critical for enabling advanced analytics and reporting across the organization.
Data Engineering Manager leading a new Data Engineering team in Bengaluru. Shaping the design and scaling of core data engineering practices across the organization.