Data Engineer leading data foundation architecture and optimization for a Kenyan startup. Constructing data pipelines that fuel machine learning models and internal analytics.
Responsibilities
Architect and sustain scalable ETL workflows, guaranteeing consistency and accuracy across diverse data origins.
Refine and optimize data models and database structures specifically tailored for reporting and analytics.
Enforce industry best practices regarding data warehousing and storage methodologies.
Fine-tune data systems to handle the demands of both real-time streams and batch processing.
Manage the cloud data environment, utilizing platforms such as AWS, Azure, or GCP.
Coordinate with software engineers to embed data solutions directly into our product suite.
Design robust processes for ingesting both structured and unstructured datasets.
Script automated quality checks and deploy monitoring instrumentation to instantly detect data anomalies.
Build APIs and services that ensure seamless data interoperability between systems.
Continuously monitor pipeline health, troubleshooting bottlenecks to maintain an uninterrupted data flow.
Embed data governance and security protocols that meet rigorous industry standards.
Collaborate with data scientists and analysts to maximize the usability and accessibility of our data assets.
Maintain comprehensive documentation covering schemas, transformations, and pipeline architecture.
Keep a pulse on emerging trends in cloud tech, analytics, and data engineering to drive continuous improvement.
Requirements
A minimum of 3 years of professional experience in Data Engineering or a similar technical role.
Bachelor’s or Master’s degree in Engineering, Computer Science, Data Science, or a relevant discipline.
Expert-level command of SQL and management systems like PostgreSQL or MySQL.
Hands-on proficiency with pipeline tools such as Luigi, DBT, or Apache Airflow.
Practical experience with heavy-lifting technologies like Hadoop, Spark, or Kafka.
Proven skills with cloud data stacks, specifically Google BigQuery, AWS Redshift, or Azure Data Factory.
Strong programming logic in Java, Scala, or Python for data processing tasks.
Familiarity with data integration frameworks and API utilization.
Understanding of security best practices and compliance frameworks.
Cloud Data Engineer implementing tailored solutions for Volkswagen Group data processing. Building ETL/ELT pipelines while collaborating with technical experts.
Data Engineer designing and optimizing data pipelines using Databricks and Google Cloud Platform. Collaborating with analysts and scientists to deliver high - quality data products.
Data Engineer responsible for building scalable data infrastructure that supports data - driven decisions. Collaborating with team to maintain systems and unlock data value for organizations.
Associate Data Engineer supporting privacy engineering controls and executing privacy impact assessments in a financial services company. Collaborating across business units to ensure alignment with privacy regulations.
Data Engineer at CVS Health optimizing data pipelines and analytical models. Driving data - driven decisions with healthcare data for improved business outcomes.
Senior Data Engineer at CVS Health developing robust data pipelines for healthcare data. Collaborating with teams to provide actionable insights and integrate them with consumer touchpoints.
Senior Data Engineer supporting AI - enabled financial compliance initiative with data pipelines and ingestion processes. Collaborating with diverse teams in a mission - critical regulated environment.
Data Architect leading the definition and construction of cloud data architecture for Kyndryl. Participating in significant technological modernization initiatives, focusing on Google Cloud Platform.
Senior Data Engineer driving data intelligence requirements and scalable data solutions for a global consulting firm. Collaborating across functions to enhance Microsoft architecture and analytics capabilities.
Experienced AI Engineer designing and building production - grade agentic AI systems using generative AI and large language models. Collaborating with data engineers, data scientists in a tech - driven company.