Senior Data Engineer leveraging big data and cloud expertise to build data pipelines at Alight. Ensuring reliability, governance, and operational excellence across data platforms.
Responsibilities
Design, build, and maintain high‑volume ETL/ELT pipelines across Hadoop (HDFS, Hive, Spark, Kafka) and AWS (Glue, EMR, Lambda, Step Functions, Redshift)
Develop distributed data processing solutions using PySpark, Spark SQL, and scalable serverless cloud patterns
Implement reusable data ingestion frameworks for batch (Sqoop, Hive, Spark) and streaming (Kafka, Kinesis)
Optimize data workflows using partitioning, bucketing, compression, and file formats (Parquet/ORC)
Understand hybrid data lake architectures using S3 + HDFS, ensuring governance consistency (Atlas, Ranger, Lake Formation)
Understand reporting requirements, perform data profiling, and create designs to meet them
Create data flow diagrams and perform data modelling
Orchestrate jobs using Airflow, Control‑M, Step Functions, or event-driven triggers
Understand auto-scaling, capacity planning, and performance tuning on EMR and Spark clusters
Ensure data is protected and compliant with regulatory standards
Work closely with business stakeholders to enable high‑quality datasets
Provide technical leadership in architecture decisions, code reviews, and best‑practice adoption, and guide peers and junior team members
Improve reliability, scalability, and performance through automation, autoscaling, and capacity planning
Own deployment, incident response, and post-incident reviews for production environments, troubleshooting Spark performance issues, job failures, and cluster bottlenecks
Data Engineer supporting data pipelines for business insights and GenAI applications at Assurity Trusted Solutions. Collaborating with teams on data workflows in a hybrid environment.
Data & AI Engineer at Decathlon Colombia designing and optimizing data ecosystems. Involves building data pipelines and deploying machine learning models.
Senior hands-on engineering lead at Electric Mind focusing on data platforms, AI, and analytics. Collaborating with teams on complex, mission-critical transformations in technology solutions.
AI Data Engineer (Azure + Copilot Studio) developing data pipelines and AI agent solutions. Involves working with Azure Data Factory, Copilot Studio, and data integration in a hybrid role.
Data Engineer developing scalable data architectures supporting analytics and ML at Porto Bank. Ensuring quality and efficient data integration while managing risks and standards.
Senior Data Engineer at Porto Bank developing and managing scalable data architectures and systems. Focused on integrating diverse data solutions while identifying and managing risks.
Data Engineer designing and architecting data for educational intelligence. Collaborating on data analytics and visualization in business decision-making processes.
Senior Data Engineer building data pipelines and analytics solutions in a healthcare-focused R&D environment. Collaborating with cross-functional teams to support AI and scientific advancement.
Lead Data Engineer building reliable data infrastructure at Eve, a legal technology company. Architecting data systems to enable data-driven decision making for law firms.
Senior Data Engineer designing and delivering scalable data pipelines and high-performance analytical solutions on Google Cloud. Enhancing data quality via KPIs and migrating data platforms to cloud-native ecosystems.