Apache Spark Specialist responsible for architecting and managing Spark environments on Nebul's AI cloud. Focus on performance, security, and solution innovation within a hybrid work environment.
Responsibilities
Architect, deploy, and operate scalable Apache Spark environments on Nebul’s sovereign AI cloud
Design and optimize Spark workloads for GPU-accelerated and distributed performance
Define and implement best practices for security, monitoring, governance, and data protection
Partner closely with product, engineering, and customer teams to shape our managed Spark offering
Evaluate and integrate complementary technologies (e.g., Delta Lake, Lakehouse components, tooling)
Support early customer pilots and translate feedback into roadmap improvements
Develop automation and CI/CD deployment models to ensure reliability, repeatability, and efficiency
Document architectures, operational procedures, and performance benchmarks
Requirements
4–7 years of experience working with Apache Spark in production environments
Strong deep-dive knowledge of Spark internals: performance tuning, partition strategies, caching, and shuffle management
Hands-on deployment experience in Kubernetes, cloud infrastructure, or on-prem clusters
Solid understanding of distributed data platforms (e.g., Databricks, EMR, Hadoop, Lakehouse architectures)
Strong scripting and automation skills (Python / Scala preferred)
Ability to translate client needs into technical architectures and operational models
Familiarity with cloud-security principles and infrastructure-as-code practices
Valid EU work permit (no sponsorship currently available)
Senior Data Engineer supporting AI - enabled financial compliance initiative with data pipelines and ingestion processes. Collaborating with diverse teams in a mission - critical regulated environment.
Data Architect leading the definition and construction of cloud data architecture for Kyndryl. Participating in significant technological modernization initiatives, focusing on Google Cloud Platform.
Senior Data Engineer driving data intelligence requirements and scalable data solutions for a global consulting firm. Collaborating across functions to enhance Microsoft architecture and analytics capabilities.
Experienced AI Engineer designing and building production - grade agentic AI systems using generative AI and large language models. Collaborating with data engineers, data scientists in a tech - driven company.
Intermediate Data Engineer designing and building data pipelines for travel industry data management. Collaborating across teams to ensure reliable data for analytics and reporting.
Data Engineer managing and organizing datasets for AI models at Walaris, developing AI - driven autonomous systems for defense and security applications.
Data Engineer designing and maintaining data pipelines at Black Semiconductor. Collaborating with process, equipment, and IT teams to support manufacturing analytics and decision - making.
Junior Data Engineer role focusing on Business Intelligence and Big Data at Avanade. Collaborating on data analysis and SQL queries in a supportive learning environment.
GCP Data Engineer designing and developing data processing modules for Ki, an algorithmic insurance carrier. Working closely with multiple teams to optimize data pipelines and reporting.
Data Engineer at Securian Financial optimizing scalable data pipelines for AI and advanced analytics. Collaborating with teams to deliver secure and accessible data solutions.