Apache Spark Specialist responsible for architecting and managing Spark environments on Nebul's AI cloud. Focus on performance, security, and solution innovation within a hybrid work environment.
Responsibilities
Architect, deploy, and operate scalable Apache Spark environments on Nebul’s sovereign AI cloud
Design and optimize Spark workloads for GPU-accelerated and distributed performance
Define and implement best practices for security, monitoring, governance, and data protection
Partner closely with product, engineering, and customer teams to shape our managed Spark offering
Evaluate and integrate complementary technologies (e.g., Delta Lake, Lakehouse components, tooling)
Support early customer pilots and translate feedback into roadmap improvements
Develop automation and CI/CD deployment models to ensure reliability, repeatability, and efficiency
Document architectures, operational procedures, and performance benchmarks
Requirements
4–7 years of experience working with Apache Spark in production environments
Strong deep-dive knowledge of Spark internals: performance tuning, partition strategies, caching, and shuffle management
Hands-on deployment experience in Kubernetes, cloud infrastructure, or on-prem clusters
Solid understanding of distributed data platforms (e.g., Databricks, EMR, Hadoop, Lakehouse architectures)
Strong scripting and automation skills (Python / Scala preferred)
Ability to translate client needs into technical architectures and operational models
Familiarity with cloud-security principles and infrastructure-as-code practices
Valid EU work permit (no sponsorship currently available)
As a Principal Data Architect at Solstice, lead the design and implementation of data architecture solutions. Ensure data integrity, security, and accessibility to meet strategic organizational goals.
Data Platform Specialist overseeing data workflows and enhancing data quality for Stackgini's AI - driven IT solutions. Collaborating with teams to drive improvements and stakeholder support.
Data Engineer designing data pipelines in Python for a major railway industry client. Collaborate with Data Scientists and ensure code quality with agile methodologies.
Senior Data Engineer responsible for building and optimizing data pipelines for banking analytics initiatives. Collaborating with data teams to ensure data quality and readiness for enterprise use.
Senior Data Engineer developing scalable data solutions on Databricks for analytics and operational workloads. Collaborating with cross - functional teams to modernize the data ecosystem.
Data Engineer focused on analytics and data pipeline development for network optimisation. Collaborating with teams to deliver high - quality data solutions with Python and SQL.
Senior Product Manager defining platform capabilities for Data Cloud in Salesforce. Collaborating with R&D teams while shaping product strategy for Data 360 integration.
Senior Data Engineer at Goodwin enhancing data platforms and fostering data - driven culture across teams. Collaborating with IT and Finance on technology solutions and data governance practices.
Director, Data Platform Design and Strategy at MedImpact leading data platform and AI innovations to enhance healthcare services. Overseeing enterprise projects and managing teams to meet strategic goals.
Data Engineer delivering AI - and data - driven solutions for Honeywell’s industrial customers. Architecting and implementing scalable data pipelines and platforms focused on IoT and real - time data processing.