DevOps Engineer for Flutter's data streaming platforms at Betfair Romania Development. Collaborating to enhance data streaming technologies and improve developer experience in a global team.
Responsibilities
Design, build, and operate core components of our secure, performant, and cost-effective data streaming platforms.
Develop and maintain platform capabilities and interfaces to enhance developer experience and self-service.
Drive automation across the platform using Infrastructure as Code (IaC – e.g., Helm, Crossplane, Terraform) and configuration management tooling.
Implement and manage robust CI/CD pipelines (e.g., ArgoCD, GitHub Actions) promoting GitOps practices for managing the platform.
Manage and enhance our data streaming platforms.
Maintain source code and artifact repositories.
Write and review platform code, offering constructive feedback to ensure code quality.
Enable and support engineering teams with the adoption of data streaming services.
Write and maintain appropriate technical documentation.
Define, document, support, and approve Low-Level Designs (LLDs) for Flutter data streaming services.
Support architecture and function leads with an evolving technology roadmap and strategy.
Act as a trusted SME offering advice and knowledge sharing to the broader technology team and internal customers.
Create and drive adoption of standard operating procedures, policies, and runbooks.
Liaise with other group functions, building and maintaining relationships, and recognising opportunities for strategic collaboration.
Liaise with third-party vendors and partners.
Practice sustainable incident response and blameless postmortems.
On-call is likely to be introduced in the future.
Requirements
Proven experience building and operating data streaming platforms, with a focus on our core technologies: Apache Kafka, Cassandra, and Apache Pulsar.
Proven experience building and operating data streaming platforms on AWS, with a focus on Kafka, Keyspaces, EMR, Athena, Glue, QuickSight, and S3.
An understanding of and hands-on experience with Kubernetes (concepts, architecture, operations).
Proficiency with Infrastructure as Code (IaC) tools (e.g., Helm, Crossplane, Terraform).
Experience implementing and managing GitOps workflows and tooling (e.g., ArgoCD).
Solid understanding and practical experience with CI/CD principles and tooling (e.g., Argo Workflows, GitHub Actions, Jenkins).
DevOps Engineer in the Security Data and AI Lab at Lloyds Banking Group driving data and cloud infrastructure's influence on product operations and customer service improvements.
Senior Platform DevOps Engineer at Code Metal designing and implementing cloud and hybrid infrastructure to support customer deployments and internal platforms. Collaborating with software and security teams for reliable delivery.
DevOps Platform Intern managing cloud infrastructure and deployment pipelines for AI-native software delivery. Partnering with a Product Development Intern to set up and manage containerized applications on Azure Kubernetes Service.
UNIX DevOps Engineer managing AIX and Solaris server operations for a Swiss telecom company. Focusing on automation, optimization, and 24/7 monitoring responsibilities across multiple locations.
Staff Site Reliability Engineer designing and building backend services for NordVPN. High-ownership role focusing on system architecture and operational excellence.
Senior Site Reliability Engineer managing VPN and DNS services to ensure performance and reliability. Collaborating with application teams to maintain security and quality across global infrastructure operations.
Senior Site Reliability Engineer managing globally distributed VPN and DNS services. Optimizing service performance and handling security posture in a hybrid work environment.
Senior Site Reliability Engineer focused on observability for NordVPN. Designing monitoring systems and collaborating with data teams on anomaly detection.
Senior Site Reliability Engineer ensuring content accessibility across global edge infrastructure for NordVPN. Designing and troubleshooting systems critical to internet traffic management.
Staff Site Reliability Engineer designing tools for Threat Protection Pro and NordLynx protocol. Working on globally distributed backend services for NordVPN with a focus on security and privacy.