Site Reliability Engineer at Trainline | Hybrid Hired

About the role

Site Reliability Engineer contributing to platform reliability at Trainline, Europe's leading rail ticketing platform. Collaborating with product engineering to ensure operational readiness and incident response.

Responsibilities

Developing an understanding of system architecture, dependencies, and failure modes across the Trainline platform
Participating in production incident response, supporting investigation, mitigation, communication, and coordinated service restoration
Contributing to post-incident reviews and follow-up actions to improve reliability, scalability, and resilience
Taking part in the SRE on-call rotation
Designing, building, and maintaining observability using metrics, logs, events, and traces to support effective detection and diagnosis
Improving monitoring and alerting by aligning signals to business and customer impact, reducing noise and improving mean time to detection (MTTD)
Ensuring relevant operational data is surfaced quickly and clearly during live incidents
Making informed tooling and technology choices using SRE principles, balancing team and business needs
Supporting AWS-hosted infrastructure and shared platform services using infrastructure-as-code and CI/CD tooling
Collaborating with product engineering teams to ensure services are operationally ready and deployed safely
Advising on reliability and resilience practices
Writing and maintaining reliable, well-structured code and scripts to support reliability and observability goals
Prioritising work effectively and collaborating using agile processes to deliver against team and business goals

Requirements

Experience of SRE concepts such as SLI, SLO and error budgets.
Hands-on experience with observability tooling such as New Relic, Elastic (ELK Stack), Influx, Grafana or similar
Experience working with cloud providers (preferably AWS).
Experience troubleshooting Linux operating systems.
Experience of scripting in at least one language (preferably Python)
Understanding of load balancing and reverse proxy concepts, upstream config concepts, upstream health checks, worker & data flow concepts.
Application architecture concepts (threading, queuing, readiness checks, health checks, circuit breakers, timeouts, exponential backoff, throttling).
Experience building, maintaining and evolving time series data, retention, cardinality, deviation, moving averages and other functions.
Experience with build, deployment & configuration management tooling such as GitHub Actions and Terraform.

Benefits

private healthcare & dental insurance
generous work from abroad policy
2-for-1 share purchase plans
EV Scheme to reduce carbon emissions
extra festive time off
excellent family-friendly benefits
clear career paths
transparent pay bands
personal learning budgets
regular learning days

Similar roles

Browse all Devops Engineer jobs

4 minutes ago

LE

Data Transport Infrastructure DevOps Engineer

Leidos

Data Transport Infrastructure DevOps Engineer at Leidos modernizing global - scale multi - cloud environments for USAF missions. Involves developing cloud - native solutions and ensuring security best practices.

Hybrid Role

United States Devops Engineer

$87,100 - $157,450 per year

2 hours ago

AG

DevOps Engineer – mwd

Allguth GmbH

DevOps Engineer responsible for building and optimizing AWS - based infrastructure and backend systems at Allguth GmbH. Part of a team focused on innovative mobility solutions in Munich region.

Hybrid Role

Gräfelfing Germany Devops Engineer

7 hours ago

A[

Senior DevOps Engineer

Alexander Thamm [at]

(Senior) DevOps Engineer specializing in ML solutions implementation and management in Germany. Focused on CI/CD pipelines, automation, and cloud services.

Hybrid Role

Berlin Germany Devops Engineer

8 hours ago

PG

DevSecOps Specialist

Periferia IT Group

Specialist DevSecOps joining Periferia IT Group, a leader in digital transformation. Work in a dynamic environment with continuous learning and professional development opportunities.

Hybrid Role

Bogotá Colombia Devops Engineer

9 hours ago

ZI

Senior Platform Engineer, DevOps, Terraform, Ansible

Zinkworks

Join Zinkworks as a Senior Platform Engineer designing scalable IaC - driven cloud platforms for a large - scale enterprise contact centre. Focused on automation, reliability, and platform ownership in a hybrid work environment.

Hybrid Role

Athlone Ireland Devops Engineer

9 hours ago

SE

Asset Reliability Engineer – Predictive Maintenance

Sensorfact

Asset Reliability Engineer providing maintenance advice and service innovations. Join Sensorfact, the leading smart monitoring platform, to modernize the industrial sector.

Hybrid Role

Utrecht Netherlands Devops Engineer

10 hours ago

AS

Cloud Operations Engineer

Avalon Healthcare Solutions

Cloud Operations Engineer responsible for securing AWS infrastructure at Avalon Healthcare Solutions. Collaborating on SRE best practices and ensuring system reliability and performance.

Hybrid Role

Tampa United States Devops Engineer

10 hours ago

FC

Design Release Engineer – Seat Complete

Ford Motor Company

Design Release Engineer designing, developing, and releasing seat systems for Ford vehicles. Ensuring engineering deliverables meet quality, cost, and timing targets while collaborating with cross - functional teams.

Hybrid Role

Dearborn United States Devops Engineer

$63,480 - $121,440 per year

11 hours ago

SS

DevOps II

Safe Software

DevOps Engineer responsible for maintaining FME infrastructure and development pipelines at Safe Software. Collaborate in an agile team focused on constant improvement and automation.

Hybrid Role

British Columbia Canada Devops Engineer

CA$81,900 - CA$91,200 per year

14 hours ago

GE

Lead Site Reliability Engineer

GetGround

Lead Site Reliability Engineer responsible for GCP cloud infrastructure and SRE practices. Join a fintech platform making real estate investment accessible globally.

Hybrid Role

London United Kingdom Devops Engineer