Senior Observability Engineer managing the observability and performance ecosystem at Trading 212. Involved in automating, monitoring, and optimizing large-scale distributed systems.
Responsibilities
Own and evolve Trading 212’s observability and performance ecosystem across cloud and on-prem Kubernetes environments.
Design, automate, and optimize observability infrastructure (Prometheus, CloudWatch, Elasticsearch, Kafka, etc.) using IaC and GitOps.
Build Grafana dashboards and implement a smart alerting strategy to surface actionable insights.
Monitor and analyze system performance, identify bottlenecks, and drive improvements in reliability and cost-efficiency.
Collaborate with product, QA, and engineering teams to embed observability best practices.
Maintain clear documentation and mentor engineers, fostering a culture of data-driven performance.
Plan and test Multi-AZ/Region DR and resilience scenarios.
Requirements
5+ years of experience in DevOps, SRE, or Systems Engineering, focusing on observability for large-scale distributed systems.
Proven experience deploying and maintaining observability tools.
Metrics & Monitoring: Strong proficiency with Prometheus and Grafana; experience with AWS CloudWatch.
Log Management: Deep knowledge of the ELK stack (Elasticsearch, Logstash, Kibana) and Fluent Bit.
Cloud & Containers: Hands-on experience with AWS, Docker, and Kubernetes.
Automation & IaC: Skilled in Python, Go, or Bash for scripting, and proficient with Terraform (Ansible/Puppet a plus).
Systems Knowledge: Strong grasp of distributed systems, networking, and Linux/Unix internals.
Problem-Solving: Analytical, detail-oriented, and methodical in root cause analysis and troubleshooting.
Benefits
Challenges that will help you grow and realize your potential really fast
Opportunity to make a big impact: you will build innovative services used by millions of investors to build wealth
Work with smart, spirited, helpful, high-performing colleagues with a common goal
An environment where nothing is set in stone
Appreciation for your talent and ideas
Generous remuneration package including annual bonuses
Excellent social benefits package, including private health insurance and sports card