Senior Site Reliability Engineer – Observability at Dimensional Fund Advisors | Hybrid Hired

About the role

Senior Site Reliability Engineer for observability platforms at Dimensional, ensuring reliability and scaling the infrastructure. Collaborating with teams on operations and engineering projects.

Responsibilities

Serve as a primary escalation point for production support involving the ELK Stack, Grafana, and New Relic
Own platform health, capacity planning, and performance tuning for on-premises observability infrastructure – including Elasticsearch cluster management, index lifecycle policies, and retention strategies
Monitor and maintain SLOs for the observability platforms, ensuring the tools engineers depend on are highly available and performant
Support engineering teams in onboarding to observability platforms – helping teams instrument their applications, build dashboards, and define meaningful alerts
Manage patching, upgrades, and configuration management across the observability stack
Collaborate with security to harden platform configurations and manage software vulnerabilities
Contribute to on-call rotations and maintain runbooks and escalation procedures
Design and build tooling/automation to reduce toil and improve the experience for teams using observability platforms
Lead or contribute to platform modernization initiatives – e.g., improving ingestion pipelines, scaling platform capacity, standardizing Grafana dashboard and alerting patterns, or evaluating new capabilities within the existing stack
Develop and maintain infrastructure-as-code (Terraform, Helm, Ansible, etc.) for platform components
Build and enforce standards around logging metrics and alerting that help engineering teams adopt observability best practices at scale
Participate in design reviews and contribute to the overall platform roadmap

Requirements

Bachelor’s degree in a technical field or equivalent practical experience
5+ years of experience in SRE, DevOps, or platform engineering roles
Deep hands-on experience with the ELK Stack – Elasticsearch cluster operations, Logstash pipeline development, Kibana, and index lifecycle management
Strong experience with Grafana, including data source integrations, dashboard design, and alerting
Solid understanding of observability principles
Experience operating on-premises infrastructure, including capacity planning, server management, and the operational tradeoffs with managed cloud services
Proficiency in Python for automation and tooling; familiarity with shell scripting
Strong Linux systems knowledge and comfort working with configuration management tools (e.g., Ansible, Chef, Puppet, etc.)
Demonstrated ability to drive incidents to resolution and communicate clearly under pressure
A bias toward automation and a low tolerance for repetitive manual work

Benefits

comprehensive benefits
educational initiatives
special celebrations of our history, culture, and growth

Similar roles

Browse all Devops Engineer jobs

5 hours ago

VG

DevOps Engineer, German required – GCP

ventx GmbH

DevOps Engineer at ventx GmbH based in München, operating in a hybrid setup. Responsibilities include CI/CD implementations and cloud infrastructure management.

Hybrid Role

München Germany Devops Engineer

€55,000 - €85,000 per year

5 hours ago

NS

DevOps Engineer – Mid-Senior

Nord Security

DevOps Engineer responsible for infrastructure solutions in mobile data services at Saily. Working with AI and developing CI/CD processes within the company.

Hybrid Role

Vilnius Lithuania Devops Engineer

€4,700 - €6,600 per month

5 hours ago

NS

DevOps Engineer – Mid-Senior

Nord Security

DevOps Engineer responsible for infrastructure implementation at Saily, enhancing mobile data connectivity solutions. Engaging in complex problem - solving within the Infrastructure team in a hybrid environment.

Hybrid Role

Warsaw Poland Devops Engineer

PLN 20,500 - PLN 28,400 per month

6 hours ago

AD

Senior DevOps Engineer

Advansys

Senior DevOps Engineer managing cloud solutions at FORTE CLOUD. Handling deployment, migration, and integration while ensuring high quality and scalability.

Hybrid Role

Nasr City Egypt Devops Engineer

12 hours ago

DA

DevOps Engineer

DATAGROUP

Dev Ops Engineer at DATAGROUP managing applications and cloud technology transformations. Collaborating with clients and teams to enhance IT landscapes and operations.

Onsite Role

Rostock Germany Devops Engineer

21 hours ago

NN

Lead DevOps Architect

NewRich Network

DevOps Engineer helping deploy MVP, CRM, and billing systems for Newrich Network. Focused on infrastructure, automation, and building for scale with potential to go full - time.

Hybrid Role

Toronto Canada Devops Engineer

yesterday

GC

Cloud Operations Engineer, Multi-Cloud Environments

Gramian Consulting

Cloud Operations Engineer supporting and maintaining multi - cloud public infrastructure for enterprise customers. Working in structured ITIL environment and contributing to operational excellence.

Hybrid Role

Saint-Cloud France Devops Engineer

2 days ago

MA

Azure DevOps, Terraform

Minor Hotels Europe and Americas

Cloud Engineer developing Infrastructure - as - Code with Terraform and Azure DevOps. Managing Azure infrastructure and leading incident response within cross - functional teams.

Onsite Role

Navi Mumbai India Devops Engineer

2 days ago

MA

DevOps Engineer

Minor Hotels Europe and Americas

DevOps Engineer building and maintaining authentication platforms in multi - cloud environments. Using technologies like Terraform, Ansible, and Python for automation and optimization.

Onsite Role

Gurgaon India Devops Engineer

2 days ago

SK

DevSecOps Engineer

Skillfield

DevSecOps Engineer at Skillfield working on secure CI/CD pipelines for mobile - first delivery. Collaborating with teams to embed security and automation in the delivery lifecycle.

Hybrid Role

Melbourne Australia Devops Engineer