Senior Site Reliability Engineer – AI/ML Optimized GPU Clusters at The Next Chapter | Hybrid Hired

About the role

Senior Site Reliability Engineer at a company operating one of the largest GPU infrastructures. Responsible for keeping the service fault-tolerant and for solving infrastructure problems with cloud technology.

Responsibilities

Ensure fault-tolerance, scale, and uninterrupted operations for the service
Use cutting-edge cloud technology to solve a variety of infrastructure problems
Implement and improve CI/CD processes

Requirements

Solid experience with programming languages (like Go, Python, or C++)
Experience in environments with a multitude of GPUs distributed over multiple nodes
Good understanding of classic algorithms and data structures
Commercial experience with, and deep understanding of, Unix/Linux systems and network technology
Solid experience with CI/CD and infrastructure as code (IaC)
Experience with containerization and configuration management tools (Docker, Kubernetes, Helm, Ansible, Salt, Terraform)

Benefits

Competitive salary and comprehensive benefits package
Opportunities for professional growth
Flexible working arrangements
Dynamic and collaborative work environment
