Senior Site Reliability Engineer developing and operating Azure Red Hat OpenShift managed cloud services. Collaborating with a global team to solve complex challenges in a blameless environment.
Responsibilities
Develop, scale, and operate Azure Red Hat OpenShift managed cloud services.
Contribute code to increase the scalability and reliability of the service.
Contribute software tests and participate in peer review to increase the quality of our codebase.
Help and develop peers’ capabilities through knowledge sharing, mentoring, and collaboration.
Participate in a regular on-call schedule, including occasional paid weekends and holidays.
Practice sustainable incident response and blameless postmortems.
Resolve customer issues escalated from the Red Hat Global Support team.
Work within a small agile team to develop and improve SRE software, support peers, plan and self-improve.
Requirements
Bachelor’s degree in Computer Science, Engineering, or related field; equivalent practical experience will also be considered.
Strong experience (5+ years) in at least one programming language (Golang, C, C++, Python, Java) and software life cycles
Hands-on experience with public cloud platforms (AWS, GCP, Azure). Preferably Azure
Direct experience with Kubernetes or OpenShift is a major plus.
4+ years desired debugging, optimizing code and automating routine tasks.
Experience with Docker based containers
Strong collaboration and problem-solving skills in distributed, team-based environments.
Experience troubleshooting as-a-service offerings (SaaS/PaaS) and working with complex distributed systems.
Working knowledge of Linux/Unix operating systems.
Proven ability to automate repetitive tasks and debug performance issues.
Ability to collaboratively troubleshoot and solve problems in a remote and distributed team setting.
Benefits
Red Hat relies on teamwork and openness for its success.
We learn from our failures in a blameless environment to support the continuous improvement of the team.
Professional development opportunities
Flexible work arrangements
Health insurance
Paid time off
Job title
Senior Site Reliability Engineer – OpenShift, Kubernetes, Azure, Golang, Linux
DevOps and Build Engineer for NVIDIA developing and maintaining CI/CD pipelines. Collaborating with teams to enhance compiler technologies and optimize build performance in a diverse environment.
Senior AWS DevOps Developer responsible for managing AWS infrastructure for enterprise public budgeting software at Euna Solutions. Collaborating on cloud projects and enhancing system reliability and performance.
Principal AI Site Reliability Engineer driving operational excellence for critical contact center applications at Fidelity. Leading automation and observability initiatives to improve reliability and efficiency.
Data Transport Infrastructure DevOps Engineer at Leidos modernizing global - scale multi - cloud environments for USAF missions. Involves developing cloud - native solutions and ensuring security best practices.
DevOps Engineer responsible for building and optimizing AWS - based infrastructure and backend systems at Allguth GmbH. Part of a team focused on innovative mobility solutions in Munich region.
(Senior) DevOps Engineer specializing in ML solutions implementation and management in Germany. Focused on CI/CD pipelines, automation, and cloud services.
Specialist DevSecOps joining Periferia IT Group, a leader in digital transformation. Work in a dynamic environment with continuous learning and professional development opportunities.
Join Zinkworks as a Senior Platform Engineer designing scalable IaC - driven cloud platforms for a large - scale enterprise contact centre. Focused on automation, reliability, and platform ownership in a hybrid work environment.
Asset Reliability Engineer providing maintenance advice and service innovations. Join Sensorfact, the leading smart monitoring platform, to modernize the industrial sector.