Senior Site Reliability Engineer at PROS | Hybrid Hired

About the role

Senior Site Reliability Engineer managing cloud infrastructure for SaaS solutions at PROS Holdings. Focusing on reliability, automation, and team collaboration in a hybrid work environment.

Responsibilities

Design, implement, and maintain secure, scalable infrastructure across cloud environments
Analyze cloud environment requirements from various sources, document system designs, and implement necessary modifications
Automate repetitive system tasks and manage system-related activities for internal and external clients, including Professional Services support
Ensure system reliability through robust failover mechanisms, disaster recovery processes, and 24/7 support strategies
Design, implement, and improve monitoring tools to meet SLOs, ensuring a “Monitor by Design” approach is adopted across product teams
Continuously drive reliability improvements through proactive initiatives, data-driven SLO adjustments, and advanced monitoring/alerting solutions
Lead and coordinate disaster recovery testing exercises and capacity planning to enhance system reliability
Identify and reduce operational toil through automation and tool development
Apply and enforce security best practices across cloud environments, while mentoring team members on SLO achievement
Facilitate cross-team communication, provide training, and maintain clear documentation (e.g., runbooks and procedures)
Support cloud environment management and propose technology changes to improve performance and reliability.

Requirements

7+ years of experience as a System Administrator, DevOps Engineer, SRE, or similar role
Deep knowledge of Linux administration, including performance monitoring, tuning and troubleshooting
Experience with cloud network design (Azure preferred, AWS or GCP also considered)
Proficiency in scripting (e.g., Bash, Python) for automation
Experience with version control software (preferably Git)
Experience with configuration management tools (e.g., Puppet, Foreman, Ansible, or similar)
Knowledge of container orchestration tools (e.g., Kubernetes, Docker Swarm, etc.)
In-depth knowledge of monitoring and logging solutions for cloud infrastructure (e.g., Prometheus, Grafana, etc.)
Bachelor’s degree in Computer Science or a related field
Excellent time management, organizational, crisis management, and problem-solving skills
Self-starter, able to work independently without direct supervision
Willingness to innovate, learn, and share knowledge
Excellent verbal and written communication skills
Experience developing and implementing IT security best practices and procedures
Willingness to participate in on-call rotations and respond to incidents in a timely and effective manner
Excellent command of the English language.

Benefits

Health insurance
Flexible work arrangements
Professional development opportunities

Similar roles

Browse all Devops Engineer jobs

58 minutes ago

BR

DevOps Engineer

Brillio

Senior Engineer Cloud Engineering role focused on AWS migration and automation. Collaborating with teams to innovate cloud patterns and infrastructure best practices.

Hybrid Role

Guadalajara Mexico Devops Engineer

1 hour ago

NV

Senior Cloud Operations Engineer

NVIDIA

Senior Operations Engineer driving efficiency and reliability in NVIDIA's global business operations. Collaborating with IT subsystems and automating operational workflows for organizational impact.

Onsite Role

Santa Clara United States Devops Engineer

$184,000 - $287,500 per year

1 hour ago

BO

Lead DevOps Developer

Boeing

Lead or Senior DevOps Developer joining Boeing Defense, Space and Security for advanced technology missions. Involves CI/CD, cloud systems design, and collaboration with government customers.

Onsite Role

Seal Beach United States Devops Engineer

$146,200 - $239,200 per year

3 hours ago

FT

Senior Site Reliability Engineer (SRE)

FCamara Consulting & Training

Site Reliability Engineer ensuring high availability and performance for digital platforms in retail. Collaborating with engineering teams for automation and observability practices.

Hybrid Role

Santo André Brazil Devops Engineer

3 hours ago

EX

Associate Site Reliability Engineer, SRE

Exegy

Associate Site Reliability Engineer supporting the reliability and performance of global IT infrastructure at Exegy. Engage with senior engineers and learn foundational systems engineering skills.

Hybrid Role

St. Louis United States Devops Engineer

7 hours ago

FI

Principal Site Reliability Engineer – Software Engineering

FIS

Site Reliability Engineer driving innovation and growth for Banking Solutions, Payments, and Capital Markets business. Responsible for application reliability and incident response in a hybrid work environment.

Hybrid Role

Atlanta United States Devops Engineer

7 hours ago

TI

DevSecOps – M/F

Tiime

DevSecOps role at Tiime ensuring implementation of security practices in products. Collaborate with teams for cloud security and incident management in a hybrid workspace.

Hybrid Role

Paris France Devops Engineer

12 hours ago

FI

Senior Site Reliability Engineer

Fixify

Senior Site Reliability Engineer responsible for designing reliable infrastructure supporting Fixify's SaaS platform. Collaborating with product engineering teams and maintaining operational standards for infrastructure performance.

Hybrid Role

Ireland Devops Engineer

13 hours ago

IN

DevOps Engineer

Internetstiftelsen

DevOps Engineer working with critical infrastructure systems for Swedish internet services. Focused on building and managing robust systems and contributing to automation and operational improvements.

Hybrid Role

Stockholm Sweden Devops Engineer

14 hours ago

OC

DevSecOps Consultant

Orange Cyberdefense

DevSecOps Consultant integrating security into IT development and operational processes. Advising clients on seamless integration of security requirements into DevOps workflows.

Hybrid Role

Germany Devops Engineer