Head of CloudOps SRE leading operational excellence and reliability at S&P Global’s multi-cloud infrastructure. Directing teams to enhance security, performance, and cost efficiencies.
Responsibilities
Lead and Inspire: Direct and empower a team of operational engineers, fostering a high-performance culture aligned with values like inclusion, agility, and continuous learning.
Ensure Operational Excellence: Own the reliability, availability, and performance of the multi-cloud (AWS, Azure, GCP) and on-premise compute and storage estate, targeting high availability and ensuring minimal downtime.
Drive Security and Compliance: Oversee critical operational tasks, including efficient patching, rapid remediation of all critical vulnerabilities, and proactive security enhancements to maintain a hardened and compliant infrastructure.
Strategize FinOps: Identify and support FinOps strategies (cost optimization, resource rightsizing, intelligent tiering) to achieve significant and sustained operational cost savings.
Influence Strategy and Governance: Chair key leadership meetings and workshops to align divisional leaders on cloud strategy, governance, and DevOps alignment.
Champion Transformation: Drive major infrastructure programs, including cloud adoption and the integration of GenAI technologies (Azure OpenAI, AWS Bedrock) to enhance enterprise product scalability and capabilities.
Measure and Report: Design and leverage detailed resource dashboards and metrics to enable data-driven operational decisions, and provide actionable insights.
Requirements
Extensive experience (15+ years) in Cloud Operations, SRE, or Reliability Engineering in an enterprise-scale, multi-cloud environment (AWS, Azure, GCP).
Proven success in leading and mentoring large operational or SRE teams, demonstrating a focus on organizational upskilling and cultural transformation.
Expertise in implementing and scaling Cloud FinOps principles and tooling to deliver measurable cost savings and resource efficiency.
Strong technical background in managing large-scale compute, storage, and backup estates, with a focus on security hardening, patching, and vulnerability management.
Demonstrated ability to drive large-scale migration and remediation programs and operational efficiency initiatives.
Experience integrating or adopting GenAI/LLM technologies into operational or platform capabilities is highly desirable.
Exceptional executive communication, influencing, and stakeholder management skills.
Benefits
Health & Wellness: Health care coverage designed for the mind and body.
Flexible Downtime: Generous time off helps keep you energized for your time on.
Continuous Learning: Access a wealth of resources to grow your career and learn valuable new skills.
Invest in Your Future: Secure your financial future through competitive pay, retirement planning, a continuing education program with a company-matched student loan contribution, and financial wellness programs.
Family Friendly Perks: It’s not just about you. S&P Global has perks for your partners and little ones, too, with some best-in class benefits for families.
Beyond the Basics: From retail discounts to referral incentive awards—small perks can make a big difference.
Entry - level DevOps Engineer at Nokia focusing on building and maintaining CI environment for LTE and 5G solutions. Engage with high - end telecommunication technologies and support development workflows.
AI Security Control Developer/Site Reliability Engineer for RBC's enterprise AI ecosystem. Design, implement, and validate security controls to protect AI systems with 24/7 reliability.
Senior Site Reliability Engineer ensuring scalability and reliability for NGINX systems and SaaS platforms. Collaborating across teams to drive automation and system performance.
Site Reliability Engineer ensuring reliability and performance of data platform services for Veepee. Collaborating on cloud migration, Kubernetes operations, and observability best practices.
Senior Lead Site Reliability Engineer overseeing critical systems stability and incident management. Leading Java applications reliability and supporting a dynamic technology environment.
Infrastructure Architect connecting clients and Kyndryl. Leading projects from start to finish, ensuring technical solutions meet client needs at Kyndryl.
DevOps Engineer automating and configuring network monitoring and automation solutions for Telia’s telecom operations in Finland. Ensuring performance, resilience, and high observability of critical platforms.
Client Services Consultant specializing in DevOps Mainframe Operations with experience in automation best practices. Analyzing Life Cycle Management data needs and evaluating solutions for Endevor - related operations.
Senior AWS DevOps Engineer at LexisNexis shaping global CI/CD platform. Collaborating with teams to deliver secure, reliable, and scalable delivery pipelines.
Cloud Engineer at MetroStar focusing on building and securing cloud - native systems. Managing Kubernetes workloads and CI/CD pipelines in Agile teams with an emphasis on security.