Site Reliability Engineering Senior Manager leading multiple SRE teams at Netwealth. Shaping strategy and operational practices in a collaborative environment.
Responsibilities
Embed SRE principles and shared reliability ownership across engineering tribes
Define and lead SRE strategy, standards, and operating models
Scale incident management, on-call, and support practices as platforms grow
Drive reliability decisions using SLIs, SLOs, SLAs, error budgets, and observability data
Balance reliability improvements with delivery velocity and commercial outcomes
Influence senior leaders through evidence-based conversations
Lead managers and senior specialists through organisational and cultural change
Requirements
Extensive experience leading SRE, reliability engineering, or large-scale production operations
Proven people leadership, including managing managers and senior specialists
Strong knowledge of SRE fundamentals and data-driven optimisation
Expertise in scaling incident management and operational excellence
Clear stakeholder engagement and concise communication skills
Financial services or regulated environment experience (desirable)
Benefits
Family-friendly support: paid parental leave and a fully funded school holiday program
Wellness perks: CU Health (virtual healthcare), income protection, flu shots, wellness weeks, retail discounts and financial wellbeing services
A vibrant culture: social events, trivia nights, and corporate sports
Employee Resource Groups: LGBTQIA+, DAWN (Development and Accelerating Women at Netwealth), Culture Group and Carers Group
Community impact: paid volunteering and our Netwealth Impact Group
Senior DevOps Engineer I managing automation tooling and multi-cloud infrastructure at Spring Health. Collaborating with AI and Infrastructure teams in a hybrid Seattle office.
Site Reliability Engineer for a cloud-based backup platform using Commvault technology at Expleo. Joining a dynamic team to ensure backup infrastructure scalability and reliability.
Site Reliability Engineer responsible for designing and maintaining scalable services with high availability. Collaborating with development teams to enhance reliability and operational excellence.
Technical Staff leading the architecture, reliability, and modernization of enterprise ALM and DevOps tools. Driving strategy and influencing product development in collaboration with various teams.
Site Reliability Engineer responsible for reliability and availability, collaborating with development teams on scalable systems. Applying software engineering practices to improve production operations.
DevOps Engineer in the Security Data and AI Lab at Lloyds Banking Group, using data and cloud infrastructure to improve product operations and customer service.
Senior Platform DevOps Engineer at Code Metal designing and implementing cloud and hybrid infrastructure to support customer deployments and internal platforms. Collaborating with software and security teams for reliable delivery.
DevOps Platform Intern managing cloud infrastructure and deployment pipelines for AI-native software delivery. Partnering with a Product Development Intern to set up and manage containerized applications on Azure Kubernetes Service.
UNIX DevOps Engineer managing AIX and Solaris server operations for a Swiss telecom company. Focusing on automation, optimization, and 24/7 monitoring responsibilities across multiple locations.