Senior Manager of Site Reliability Engineering overseeing Workday Kubernetes based platform. Leading teams while ensuring high availability and collaborating with federal agencies.
Responsibilities
Manage and lead the teams ensuring the Workday Kubernetes based platform is maintained and healthy
Maintain core platform components for high availability, scalability, and security
Automate infrastructure provisioning and application deployments using tools like Terraform and Argo CD
Provide support and solve platform-related issues collaborating with development teams
Implement and maintain security standard methodologies ensuring compliance
Build and maintain comprehensive documentation for platform components and processes
Actively participate in knowledge sharing within the team
Coach and mentor team members for career growth
Requirements
5+ years of managing and leading site reliability engineering teams
5+ years of hands-on experience working with large scale cloud infrastructure, automation, and DevOps methodologies
Bachelor's degree in a computer related field or equivalent work experience
Proficiency in infrastructure automation tools like Terraform
Experience with building, maintaining, and consuming CI/CD pipelines and tools like Argo CD
Strong analytical and problem-solving skills
Deep understanding of Agile Methodology principles
Strong understanding of Continual Improvement Process principles
Benefits
Workday Bonus Plan eligibility
Role-specific commission/bonus
Annual refresh stock grants
Flexible working hours
Professional development opportunities
Job title
Senior Manager, Site Reliability Engineering, Operations
DevOps Engineer focused on SRE principles and AWS - centric infrastructure for VERBI Software. Managing reliability, scalability, and modernization with hands - on approach and technical mentorship.
Senior DevOps Engineer translating high - level requirements into scalable AWS architectures. Leading delivery of robust solutions and elevating engineering standards across the organization.
DevSecOps Engineer at Onepoint working with tech innovations for clients. Responsible for secure CI/CD pipelines and vulnerability management in collaboration with development teams.
DevSecOps role focused on securing CI/CD pipelines and ensuring best practices in collaboration with developers. Contributing to policies and incident management in a large tech consultancy.
DevSecOps role at Zurich focusing on platform code development and security integration. Collaborating on standards, project pipelines, and cloud services.
Site Reliability Engineer improving operational characteristics of corporate applications with a focus on availability and performance. Collaborating with stakeholders to ensure service level objectives are met.
DevOps Engineer at Capgemini responsible for designing and maintaining CI/CD pipelines within secure environments. Join the team to innovate and optimize technology for sustainability and inclusiveness.
Senior SRE managing Linux environments, Kubernetes, and CI/CD tools at Capgemini. Overseeing day - to - day Linux operations, performance tuning, and automation.