Senior Site Reliability Engineer at Broadridge managing infrastructure design and operational support. Collaborating with teams to improve automation, performance, and reliability of services in a hybrid environment.
Responsibilities
Work within and across teams to design, develop, test, implement, and support technical solutions across a full-stack of development tools and technologies.
Translate business requirements into technical designs, considering automation, availability, performance, scale and cost.
Ensure technical & security best practices along with Broadridge standards are adhered to in the design of technical infrastructure
Participate in technical design sessions and works closely with multiple teams, including application development teams, infrastructure teams, vendors, and clients, if needed, to review the infrastructure designs for new projects.
Deliver high quality technical infrastructure, on-time, following Broadridge processes.
Automate the implementation and operational support of the infrastructure.
Provide estimates of all priority and non-priority projects along with recommended scope or schedule changes based on capacity and unforeseen challenges.
Participate in technical implementation to ensure the quality of the infrastructure, automation and the overall productivity of the SRE (Site Reliability Engineering) team.
Track Service Level Indicators (SLI) to ensure the health of technical infrastructure and Broadridge services.
Troubleshoot production issues affecting Broadridge services as needed, taking appropriate corrective actions.
Conduct preventative maintenance to ensure capacity, scaling, security and availability of Broadridge services.
Understand dependencies between infrastructure components, vendor software, custom software and other parts of the processing stack that support Broadridge Services
Collaborate with peers and other technical teams, such as development teams, architecture, database teams, storage teams, server teams, security teams to prevent and shorten production incidents.
Define Service Level Objectives (SLOs) for Broadridge Services
Implement additional operational improvements for automation, monitoring and incident management to increase the reliability of Broadridge services.
Guide more junior associates through established processes.
Requirements
Bachelor’s degree in computer science, Computer Engineering, or in a related field.
8+ years of experience with commercial service infrastructure at both a software and infrastructure level
Experience in managing datacenter hosted and AWS hosted application.
8+ years of experience within a programming and application system environment, with solid experience and a working knowledge in the following technologies: OS: Linux, Windows
Skills: Functional skills – System Design and Architecture, DevOps / Deployment automation, Troubleshooting, Service Monitoring.
Passionate teammate who understands and respects personal & cultural differences
Ability to work under pressure and be highly adaptable
Strong written and communications skills for collaboration with various teams and upper management
Solid analytical skills, especially in area of translating business requirements into technical design – with a continuous focus on aligning technical roadmap with the immediate and long-term Business strategy
Ability to adapt and embrace change and support business strategy and vision.
Knowledge of next-generation design patterns/architecture like micro-services, layered pattern, cloud.
Strong aptitude for learning new skills and new technologies.
Benefits
Please visit www.broadridgebenefits.com for more information on our comprehensive benefit offerings.
Job title
Senior Site Reliability Engineer – Hybrid, Flexible Options
DevOps Engineer managing Kubernetes deployments for health tech company. Collaborating with engineering teams to enhance healthcare services using advanced technologies.
DevOps Engineer at PointClickCare, empowering innovative healthcare with Kubernetes and automation expertise. Work remotely while supporting crucial healthcare technology solutions.
Entry Level DevOps Engineer at Podimo, building scalable cloud infrastructure for a podcast platform. Collaborate with development teams and leverage AI tools to enhance the platform.
DevOps Engineer managing AWS infrastructure while contributing to backend code in Node.js and Python. Join Auterion building AI - powered software for autonomous systems.
Cloud DevOps Engineer managing Azure infrastructure at Medical Guardian. Overseeing technical operations and security response in a hybrid work environment.
SRE Linux/Unix System Administrator at Broadridge with strong Unix/Linux Bourne/Bash Scripting skills. Collaborating in a hybrid, fast - paced environment to manage critical systems.
Senior Site Reliability Engineer at Rootly embedding with teams to enhance service performance and reliability. Own CI/CD pipelines and drive capacity planning efforts in a fast - paced environment.
DevOps Engineer improving CI/CD pipelines and best practices for Datatonic's AI and data projects. Collaborate with clients to enhance infrastructure and drive innovation in tech.
Senior/Principal DevOps Engineer developing robust CI/CD pipelines for ClubWPT Gold at a hypergrowth startup. Collaborate globally to revolutionize online gaming experiences while maintaining high technical standards.
DevOps Engineer responsible for the health, performance, and automation of gaming platform services. Focused on CI/CD pipelines, infrastructure services, and application monitoring.