Site Reliability Engineer driving innovation and growth for the Banking Solutions, Payments, and Capital Markets businesses. Responsible for application reliability and incident response in a hybrid work environment.
Responsibilities
Design and maintain monitoring solutions for infrastructure, application performance, and user experience
Implement automation tools to streamline tasks, scale infrastructure, and ensure seamless deployments
Ensure application reliability, availability, and performance, minimizing downtime and optimizing response times
Lead incident response, including identification, triage, resolution, and post-incident analysis
Conduct capacity planning, performance tuning, and resource optimization
Collaborate with security teams to implement best practices and ensure compliance
Manage deployment pipelines and configuration management for consistent and reliable app deployments
Develop and test disaster recovery plans and backup strategies
Collaborate with development, QA, DevOps, and product teams to align on reliability goals and incident response processes
Participate in on-call rotations and provide 24/7 support for critical incidents
Requirements
Proficiency in development technologies, architectures, and platforms (web, API)
Experience with cloud platforms (AWS, Azure, Google Cloud) and IaC tools
Knowledge of monitoring tools (Prometheus, Grafana, DataDog) and logging frameworks (Splunk, ELK Stack)
Experience in incident management and post-mortem reviews
Strong troubleshooting skills for complex technical issues
Proficiency in scripting languages (Python, Bash) and automation tools (Terraform, Ansible)
Experience with CI/CD pipelines (Jenkins, Harness, GitLab CI/CD, Azure DevOps)
Ownership approach to engineering and product outcomes
Excellent interpersonal communication, negotiation, and influencing skills
Bachelor’s degree in Computer Science, Computer Engineering, or a related field, or equivalent experience
Benefits
Opportunities to innovate in fintech
Tools for personal and professional growth
Inclusive and diverse work environment
Resources to invest in your community
Competitive salary and benefits
Job title
Principal Site Reliability Engineer – Software Engineering