AI Security Control Developer/Site Reliability Engineer for RBC's enterprise AI ecosystem. Design, implement, and validate security controls to protect AI systems with 24/7 reliability.
Responsibilities
Build, operate, and continuously validate the security controls that protect RBC's enterprise AI ecosystem
Design and implement detective and preventative controls, metrics, alerting, and automated response mechanisms
Ensure AI security boundaries operate reliably 24/7
Be on-call on a rotating basis, ensuring security control reliability around the clock
Design and implement alerting architecture, escalation paths, triage runbooks, and self-healing mechanisms to ensure 24/7 operational resilience
Build comprehensive metrics dashboards and alerting for control health, detection accuracy, and coverage
Build and maintain automated testing pipelines for continuous validation of LLM Gateway controls
Build strong partnerships with AI platform teams, MLOps, DevOps, IAM, and Security Architecture
Requirements
Bachelor's degree (or equivalent) in Computer Science, Software Engineering, Cybersecurity, or a related field
5+ years of software engineering or SRE experience with strong, production-level Python proficiency
Experience building containerized services using Kubernetes or OpenShift
Experience with Open Policy Agent (OPA) or policy-as-code frameworks
Experience with SRE practices including SLOs/SLIs, error budgets, observability, alerting, incident response, and on-call rotations
Experience defining and implementing infrastructure and application pipelines
Ability to deliver robust, production-ready AI security solutions and platforms, drive continuous improvement, advocate for safety and privacy-by-design, and communicate effectively with technical and business stakeholders
Excellent organizational, communication, interpersonal, and motivational skills, applied to achieving business objectives
Benefits
A comprehensive Total Rewards Program including bonuses and flexible benefits
Competitive compensation, commissions, and stock where applicable
Leaders who support your development through coaching and managing opportunities
Ability to make a difference and lasting impact
Work in a dynamic, collaborative, progressive, and high-performing team
Flexible work/life balance options
Opportunities to do challenging work
Opportunities to take on progressively greater accountabilities
Access to a variety of job opportunities across the business
Job title
AI Security Control Developer/Site Reliability Engineer – Global Security