Senior Azure Site Reliability Engineer ensuring the reliability and performance of the Vew SaaS platform on Microsoft Azure. Collaborating with teams to design and implement resilient systems.
Responsibilities
Implement and maintain highly available, scalable, and fault-tolerant systems on Azure
Monitor system health and performance metrics to ensure reliability and proactively address issues
Develop and maintain automation scripts and tools for provisioning, deployment, monitoring, and scaling of services
Configure and maintain monitoring solutions to provide real-time visibility into system health and performance
Respond to and resolve incidents, including root cause analysis, mitigation, and communication with stakeholders
Ensure systems and infrastructure adhere to security best practices and compliance requirements
Identify areas for optimization and implement solutions to improve system reliability, performance, and efficiency
Requirements
Bachelor's degree in Computer Science, Engineering, or related field
Proven experience as a Site Reliability Engineer or similar role, preferably in a SaaS environment
Strong proficiency in Microsoft Azure services, including compute, networking, storage, and monitoring
Experience with automation tools and scripting languages such as PowerShell
Solid understanding of containerization technologies (e.g., Docker, Kubernetes) and orchestration tools
Experience with Bicep/Terraform and ARM templates for Infrastructure as Code (IaC)
Hands-on experience with monitoring and logging tools such as Azure Monitor, Grafana, Prometheus, or Datadog
Knowledge of security best practices, compliance standards (e.g., ISO27001, SOC 2, GDPR), and relevant regulations
Excellent problem-solving skills and the ability to troubleshoot complex technical issues
Strong communication and collaboration skills, with the ability to work effectively in a cross-functional team environment
Azure certifications such as Azure Administrator Associate or Azure Solutions Architect Expert are nice to have
DevOps Master/Specialist working on banking solutions, automating CI/CD pipelines and managing cloud infrastructure. Requires experience in DevOps and low-code technologies.
DevOps Engineer in the US helping with digital transformation projects for international clients. Utilizing AWS, Terraform, and CI/CD tools in a global operations team.
DevOps Engineer responsible for building and maintaining scalable AI systems on Azure cloud. Collaborating with teams to ensure operational excellence for enterprise-grade AI solutions.
Junior MLOps Engineer helping to design and maintain AI/ML systems at Bupa. Collaborating with teams to operationalize machine learning models and automate workflows.
DevOps Engineer developing and managing scalable AWS infrastructures for a PropTech startup. Collaborating within a growing tech team to achieve ambitious goals in the legal conveyancing space.
Senior DevOps Engineer leading the design and optimization of cloud infrastructure at Growth Acceleration Partners. Ensuring secure and cost-effective deployments within a fast-paced product development environment.
Advanced Dev Ops Engineer optimizing infrastructure solutions for engineering teams at a consulting and technology services company. Ensuring secure and cost-effective deployments in a fast-paced environment.
Entry-level DevOps Engineer at Nokia focusing on building and maintaining the CI environment for LTE and 5G solutions. Engaging with high-end telecommunication technologies and supporting development workflows.
AI Security Control Developer/Site Reliability Engineer for RBC's enterprise AI ecosystem. Design, implement, and validate security controls to protect AI systems with 24/7 reliability.