Senior DevOps Engineer managing Azure-based infrastructure solutions at Reward Gateway. Collaborating with development teams to ensure efficient software development and deployment processes.
Responsibilities
Design, implement, and manage cloud-based infrastructure solutions on Microsoft Azure.
Oversee deployment, maintenance, and scaling of Azure services, ensuring high availability and reliability for applications.
Work closely with software development teams to understand application requirements and provide necessary support for continuous integration and continuous deployment (CI/CD) practices.
Foster a collaborative environment by sharing best practices, tools, and methodologies with development teams.
Develop and maintain automated CI/CD pipelines to streamline the deployment process for applications.
Integrate testing automation into deployment pipelines, ensuring high-quality deliverables.
Implement monitoring and alerting tools to track application performance, system health, and infrastructure usage.
Analyse performance metrics and logs to troubleshoot issues and optimize application and infrastructure performance.
Collaborate with security teams to implement best practices for infrastructure security, including access controls, network security, and data protection.
Ensure compliance with industry regulations and internal policies regarding data protection and system management.
Produce clear and comprehensive documentation on system architecture, deployment processes, and operational procedures.
Provide regular reports on system performance, deployment status, and incident management to technical leadership.
Requirements
Proven experience in a DevOps or Site Reliability Engineering role, preferably within cloud environments.
Strong hands-on experience with Microsoft Azure, including services such as Azure App Service, Azure Functions, Azure Kubernetes Service, and Azure DevOps.
Proficiency in scripting languages (e.g., PowerShell, Python, Bash) for automation tasks.
Familiarity with containerization technologies (e.g., Docker, Kubernetes) and orchestration tools.
Experience with CI/CD tools (e.g., Azure DevOps, Jenkins, GitLab CI/CD) and version control systems (e.g., Git).
Strong understanding of network protocols, security, and infrastructure as code concepts (e.g., Terraform, ARM templates).
Excellent problem-solving skills and the ability to troubleshoot complex issues in a collaborative environment.
Strong communication skills, with the ability to convey technical concepts to both technical and non-technical stakeholders.
Benefits
A flexible holiday plan of up to 40 days per year
£400 a year Wellbeing Allowance
Private Medical Insurance
Allowance for professional development books, E-books, podcasts
Contributory pension Scheme
Employee, friends and family discounts across 1200+ retail, hospitality and lifestyle brands
DevOps Engineer for designing and maintaining Azure - based hybrid cloud infrastructure for a company specializing in nature - based smart city solutions. Leading cloud architecture and mentoring engineers as part of a high - impact team.
SRE responsible for ensuring reliability and performance of IT systems at a digital transformation company specializing in public sector efficiency. Collaborating on system health, incident response, and automation tasks.
DevOps Senior role at Beyond Soluções managing CI/CD for .NET and Kubernetes applications. Collaborating on cloud solutions while fostering a culture of innovation and quality.
Senior Software Engineer at PayPal managing cloud infrastructure and DevOps solutions. Delivering complete SDLC solutions and guiding engineering teams for scalable and reliable services.
Senior Site Reliability Engineer at Diligent leading reliability, automation, and observability across cloud infrastructure. Build tools for incident response and enhance performance in fast - paced environments.
Perception Deployment Engineer deploying deep learning models on embedded systems at Caterpillar. Collaborating with cross - functional teams for integration and optimization of perception modules in vehicles.
Principal Site Reliability Engineer at AT&T required to design scalable solutions for critical operations with minimal downtime. Collaborating with teams to monitor and improve system performance in cloud environments.
DevOps Engineer managing AI SaaS infrastructure at a high - growth European company. Supporting AI model deployment and ensuring platform security and compliance with multiple systems integration.
Engineering Manager leading teams for observability platforms at LexisNexis. Owns operational excellence across software delivery lifecycle in Raleigh, NC.
Reliability Engineer optimizing site facility infrastructure and utility systems at Roche. Conducting root cause analyses and developing maintenance plans to enhance reliability and efficiency.