Senior Cloud Operations & Infrastructure Developer improving cloud-based application reliability for an industrial software company. The role involves deployment, monitoring, and incident response in cloud environments.
Responsibilities
Provide timely and effective response to incidents to minimise the impact on our customers and keep colleagues updated as required.
Work with development teams to advise and contribute to improvements to operational stability, security, cost management and reporting requirements for our cloud solutions.
Continually develop and improve Site Reliability Engineering processes, adding value through optimisation, automation and effective reporting.
Oversee relevant cloud deployments to ensure successful implementation, track and report progress, and resolve or escalate release issues.
Proactively manage cloud environments to minimise service-impacting issues, including infrastructure as code, certificates, storage, health status and backup status.
Ensure new services meet required operational readiness standards before being accepted into operations support practices.
Ensure ongoing compliance with security practices and policies.
Provide subject matter expertise to business stakeholders as required.
Maintain clear and accurate operational documentation.
Requirements
Knowledge and experience of operational support, software development and deployment methodologies and principles.
Strong programming skills in languages such as C#, Go or JavaScript (Node.js).
Expertise in cloud platforms such as Azure, AWS, or Google Cloud.
Proficiency in container and orchestration technologies such as Docker or Kubernetes.
Experience with Infrastructure as Code (IaC) tools like Terraform, Bicep or CloudFormation.
Knowledge of PowerShell, Python or Node.js scripting.
Understanding of cloud security, networking, and storage solutions.
Experience with Azure Service Operator (ASO), Helm and GitOps practices.
Strong written, verbal and presentation skills, able to convey information clearly and concisely to technical and non-technical audiences.
Principal Engineer leading design and implementation of secure architectures for Walmart’s AI Security Team. Responsibilities include risk management, capacity planning, and cross-team collaboration.
Communications Desk Infrastructure Engineer responsible for maintaining and troubleshooting APS communication systems. Supporting critical operational and public safety communication needs across Arizona.
Student Assistant in IT Infrastructure Engineering at Liebherr - Hamburg. Supporting network solutions, system configurations and project management tasks.
Infrastructure Architect required for designing a next-gen hosting platform in Kubernetes at Enova Consulting. Collaborating closely with engineers and partners for a hybrid infrastructure solution.
Cloud Infrastructure Engineer ensuring AWS service reliability and performance at Perlego. Collaborating with teams and managing infrastructure in a hybrid working environment.
Senior Infrastructure Engineer designing and building hybrid networks for ICEYE’s satellite operations. Ensuring high-throughput and reliability between ground stations and cloud environments.
AI Infrastructure Engineer designing and implementing AI solutions for Xsolla's infrastructure tasks across GCP and multi-cloud environments. Collaborating with senior engineers to execute AI strategy.
Data Transport Infrastructure Engineer at Leidos supporting U.S. Air Force Cloud One Architecture. Involves developing scalable cloud-native solutions and mentorship roles in a hybrid remote setting.
Principal Software Engineer on Walmart's AI Security team analyzing threats and implementing robust security architectures. Collaborate across domains and mentor on AI safety and secure engineering practices.
Data Center Infrastructure Architect designing scalable and resilient optical cabling for hyper-scale data centers. Implementing physical solutions and automating fiber mapping for efficiency.