Senior DevOps Engineer ensuring the reliability and performance of Kami's applications. Join a dynamic team in Auckland, New Zealand, and support a growing education platform.
Responsibilities
Analyze and optimize system reliability, performance, and resource utilization of cloud infrastructure.
Develop and maintain automation scripts for deployment, monitoring, and maintenance tasks.
Implement infrastructure as code (IaC) to automate the provisioning and configuration of infrastructure components.
Design and implement monitoring solutions to proactively identify and address issues.
Participate in on-call rotations and respond to incidents to ensure system stability and performance.
Conduct capacity planning to anticipate future resource needs and optimize infrastructure scalability.
Define and track reliability metrics to measure and improve system performance.
Prepare and present reports on system reliability and performance.
Work closely with software development teams to influence and improve the reliability and scalability of applications.
Conduct post-incident reviews to identify root causes and implement preventive measures.
Troubleshoot complex issues in a production environment.
Requirements
7+ years of experience in a DevOps, SRE or similar role
Bachelor's degree in Computer Science, Information Technology, or a related field.
Relevant experience in software engineering, systems administration, or a related field.
Proficiency in programming languages (e.g. Python, Go, Ruby)
Strong scripting skills for automation tasks (e.g. Bash, Python)
Hands-on experience and in-depth knowledge of cloud platforms (e.g. Google Cloud, AWS) and container orchestration tools (e.g. Kubernetes)
Solid understanding of core networking concepts (e.g. TCP/IP, DNS, load balancing)
Familiarity with Infrastructure as Code (IaC) tools (e.g. Terraform) and/or configuration management tools (e.g. Ansible, Puppet, Chef)
Experience with infrastructure monitoring, logging, and alerting tools (e.g. Datadog, Prometheus, Grafana, PagerDuty) and with log analysis
Strong collaboration and communication skills to work effectively with cross-functional teams
Ability to analyze complex systems and troubleshoot issues effectively.
Benefits
A people-first employer on an inspiring mission to build the future of education and change the lives of millions
Continuous learning and development opportunities, including subsidised course fees, certifications, and conferences, plus free access to Udemy and more