DevOps Engineer responsible for the upkeep of NinjaOne cloud infrastructure and deployment management. Collaborates with teams to ensure platform availability and implement CI/CD processes.
Responsibilities
Design, develop, and deploy secure, compliant infrastructure in AWS.
Drive automation in infrastructure deployments while shaping the long-term technology strategy and planning.
Lead the development and maintenance of a cutting-edge CI/CD framework and quality ecosystem, integrating new testing and automation tools to streamline workflows.
Define and uphold best practices, procedures, and processes to ensure consistent, high-quality delivery.
Take charge of delivering complex infrastructure projects, leveraging your expertise across all infrastructure technologies to ensure scalable and maintainable solutions.
Proactively monitor infrastructure alerts and metrics, offering technical escalation points for production support both during business hours and after-hours.
Troubleshoot and support cloud infrastructure, infrastructure as code, monitoring systems, alerting, and lead root cause analysis efforts.
Own assigned tasks and tickets, ensuring successful resolution.
Requirements
6+ years of experience working in a cloud-based DevOps environment, specifically AWS.
Hands-on experience developing, securing, and operating highly automated and sophisticated cloud infrastructure solutions in AWS.
Proficient with tools like AWS CloudFormation, CDK, Terraform, and SSM Scripts for managing AWS entities and infrastructure automation.
Comfortable with agile tools (Jira, Confluence, etc.), version control systems (Git, BitBucket), and deployment strategies for both iterative and continuous development.
Deep understanding of CI/CD tools, with CircleCI experience being a plus.
Experience in developing, deploying, and integrating software solutions seamlessly within cloud infrastructure.
Expert in cloud operations, ensuring the scalability, reliability, and maintainability of cloud-native applications.
Ability to coach and guide a team, fostering skill growth and career development.
5+ years of experience working with *nix operating systems (Linux, MacOS, BSD-variants, IBM AIX).
5+ years of scripting experience with languages like Shell (bash, zsh, etc.), Go, Python, PERL, Tcl/Tk.
5+ years of experience in production environment monitoring, alerting, and managing change and configuration.
5+ years working with cloud-based data storage technologies, especially S3 and EFS.
Familiarity with NoSQL is a plus.
3+ years working with message-oriented middleware and queueing technologies like AWS SNS, SQS.
Experience with RDBMS (RDS, Aurora, Postgres) is required; NoSQL experience will make you stand out.
Previous experience with caching technologies like Elasticache is highly desirable.
Site Reliability Engineer for cloudified backup platform using Commvault technology at Expleo. Joining a dynamic team to ensure backup infrastructure scalability and reliability.
Site Reliability Engineer responsible for designing and maintaining scalable services with high availability. Collaborating with development teams to enhance reliability and operational excellence.
Technical Staff leading the architecture, reliability, and modernization of enterprise ALM and DevOps tools. Driving strategy and influencing product development in collaboration with various teams.
Site Reliability Engineer responsible for reliability and availability, collaborating with development teams on scalable systems. Applying software engineering practices to improve production operations.
DevOps Engineer in the Security Data and AI Lab at Lloyds Banking Group driving data and cloud infrastructure's influence on product operations and customer service improvements.
Senior Platform DevOps Engineer at Code Metal designing and implementing cloud and hybrid infrastructure to support customer deployments and internal platforms. Collaborating with software and security teams for reliable delivery.
DevOps Platform Intern managing cloud infrastructure and deployment pipelines for AI - native software delivery. Partnering with a Product Development Intern, set up and manage containerized applications on Azure Kubernetes Service.
UNIX DevOps Engineer managing AIX and Solaris server operations for a Swiss telecom company. Focusing on automation, optimization and 7x24h monitoring responsibilities across multiple locations.
Staff Site Reliability Engineer designing and building backend services for NordVPN. High - ownership role focusing on system architecture and operational excellence.