DevOps Engineer improving reliability and stability of cloud services at Madhive. Responsibilities include CI/CD tooling, monitoring, and cloud infrastructure management.
Responsibilities
Improve the reliability and stability of Madhive’s cloud services, operating primarily in a mix of AWS and Google Cloud, with more of the latter.
Design, build, and maintain CI/CD tooling for Infrastructure as Code and internal services (GitHub Workflows, CloudBuild).
Develop and support monitoring, alerting, and observability systems to ensure platform health.
Automate deployment and management of cloud infrastructure using Terraform, Helm, and other IaC tooling.
Administer, monitor, and optimize databases to ensure performance, reliability, and availability.
Implement database backup, recovery, and scaling strategies to support large-scale distributed systems.
Enforce cloud security best practices (IAM, permissions, policies).
Identify opportunities to optimize cloud services and databases for efficiency and cost control.
Collaborate with cross-functional guilds to establish operational standards and reduce risk.
Stay current on emerging cloud and database technologies, evaluating for potential adoption.
Requirements
Strong understanding of cloud infrastructure, networking, containerization, and distributed systems (GCP preferred, AWS/Azure a plus).
Hands-on experience with Infrastructure as Code (Terraform or similar).
Proficiency with Bash and command-line utilities; Golang experience required (PHP/Python/JavaScript nice to have).
Experience with containerization and orchestration (Docker, Kubernetes).
Solid background in Database Administration: provisioning, scaling, tuning, monitoring, backup/recovery, and troubleshooting.
Familiarity with database performance optimization and observability tools.
Experience with monitoring systems (Google Cloud Monitoring Suite, Datadog, Cloudwatch, etc.).
Strong troubleshooting and problem-solving skills, with a systematic approach.
Excellent written and verbal communication skills; able to document and share best practices.
Comfortable in a fast-paced environment with a growth mindset and eagerness to learn.
Benefits
We embrace our differences and believe they fuel our creativity.
We come from varied backgrounds and think that’s important.
We are all trail-blazing team players who think big and want to make an impact.
We are committed to cultivating a culture of inclusion and collaboration.
We welcome diversity in education, culture, opinions, race, ethnicity, gender identity, veteran status, religion, disability, sexual orientation, and beliefs.
Jr. DevOps Engineer supporting and improving CI/CD pipelines and Linux systems at Swift. Collaborating with senior engineers in a hands - on learning environment.
Senior DevOps Engineer I managing automation tooling and multi - cloud infrastructure at Spring Health. Collaborating with AI and Infrastructure teams in a hybrid Seattle office.
Site Reliability Engineer for cloudified backup platform using Commvault technology at Expleo. Joining a dynamic team to ensure backup infrastructure scalability and reliability.
Site Reliability Engineer responsible for designing and maintaining scalable services with high availability. Collaborating with development teams to enhance reliability and operational excellence.
Technical Staff leading the architecture, reliability, and modernization of enterprise ALM and DevOps tools. Driving strategy and influencing product development in collaboration with various teams.
Site Reliability Engineer responsible for reliability and availability, collaborating with development teams on scalable systems. Applying software engineering practices to improve production operations.
DevOps Engineer in the Security Data and AI Lab at Lloyds Banking Group driving data and cloud infrastructure's influence on product operations and customer service improvements.
Senior Platform DevOps Engineer at Code Metal designing and implementing cloud and hybrid infrastructure to support customer deployments and internal platforms. Collaborating with software and security teams for reliable delivery.
DevOps Platform Intern managing cloud infrastructure and deployment pipelines for AI - native software delivery. Partnering with a Product Development Intern, set up and manage containerized applications on Azure Kubernetes Service.