Senior DevOps Engineer building and operating developer platforms for reliable production shipping at Demandbase in Hyderabad. Focused on improving developer experience and cloud infrastructure.
Responsibilities
Build and operate the platforms, tooling, and workflows that enable engineers to ship reliably to production.
Partner with software, data, and security engineering teams to identify friction across the software delivery lifecycle and address it through automation, platform abstractions, and improved workflows.
Design and evolve developer-facing platforms and tooling that standardize how services and pipelines are built, deployed, and operated.
Enable self-service workflows with opinionated defaults that improve reliability, security, and consistency without slowing teams down.
Use developer feedback, operational data, and production signals to prioritize and drive the DevEx roadmap.
Design, build, and maintain CI/CD orchestration that supports high release velocity, strong security guardrails, and local-to-production parity, preferably using GitLab CI/CD.
Standardize build, test, and deployment patterns across application and data workloads.
Support modern deployment strategies and GitOps-based workflows.
Build, operate, and evolve Kubernetes-based platforms across AWS and GCP, including EKS and GKE.
Enable teams to run workloads on Kubernetes by providing clear operational guardrails, platform defaults, and documented best practices.
Manage multi-account cloud environments with a focus on security, scalability, and ease of use.
Design and maintain infrastructure using Infrastructure as Code, including Terraform and Crossplane.
Build and operate internal platform components such as GitOps tooling, secret management systems, and service mesh infrastructure.
Operate and evolve observability platforms (e.g., Prometheus, Mimir, Thanos, Grafana, Datadog) to provide actionable signals for platform and application teams.
Define and apply SLIs, SLOs, alerting strategies, and incident response practices.
Lead and participate in blameless post-mortems, translating learnings into platform improvements and reduced operational toil.
Support engineering teams running data pipelines and batch workloads on platforms such as Airflow, EMR, and Dataproc.
Standardize deployment, observability, and operational patterns for data workloads.
Improve reliability and operability of data platforms through shared tooling and best practices.
Serve as a technical leader within DevEx, promoting best practices in platform engineering, reliability, and secure software delivery.
Mentor engineers and influence teams through strong technical design, documentation, and collaboration.
Drive adoption of internal platforms through strong defaults, clear documentation, and self-service tooling.
Requirements
8+ years of overall engineering experience, including hands-on software development and cloud infrastructure ownership.
Strong software engineering fundamentals with experience in at least one general-purpose programming language (e.g., Go, Python, Java).
5+ years of experience building and operating cloud infrastructure on AWS and/or GCP at scale.
Proven experience managing multi-account cloud environments, including IAM, networking, and security best practices.
Strong proficiency with Infrastructure as Code, preferably Terraform and Crossplane.
Extensive experience operating Kubernetes platforms in production, including EKS and/or GKE.
Experience managing multiple Kubernetes clusters, including upgrades, networking, and security.
Hands-on experience with service mesh technologies such as Istio in multi-cluster environments.
Deep experience designing and operating CI/CD systems that support high release velocity, preferably GitLab CI/CD.
Experience building developer-facing tooling that improves local-to-production parity and reduces cognitive load.
Familiarity with GitOps practices and modern deployment strategies.
Experience supporting data platforms such as Airflow, EMR, and Dataproc.
Strong experience building and operating observability platforms including Prometheus, Mimir, Thanos, Grafana, and Datadog.
Solid understanding of SLIs, SLOs, alerting, and incident response.
Demonstrated ability to partner with engineering teams to identify pain points and improve developer experience.
Strong communication skills, including experience participating in or leading blameless post-mortems.
Benefits
Group Medical
Personal Accident
Term Life Insurance
Preventive healthcare including dental, vision, and OPD needs
DevOps and Build Engineer for NVIDIA developing and maintaining CI/CD pipelines. Collaborating with teams to enhance compiler technologies and optimize build performance in a diverse environment.
Senior AWS DevOps Developer responsible for managing AWS infrastructure for enterprise public budgeting software at Euna Solutions. Collaborating on cloud projects and enhancing system reliability and performance.
Principal AI Site Reliability Engineer driving operational excellence for critical contact center applications at Fidelity. Leading automation and observability initiatives to improve reliability and efficiency.
Data Transport Infrastructure DevOps Engineer at Leidos modernizing global - scale multi - cloud environments for USAF missions. Involves developing cloud - native solutions and ensuring security best practices.
DevOps Engineer responsible for building and optimizing AWS - based infrastructure and backend systems at Allguth GmbH. Part of a team focused on innovative mobility solutions in Munich region.
(Senior) DevOps Engineer specializing in ML solutions implementation and management in Germany. Focused on CI/CD pipelines, automation, and cloud services.
Specialist DevSecOps joining Periferia IT Group, a leader in digital transformation. Work in a dynamic environment with continuous learning and professional development opportunities.
Join Zinkworks as a Senior Platform Engineer designing scalable IaC - driven cloud platforms for a large - scale enterprise contact centre. Focused on automation, reliability, and platform ownership in a hybrid work environment.
Asset Reliability Engineer providing maintenance advice and service innovations. Join Sensorfact, the leading smart monitoring platform, to modernize the industrial sector.