Senior Manager, DevOps responsible for scaling and owning platform operations at FloQast. Collaborating with cross-functional teams and managing DevOps Engineers in a hybrid setting.
Responsibilities
Lead, mentor, and scale a DevOps organization; build career paths and leadership bench
Define and execute the DevOps, reliability, and observability strategy aligned with business goals
Own platform reliability, availability, and performance for a production SaaS platform
Establish and mature observability practices (metrics, logs, traces, alerts, dashboards)
Drive infrastructure initiatives across AWS focused on scalability, resilience, and modernization
Own and mature incident management including on-call, response, executive communication, and postmortems
Oversee day-to-day operational excellence including CI/CD, deployments, and environment health
Set and manage cloud cost strategy, forecasting, and optimization in partnership with Finance
Partner with Security and Compliance on SOC 2, SOX, and audit readiness
Support AI/ML and data platform workloads as part of the broader infrastructure strategy
Requirements
10+ years of DevOps / SRE / Infrastructure experience
4+ years managing DevOps or Platform teams
Deep expertise with AWS at scale (multi-account, networking, IAM)
Strong hands-on background with Terraform, Kubernetes, and CI/CD
Proven ownership of incident management and operational maturity
Experience building and operating observability platforms for SaaS systems
Experience with AI/ML or data-intensive platforms
Experience with observability tools such as Datadog, Grafana, Prometheus, and OpenTelemetry
DevOps and Build Engineer for NVIDIA developing and maintaining CI/CD pipelines. Collaborating with teams to enhance compiler technologies and optimize build performance in a diverse environment.
Senior AWS DevOps Developer responsible for managing AWS infrastructure for enterprise public budgeting software at Euna Solutions. Collaborating on cloud projects and enhancing system reliability and performance.
Principal AI Site Reliability Engineer driving operational excellence for critical contact center applications at Fidelity. Leading automation and observability initiatives to improve reliability and efficiency.
Data Transport Infrastructure DevOps Engineer at Leidos modernizing global-scale multi-cloud environments for USAF missions. Involves developing cloud-native solutions and ensuring security best practices.
DevOps Engineer responsible for building and optimizing AWS-based infrastructure and backend systems at Allguth GmbH. Part of a team focused on innovative mobility solutions in the Munich region.
(Senior) DevOps Engineer specializing in ML solutions implementation and management in Germany. Focused on CI/CD pipelines, automation, and cloud services.
Specialist DevSecOps joining Periferia IT Group, a leader in digital transformation. Work in a dynamic environment with continuous learning and professional development opportunities.
Join Zinkworks as a Senior Platform Engineer designing scalable IaC-driven cloud platforms for a large-scale enterprise contact centre. Focused on automation, reliability, and platform ownership in a hybrid work environment.
Asset Reliability Engineer providing maintenance advice and service innovations. Join Sensorfact, the leading smart monitoring platform, to modernize the industrial sector.