Sr. Site Reliability Engineer (SRE) III providing technical solutions for the federal government. Collaborating in a high-performing team focused on reliability and application scalability.
Responsibilities
Design, deploy, and maintain mission-critical application workloads in virtualized or containerized environments (e.g., VMware or Kubernetes), ensuring scalability, availability, and compliance with government requirements.
Develop and sustain automated CI/CD pipelines, monitoring, and configuration management workflows to support reliable software delivery and operational observability across development, integration, staging, and production environments.
Provision, configure, and maintain developer environments and toolchains to support rapid, secure, and efficient development workflows, enabling mission-aligned software delivery.
Identify developer friction across the software development lifecycle and implement solutions to reduce that friction and provide developer-first environments.
Establish and maintain a high level of customer trust and confidence through deep technical expertise, and use creativity to provide innovative solutions that fit the customer’s mission needs.
Requirements
Active Top Secret security clearance with SCI eligibility.
Certification meeting DoD 8140 requirements (e.g., Security+ or higher).
Bachelor’s degree in Computer Science or a related engineering field is preferred; relevant experience may substitute.
7+ years of experience in software development, systems engineering, or operations roles with responsibility for availability, performance, and reliability of production systems.
Demonstrated experience blending software engineering and systems administration practices to support highly available, scalable applications.
Experience designing and managing monitoring, alerting, and observability solutions to meet defined Service Level Objectives.
Experience leading or participating in incident response, root cause analysis, and continuous improvement activities.
Experience with Ansible and Desired State Configuration.
Experience with GitLab CI/CD automation and Bash scripting.
Experience supporting container-native storage and object storage solutions (e.g., MinIO, S3-compatible services, and Portworx).
Experience with enterprise load-balancing solutions (e.g., F5 or similar platforms).
Ability to contribute immediately with minimal ramp-up in a mission-critical operational environment.
Site Reliability Engineer at Thales managing secure cloud environments on AWS and GCP. Ensuring compliance, security, and availability of critical cloud platforms with DevSecOps practices.
DevOps Intern at CCC Intelligent Solutions focusing on cloud infrastructure management and AI model deployment. Gaining hands-on experience in DevOps and cloud automation with AWS and Azure.
System Administrator and DevOps Engineer managing running systems in web-based software development. Collaborating on infrastructure and supporting product development with a small team in Bremen.
DevOps Engineer collaborating with teams to ensure reliable software delivery. Focus on CI/CD workflows and platform services within a hybrid work environment.
DevSecOps Engineer embedding security controls into AI development workflows within a financial services environment. Focus on securing AI-generated code through CI/CD pipeline enhancements and SAST integration.
DevOps Engineer ensuring the stability, scalability, and security of systems in all environments at a leading fintech company. Focused on automation and CI/CD to enhance operational excellence and collaboration.
DevOps Engineer focused on SRE principles and AWS-centric infrastructure for VERBI Software. Managing reliability, scalability, and modernization with a hands-on approach and technical mentorship.
Senior DevOps Engineer translating high-level requirements into scalable AWS architectures. Leading delivery of robust solutions and elevating engineering standards across the organization.