Site Reliability Engineer supporting Vista Global’s production environments and cloud infrastructure. Delivering solutions using AWS, Terraform, Ansible, Docker, and Kubernetes in a hybrid model.
Responsibilities
Architect, build, and maintain scalable, secure cloud infrastructure in AWS using Terraform and Ansible
Design and manage containerized platforms using Docker, Kubernetes (EKS), ECS/Fargate, and Helm
Implement and optimize CI/CD pipelines in GitLab, enabling reliable and automated deployments
Apply best practices to ensure high availability, performance, monitoring, and incident response in 24/7 production environments
Support and troubleshoot production systems, focusing on scalability, resilience, and technical debt reduction
Mentor engineers and guide teams on cloud-native architecture and DevOps best practices
Maintain documentation (Confluence/Jira) and communicate effectively with technical and non-technical stakeholders
Requirements
Extensive, in-depth experience with cloud-based provisioning, monitoring, troubleshooting, and related SRE and DevOps technologies
3+ years in a technical role, with the ability to teach and influence engineers
Strong experience architecting cloud infrastructure with AWS
Strong experience with Linux and infrastructure as code (IaC)
Strong experience with containerization/orchestration technologies
Strong understanding of multiple source control systems such as GitLab or GitHub
Strong experience with CI/CD automation and configuration management
Experience working in a 24/7 on-call, highly transactional or streaming production environment
Must be able to ensure adherence to Agile/Scrum concepts and principles