Site Reliability Engineer responsible for architecting cloud infrastructure and containerized platforms at Vista Global. Implementing CI/CD pipelines and mentoring teams on best practices for production environments.
Responsibilities
Architect, build, and maintain scalable, secure cloud infrastructure in AWS using Terraform and Ansible.
Design and manage containerized platforms using Docker, Kubernetes (EKS), ECS/Fargate, and Helm.
Implement and optimize CI/CD pipelines in GitLab, enabling reliable and automated deployments.
Apply Best practices to ensure high availability, performance, monitoring, and incident response in 24/7 production environments.
Support and troubleshoot production systems, focusing on scalability, resilience, and technical debt reduction.
Mentor engineers and guide teams on cloud-native architecture and DevOps best practices.
Maintain documentation (Confluence/Jira) and communicate effectively with technical and non-technical stakeholders.
Requirements
Extensive in-depth experience with cloud-based provisioning, monitoring, troubleshooting, and related SRE and DevOps technologies, in addition to networking knowledge
3+ years in a technical role, ability to teach and influence engineers.
Strong experience architecting cloud infrastructure with AWS.
Strong experience with Linux and infrastructure as code IAC.
Strong experience with containerization/orchestration technologies.
Strong understanding of multiple source control systems such as GitLab or GitHub.
Strong Experience with CI/CD automation and configuration management.
Experience working in a 24/7 on-call, highly transactional or streaming production environment.
Must be able to ensure Agile/Scrum concepts and principles are adhered to, must be able to be a voice of reason.
Understanding of the SDLC.
Scripting and good foundational understating of programming with Python, and Bash (can include Typescript).
Demonstrates extensive knowledge of the principles, concepts, and theories in own discipline, and broad knowledge of principles and concepts of other functions.
Has developed extensive business knowledge and keeps current on industry trends.
Having a customer focus, drive for results, and strong ethics & values.
Must be a team player but also be able to work independently with minimal direction and supervision.
Possess a good understanding of multiple business applications, as well as experience in minicomputer or client/server environments including, but not limited to, the implementation and support of resource planning, sales automation, marketing, finance, and distribution systems.
Azure SRE Engineer responsible for designing and maintaining secure, scalable Azure cloud infrastructure. Driving automation and operational excellence for leading organizations in technology transformation.
Senior Manager of Site Reliability Engineering overseeing Workday Kubernetes based platform. Leading teams while ensuring high availability and collaborating with federal agencies.
Site Reliability Engineer focusing on AWS cloud environments, SRE practices, and system reliability within GFT's team. Collaborating on cloud migrations and observability initiatives.
Senior DevOps Analyst enhancing infrastructure automation in a transformative technology firm. Collaborating on innovative projects in sectors like healthcare, finance, and utilities in Brazil.
Consultant at Minsait supporting technical decisions in infrastructure automation and developing solutions. Collaborating with teams for maintaining and evolving automation platforms.
Practical Trainee focusing on hardware reliability engineering at Sonova. Support reliability improvement initiatives and work closely with experienced engineers on real - life product challenges.
Configuration Management Engineering Technician supporting naval shipbuilding projects with engineering documentation and configuration integrity. Establishing and maintaining relationships with stakeholders in the shipbuilding community.
Principal Configuration Management Engineering Technician contributing to major shipbuilding programs for national security. Leading Configuration Management teams and ensuring data integrity for advanced naval vessels.
Senior Configuration Management Engineering Technician at Babcock supporting naval engineering programmes across multiple ship configurations. Influencing critical decisions and contributing to engineering outcomes for national defence.
DevOps Engineer designing and managing scalable Azure cloud infrastructure for a financial technology company. Collaborating with teams to enhance system reliability and automate application delivery pipelines.