Site Reliability Engineer enhancing system reliability and deployment practices at OpenLoop. Collaborating with cross-functional teams for incident management and performance tuning.
Responsibilities
Partner with engineering teams to improve system reliability and deployment practices
Engage with OpenLoop teams on SRE guidelines and best practices for automation and infrastructure
Work with security teams to implement secure, compliant infrastructure
Ensure 24/7 system availability and rapid incident response
Implement and maintain disaster recovery and business continuity plans
Tune performance by identifying bottlenecks at the infrastructure, application, and database layers
Advocate for a blameless culture and continuous improvement
Collaborate closely with product and engineering to make reliability a shared responsibility
Requirements
2-3 years of experience in infrastructure, DevOps, or Site Reliability Engineering
Good background in AWS, particularly with serverless architectures
Understanding of observability and incident management
Strong knowledge of at least one programming language (TypeScript, Python, Go, etc.). Previous experience as a developer is a plus
Knowledge of Linux/Unix systems and networking
Experience with Infrastructure as Code (AWS CDK, CloudFormation)
Experience managing monitoring and observability tools (Prometheus, Grafana, ELK, etc.)
Knowledge of CI/CD pipelines and deployment automation (GitHub Actions, GitLab CI, etc.)
Understanding of database systems and performance optimization
Leadership & Communication
English (C1) fluency
Excellent verbal and written communication skills
Ability to translate technical concepts to non-technical audiences
Good problem-solving and decision-making capabilities
Experience with agile methodologies
Benefits
Formal employment (“Planilla”) under a Peruvian entity — all legal benefits in soles (CTS, Gratificaciones, etc.).
Full-time schedule: Monday–Friday, 9am–6pm.
Unlimited vacation days 🏖️ — yes, we mean it!
EPS healthcare (Rimac) covered 100%.
Oncology insurance (Rimac) covered 100%.
AFP retirement plan.
Coworking access in Miraflores, Lima — with free beverages, talks, bicycle parking, and amazing city views.