Lead Site Reliability Engineering for dLocal, a fintech enabling payments in emerging markets. Oversee strategy, team growth, and operational excellence in a high-impact role.
Responsibilities
Define and drive the SRE strategy, vision, and roadmap for dLocal.
Lead and grow a multi-region SRE organization, including SRE Technical Referents and engineers at different seniority levels.
Partner closely with Product, Engineering, and Platform leaders to ensure we can scale safely, with clear reliability guardrails and strong operational excellence.
This is a high-impact, hands-on leadership role reporting to the VP of Cloud Platform. It suits someone who can move comfortably between strategy, architecture, and execution while coaching and empowering a senior, distributed team.
Requirements
Solid experience leading SRE / Production Engineering / Platform teams in high-availability, high-scale environments (fintech, payments, or similarly critical domains are a plus).
Proven track record managing managers and senior ICs, building and scaling distributed technical teams.
Deep hands-on expertise in reliability engineering: SLIs/SLOs, error budgets, capacity planning, resilience, and disaster recovery.
Senior Platform DevOps Engineer at Code Metal designing and implementing cloud and hybrid infrastructure to support customer deployments and internal platforms. Collaborating with software and security teams for reliable delivery.
DevOps Platform Intern managing cloud infrastructure and deployment pipelines for AI-native software delivery. Partnering with a Product Development Intern to set up and manage containerized applications on Azure Kubernetes Service.
UNIX DevOps Engineer managing AIX and Solaris server operations for a Swiss telecom company. Focusing on automation, optimization, and 24/7 monitoring across multiple locations.
Staff Site Reliability Engineer designing tools for Threat Protection Pro and NordLynx protocol. Working on globally distributed backend services for NordVPN with a focus on security and privacy.
Senior Site Reliability Engineer managing VPN and DNS services to ensure performance and reliability. Collaborating with application teams to maintain security and quality across global infrastructure operations.
Senior Site Reliability Engineer managing globally distributed VPN and DNS services. Optimizing service performance and handling security posture in a hybrid work environment.
Senior Site Reliability Engineer focused on observability for NordVPN. Designing monitoring systems and collaborating with data teams on anomaly detection.
Senior Site Reliability Engineer ensuring content accessibility across global edge infrastructure for NordVPN. Designing and troubleshooting systems critical to internet traffic management.
Staff Site Reliability Engineer designing and building backend services for NordVPN. High-ownership role focusing on system architecture and operational excellence.
Senior Site Reliability Engineer focused on traffic engineering at NordVPN. Working to enhance the world's most advanced VPN and online security solutions.