Senior Site Reliability Engineer managing the reliability and operational health of the Loan Origination System (LOS) for a fintech company. Collaborating with engineering teams in Brazil and the US to improve system reliability.
Responsibilities
Ensure the availability, performance, and reliability of the LOS through proactive monitoring and incident response.
Partner with product and engineering teams to define and maintain SLOs/SLAs, introduce error budgets, and drive accountability.
Collaborate on architectural improvements aimed at increasing resilience, scalability, and observability.
Lead incident analysis and postmortems, and implement preventive actions.
Design, build, and operate infrastructure as code (IaC) using Terraform.
Improve observability tooling and practices using Datadog, enhancing alerting, tracing, and system dashboards.
Participate in on-call rotations and respond to production incidents.
Automate operational processes and promote a DevOps culture across squads.
Requirements
5+ years of experience in Site Reliability Engineering or DevOps roles.
Proven experience managing and improving production systems in a cloud-native environment (preferably AWS).
Strong experience with observability tools and practices.
Experience defining and driving adoption of SLIs, SLOs, and SLAs.
Experience operating event-driven systems and distributed architectures.
Solid understanding of Terraform and infrastructure as code best practices.
Strong debugging and troubleshooting skills across the stack.
Comfortable writing and reviewing production-grade code (preferably in Java).
Excellent written and verbal communication in English.
A pragmatic and collaborative mindset, with a passion for system reliability and operational excellence.
Bachelor's degree in computer science or a related field preferred.
Benefits
Great Perks - We offer generous salaries, monthly lunches, and robust employee recognition and talent development programs to enhance your career with us.
Culture - We believe in maintaining a healthy work-life balance. While we work hard and care deeply about our customers and partners, we want you to have room for your family, friends, and yourself.
Growth - Company growth creates career growth. FHF’s year-over-year growth in revenue and new markets gives you the opportunity to establish and develop your career. We work with each employee to build a career plan that benefits everyone, and we have a proven record of investing in *you*.