Senior SRE driving incident management and operational excellence in financial software solutions. Work with innovation and technology on the team at Brazil's leading software company.
Responsibilities
Lead high-impact incidents end-to-end, including investigation, mitigation, service recovery and technical communication.
Resolve problems without ready-made solutions or documentation, investigating deep root causes and proposing definitive fixes.
Make technical decisions decisively under pressure, evaluating risks and impacts in production environments.
Create, review and standardize operational playbooks and incident response procedures.
Evolve reliability, resilience and observability practices for the operation.
Operate across production, staging and development environments, ensuring continuous availability.
Identify automation opportunities and improve operational workflows.
Provide technical support to internal and external customers with a consultative approach and clear communication.
Participate in technical calls and meetings with customers to analyze, update and drive issue resolution.
Act as a mentor, guiding less experienced professionals during incidents and promoting best practices.
Requirements
Bachelor's degree in a Technology-related field.
Strong experience in advanced troubleshooting, including:
– Deep log analysis;
– Validation of complex environments;
– Structured evidence collection;
– Investigation of critical incidents.
Experience supporting high-criticality production environments.
Proven ability to document, communicate and lead technically during incidents.
Benefits
Meal allowance or food card;
Flexible Benefit (Flash);
Health insurance;
Partners for psychological, legal, financial and nutritional support (CLUDE, C4LIFE and ASQ);
Psicologia Viva;
Dental assistance;
Childcare assistance;
Support for children with special needs;
Fertility treatment assistance;
Extended maternity and paternity leave;
Commuter allowance or Home Office allowance (for telework contracts);
Gympass (Wellhub) and TotalPass;
Flexible working hours;
Life insurance;
Partner discounts club;
Partnership with Sesc;
No dress code (casual dress);
Day off on your birthday;
Beca (education incentive program);
PPR or Bonus — based on achievement of targets and results.