DevOps Analyst managing the lifecycle of services for fitness management software. Supporting design, development, and automation to enhance service performance at Tecnofit.
Responsibilities
Support the entire service lifecycle, from inception and architecture to deployment and operations;
Provide consultancy and support for platform design, development, and capacity planning;
Maintain production services, measuring and monitoring availability, latency, and overall system health;
Improve systems sustainably through automation, aiming for solutions that are performant, scalable, and highly available;
Develop automations and processes for development teams.
Requirements
Advanced proficiency with AWS and Kubernetes;
Experience in highly critical, resilient environments;
Minimum 3 years as an SRE or Senior DevOps engineer;
Ability to lead complex technical projects and serve as a technical reference;
Proficiency with tools such as Prometheus, Grafana, and the ELK stack;
Technical and strategic mindset to operate in War Rooms / incident response;
Meal benefit: R$43.00 per day (only R$1 monthly contribution!) — also paid during vacation 🎉
Unimed Health Plan: no co‑payment from the start of the trial period and premiums paid by Tecnofit! 🩺
Dental plan (DentalUni): available during the trial period with no cost to the Tecnofitter! 🦷
Childcare assistance via reimbursement program 👶🏻
Life insurance: Allianz Seguros 🚩
Membership to Clube Gazeta do Povo!
Commuting allowance (optional): up to 6% payroll deduction. Note: we operate a hybrid model (3 days per week in the office) 🚌
Perks & snacks: we have Uneed and KUK shops in the office! For Tecnofitters who come in, we also offer fruit salads, coffee, tea and more! ☕ 🍎
Physical and mental health support: reimbursement for physical activities and outdoor adventures, as well as psychological, psychiatric and nutrition consultations 🏃🏽
Senior Site Reliability Engineer at Diligent leading reliability, automation, and observability across cloud infrastructure. Build tools for incident response and enhance performance in fast - paced environments.
Perception Deployment Engineer deploying deep learning models on embedded systems at Caterpillar. Collaborating with cross - functional teams for integration and optimization of perception modules in vehicles.
Principal Site Reliability Engineer at AT&T required to design scalable solutions for critical operations with minimal downtime. Collaborating with teams to monitor and improve system performance in cloud environments.
DevOps Engineer managing AI SaaS infrastructure at a high - growth European company. Supporting AI model deployment and ensuring platform security and compliance with multiple systems integration.
Engineering Manager leading teams for observability platforms at LexisNexis. Owns operational excellence across software delivery lifecycle in Raleigh, NC.
Reliability Engineer optimizing site facility infrastructure and utility systems at Roche. Conducting root cause analyses and developing maintenance plans to enhance reliability and efficiency.
DevOps SME designing, implementing, and operating multi - cloud platforms for The Missing Link. Collaborating with engineering, security, and operations teams while embedding DevOps best practices.
Site Reliability Engineer improving reliability of cloud infrastructure for an AI - specialized company. Taking ownership of monitoring and incident response processes in hybrid - working style.
DevOps Engineer leading automation for sophisticated release/deployment pipelines at Securonix. Focused on Python, Ansible, and cloud services to enhance security operations.
Senior Analyst on Data Platform DevOps at AIMCo, responsible for building data operations and collaborating with teams on innovative solutions. Focused on ensuring data quality and integrity across technologies.