Senior Data & Site Reliability Engineer at Stefanini ensuring the reliability and operation of data platforms and analytical services.
Responsibilities
El Data & Site Reliability Engineer Senior es responsable de garantizar la confiabilidad, estabilidad y operación continua de las plataformas de datos y servicios analíticos de la organización.
Este rol combina las mejores prácticas de Site Reliability Engineering (SRE) y Data Reliability Engineering (DRE), enfocándose en la prevención de incidentes, automatización de procesos, reducción del tiempo de recuperación ante fallos (MTTR) y mejora de la experiencia operativa de extremo a extremo.
Lidera la definición y gobierno de indicadores de servicio (SLIs/SLOs) como frescura, completitud, latencia, confiabilidad y disponibilidad, impulsando la evolución hacia modelos operativos IOps y NoOps.
Requirements
Mínimo 2 años o más de experiencia en roles de SRE, DRE, DevOps o ingeniería de plataformas de datos en ambientes productivos.
Experiencia comprobable liderando incidentes críticos y proyectos de automatización en entornos de datos.
2+ años de experiencia en roles SRE, DRE, DataOps o Platform Engineering
Dominio de Apache Airflow: gestión de DAGs, depuración, optimización de pipelines
Experiencia con dbt (data build tool): modelos, pruebas, linaje de datos
Conocimiento de Amazon Redshift: administración, optimización de consultas, WLM
Manejo de Grafana + Prometheus: dashboards, alertas, PromQL
Experiencia con OpsGenie o herramienta equivalente de gestión de alertas
Conocimiento de AWS Glue, Lambda, CloudWatch
Familiaridad con metodologías SRE: error budgets, SLOs, SLIs, SLAs
Experiencia con Jira Service Management o herramienta ITSM equivalente.
Graduate Reliability Engineer at GKN Aerospace enhancing operational excellence through data analysis and project participation within large structural assemblies.
Site Reliability Engineer at WRITER, ensuring 24/7 availability and performance of AI - powered workflows. Collaborating on scalable infrastructure solutions while impacting enterprise customer trust.
Engineer at Trading Technologies improving platform stability through coding and automation. Focus on building advanced monitoring tools for global trading operations.
Senior ML Ops/DevOps developing MLOps platform components at Capco Poland for financial digital transformation. Responsibilities include CI/CD, model deployment, monitoring, and team collaboration.
Senior DevOps Engineer at Verisk, focusing on AWS infrastructure and CI/CD pipeline automation. Ensuring high availability and security through collaboration with development and QA teams.
Senior DevOps & Infrastructure Engineer at IMAGO focusing on automation and infrastructure improvements. Building reliable infrastructure and leading CI/CD optimization in a dynamic environment.
DevOps Specialist creating and overseeing Azure hybrid cloud infrastructures for EVLO's battery energy storage solutions. Collaborating with teams to implement cutting - edge technologies in a dynamic environment.
Software Quality and Release Engineer developing and maintaining C++/Python software solutions for aerospace and defense industry. Collaborating on CI/CD automation and feedback documentation.
Senior DevOps Engineer building and managing big data platforms for clients in telecommunications and finance industries. Ensuring stability, scalability, and performance across cloud and on - premise environments.
Site Reliability / DevOps Engineer developing Big Data platforms for clients in Telco and Retail industries. Focus on stability, scalability, and performance of large - scale data processing systems.