Cloud/DevOps Specialist at N5X managing and optimizing critical cloud infrastructures for Brazilian energy trading. Collaborating with a multidisciplinary team to ensure high availability and performance.
Responsibilities
Design and provision cloud infrastructure on AWS or Azure using Terraform and CDK to ensure reproducible and consistent environments between development and production
Create and optimize CI/CD pipelines using GitHub Actions for Java applications and microservices, aiming for full automation of the deployment cycle
Manage and scale applications using AWS Fargate or EKS to ensure high availability and performance in pre-trade systems
Implement proactive monitoring with logging and tracing to anticipate incidents and ensure complete observability of the business
Design high-availability and disaster recovery strategies focused on resilience and operational continuity
Define security controls in IAM and VPC applying the principle of least privilege and ensuring compliance by design
Monitor and optimize cloud costs through FinOps practices to avoid waste and ensure financial efficiency
Evolve backup routines and periodic restore validation to ensure data integrity and availability
Promote a DevOps culture within the team to reduce manual work and support developers in resolving infrastructure issues
Requirements
Degree in Computer Science, Information Systems, Systems Analysis and Development (ADS), or related technology fields
Proven ability to design and operate cloud infrastructure on AWS and/or Azure, focusing on high availability, resilience, and security
Hands-on experience with Infrastructure as Code (Terraform, CDK, or similar), ensuring environment reproducibility and governance
Proficiency with CI/CD pipelines (e.g., GitHub Actions), implementing automated deployments, versioning, and safe rollbacks
Experience with containers (Docker) and orchestration (ECS/Fargate, EKS, or Kubernetes) in production environments
Strong knowledge of networking (TCP/IP, DNS, HTTP/S), VPCs, subnets, load balancers, and secure connectivity
Experience with observability (monitoring, logging, tracing) applied to distributed systems
Practical experience in cloud cost management (FinOps), including setting budgets, alerts, and optimizing consumption
Experience with scripting (Python, Bash, or Go) for automating operational tasks
Ability to operate systems with low-latency requirements, treating degradations as critical events
Fluency in Portuguese and English.
Benefits
Bonus: Up to two monthly salaries per year
Work model: Hybrid — 3 days in the office per week; participation in in-person team rituals (currently quarterly), in-person meetings with stakeholders, and events
Meal voucher: R$ 43.68 per business day
Food allowance: R$ 832.00 per month
Transportation: Round-trip to the office on in-person days with no payroll deduction
Health plan: SulAmérica with co-participation for the employee and their dependents — children and spouse
Life insurance: MetLife
Childcare assistance: Reimbursement of up to 40% of the base salary for children up to 24 months, and 35% for children between 24 and 71 months
Financial assistance for employees with children with disabilities: Amount equivalent to 50% of the base salary
Cloud/Devops Specialist responsible for designing a hybrid architecture combining cloud and on - premises infrastructure for energy trading systems. Collaborating with a multidisciplinary team in a dynamic environment.
Reliability Engineering Specialist utilizing reliability tools and models to improve asset performance at Enbridge. Collaborating across teams to guide investment decisions for safe operations.
DevOps Engineer responsible for structuring and supporting cloud DevOps architecture in Brazil. Working strategically on automation and CI/CD practices with development teams in Pernambuco.
DevSecOps Software Engineer developing secure CI/CD pipelines for Boeing's military software systems. Collaborate with cross - functional teams and implement automation and security best practices.
DevOps Manager responsible for managing a team for multi - cloud solutions supporting the USAF Cloud One project. Focus on scalable cloud - native solutions and CI/CD practices.
Lead Site Reliability Engineer overseeing SRE practices across Azure and GCP platforms. Driving reliability improvements and leading a team at Lloyds Banking Group.
DevOps Engineer responsible for managing Microsoft Intune operations at Bundesdruckerei GmbH. Focused on ensuring secure digital solutions for identity and data protection in Berlin.
Senior Site Reliability Engineer driving observability and reliability for business - critical systems at Incedo. Collaborating with engineering teams to enhance system resilience and performance.
DevSecOps Specialist securing the software development lifecycle at Vanguard. Collaborating with teams to improve application security tooling and processes, and provide development guidance.
Site Reliability Engineer automating infrastructure deployment for Scaleway's sovereign cloud products. Collaborating with product teams to enhance observability and reliability of the platform.