Join a tech company to manage Linux server infrastructure as a Sysadmin/SRE. Focus on automation, reliability, and technical support within a hybrid work setting.
Responsibilities
Configure, maintain, and scale Kubernetes clusters in on-premises environments, ensuring high availability and performance.
Manage and optimize the infrastructure of physical and virtual servers, emphasizing automation and environment reliability.
Automate repetitive provisioning, configuration, and monitoring tasks for servers and applications using tools such as Ansible, Terraform, Puppet, etc.
Implement and manage API Gateway solutions to control traffic and optimize communication between microservices and systems.
Create and maintain monitoring and alerting systems to provide real-time visibility into the health of infrastructure and services using tools such as Dynatrace, Datadog, Prometheus, Grafana, ELK Stack, or similar.
Provide real-time support for infrastructure issues and collaborate with development teams to diagnose and resolve incidents.
Propose and implement infrastructure improvements focused on automation, security, performance, and reduction of operational costs.
Create and maintain detailed technical documentation of procedures, processes, and configurations.
Requirements
Strong expertise in Kubernetes, including installation, configuration, maintenance, and cluster scalability. (Certified Kubernetes Administrator - CKA)
Expertise in Linux systems administration in on-premises environments, including installation, configuration, and maintenance of physical and virtual servers. (Red Hat Certified Engineer - RHCE)
Experience with infrastructure automation using tools such as Ansible, Terraform, Puppet, or similar.
Strong knowledge of API Gateways (e.g., Kong, Apigee), with experience configuring and managing API traffic in on-premises environments.
Software Engineer focused on mobile DevOps at T - Mobile, designing scalable software solutions for CI/CD environments. Collaborating with teams to deliver mobile applications with high reliability and performance.
Junior Dev Ops Engineer building and maintaining analytics platforms at Rabobank. Collaborating with experienced engineers using Azure, Cloud, Databricks, and Terraform.
Medior Java Developer responsible for Global Client Data System in cloud environment. Collaborating in international teams to enhance data services at Rabobank.
Senior DevOps Engineer designing and improving Zscaler - based services for secure internet access. Collaborating with global teams and working in a complex IT environment for Rabobank.
DevOps Engineer operating and improving Kafka infrastructure on private cloud for Telia. Collaborating on advanced messaging solutions and driving DevSecOps practices with open - source platforms.
DevOps Engineer at Helpshift responsible for GCP infrastructure and AI deployment pipelines. Ensuring production monitoring, security, and CI/CD excellence with a hybrid work model.
DevOps Engineer in a digital venture building a technology platform for B2B marketplace. Collaborating with teams to improve delivery speed, code quality, and automate processes.
Senior Cloud Site Reliability Engineer responsible for daily operations of Solace Cloud services across cloud platforms. Ensuring reliability and efficiency in a hybrid work environment.
Senior DevOps Engineer at Parser focusing on deploying and maintaining cloud - based products with AWS. Collaborating across technical teams and ensuring robust solutions for business needs.
Safety and Reliability Engineer focusing on safety assessments and reliability evaluations at Collins Aerospace. Lead analyses and ensure designs meet certification standards.