Site Reliability Engineer at Equifax ensuring reliability and performance of distributed fault-tolerant systems. Collaborating with teams to build cost-effective systems with high uptime metrics.
Responsibilities
Work in a DevSecOps environment responsible for the building and running of large-scale, massively distributed, fault-tolerant systems.
Work closely with development and operations teams to build highly available, cost effective systems with extremely high uptime metrics.
Work with cloud operations team to resolve trouble tickets, develop and run scripts, and troubleshoot
Create new tools and scripts designed for auto-remediation of incidents and establishing end-to-end monitoring and alerting on all critical aspects
Build infrastructure as code (IAC) patterns that meets security and engineering standards using one or more technologies (Terraform, scripting with cloud CLI, and programming with cloud SDK).
Participate in a team of first responders in a 24/7, follow the sun operating model for incident and problem management.
Requirements
BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent job experience required
2-5 years of experience in software engineering, systems administration, database administration, and networking.
1+ years of experience developing and/or administering software in public cloud
Experience in monitoring infrastructure and application uptime and availability to ensure functional and performance objectives.
Experience in languages such as Python, Bash, Java, Go JavaScript and/or node.js
Demonstrable cross-functional knowledge with systems, storage, networking, security and databases
System administration skills, including automation and orchestration of Linux/Windows using Terraform, Chef, Ansible and/or containers (Docker, Kubernetes, etc.)
Proficiency with continuous integration and continuous delivery tooling and practices
Cloud Certification Strongly Preferred
Benefits
Hybrid work setting
Comprehensive compensation and healthcare packages
Attractive paid time off
Organizational growth potential through online learning platform with guided career tracks
DevOps Engineer managing Kubernetes deployments for health tech company. Collaborating with engineering teams to enhance healthcare services using advanced technologies.
DevOps Engineer at PointClickCare, empowering innovative healthcare with Kubernetes and automation expertise. Work remotely while supporting crucial healthcare technology solutions.
Entry Level DevOps Engineer at Podimo, building scalable cloud infrastructure for a podcast platform. Collaborate with development teams and leverage AI tools to enhance the platform.
DevOps Engineer managing AWS infrastructure while contributing to backend code in Node.js and Python. Join Auterion building AI - powered software for autonomous systems.
Cloud DevOps Engineer managing Azure infrastructure at Medical Guardian. Overseeing technical operations and security response in a hybrid work environment.
SRE Linux/Unix System Administrator at Broadridge with strong Unix/Linux Bourne/Bash Scripting skills. Collaborating in a hybrid, fast - paced environment to manage critical systems.
Senior Site Reliability Engineer at Rootly embedding with teams to enhance service performance and reliability. Own CI/CD pipelines and drive capacity planning efforts in a fast - paced environment.
DevOps Engineer improving CI/CD pipelines and best practices for Datatonic's AI and data projects. Collaborate with clients to enhance infrastructure and drive innovation in tech.
Senior/Principal DevOps Engineer developing robust CI/CD pipelines for ClubWPT Gold at a hypergrowth startup. Collaborate globally to revolutionize online gaming experiences while maintaining high technical standards.
DevOps Engineer responsible for the health, performance, and automation of gaming platform services. Focused on CI/CD pipelines, infrastructure services, and application monitoring.