Network SRE engineer enhancing network operations, ensuring high availability and reliability while supporting user satisfaction. Collaborating on automation and operational improvements in a technical environment.
Responsibilities
Owning the operational aspect of the network infrastructure, ensuring its high availability and reliability.
Partnering with architecture and deployment teams to guarantee that new implementations are supportable and align with production standards.
Advocating for and implementing automation to reduce toil and enhance operational efficiency.
Monitoring network performance, identifying areas for improvement, and coordinating with relevant teams to execute enhancements.
Collaborating with SMEs to resolve production issues swiftly and effectively, maintaining customer satisfaction.
Identifying opportunities for operational improvements and partnering with teams to develop solutions that drive excellence and sustainability in network operations.
Requirements
BS degree in Computer Science, Electrical Engineering, or a related technical field, or equivalent experience.
Minimum of 8 years of industry experience in network site reliability engineering, network automation, network operations, or related areas.
Experience on both campus and data center networks.
Familiarity with network management tools such as Prometheus, Grafana, Alert Manager, Nautobot/Netbox, BigPanda.
Expertise in automating networks using frameworks such as Salt, Ansible, or similar.
In depth experience in one or more of the following: Python, Go.
Knowledge in network technologies such as TCP/UDP, IPv4/IPv6, Wireless, BGP, VPN, L2 switching, , Firewalls, Load Balancers, EVPN, VxLAN, Segment Routing.
Proven track record in network operations.
Skills with ServiceNow and Jira.
Knowledge of Linux system fundamentals is a plus.
Systematic problem-solving approach, coupled with excellent communication skills and a sense of ownership and drive.
DevOps and Build Engineer for NVIDIA developing and maintaining CI/CD pipelines. Collaborating with teams to enhance compiler technologies and optimize build performance in a diverse environment.
Senior AWS DevOps Developer responsible for managing AWS infrastructure for enterprise public budgeting software at Euna Solutions. Collaborating on cloud projects and enhancing system reliability and performance.
Principal AI Site Reliability Engineer driving operational excellence for critical contact center applications at Fidelity. Leading automation and observability initiatives to improve reliability and efficiency.
Data Transport Infrastructure DevOps Engineer at Leidos modernizing global - scale multi - cloud environments for USAF missions. Involves developing cloud - native solutions and ensuring security best practices.
DevOps Engineer responsible for building and optimizing AWS - based infrastructure and backend systems at Allguth GmbH. Part of a team focused on innovative mobility solutions in Munich region.
(Senior) DevOps Engineer specializing in ML solutions implementation and management in Germany. Focused on CI/CD pipelines, automation, and cloud services.
Specialist DevSecOps joining Periferia IT Group, a leader in digital transformation. Work in a dynamic environment with continuous learning and professional development opportunities.
Join Zinkworks as a Senior Platform Engineer designing scalable IaC - driven cloud platforms for a large - scale enterprise contact centre. Focused on automation, reliability, and platform ownership in a hybrid work environment.
Asset Reliability Engineer providing maintenance advice and service innovations. Join Sensorfact, the leading smart monitoring platform, to modernize the industrial sector.