Operations Center Manager leading the command center team at an IT company, ensuring high availability of critical IT infrastructure. The role requires extensive experience in process-driven IT operations.
Responsibilities
Manage the daily activities of the Operations Center (NOC), ensuring 24/7 coverage and rapid response to alerts.
Lead, mentor, and train a team of System Administrators and L1/L2 Support Engineers.
Manage shift schedules, handovers, and on-call rotations to ensure zero coverage gaps.
Oversee the health of the entire IT estate: Servers (Windows/Linux), Networks (LAN/WAN), Cloud (AWS/Azure), and Virtualization (VMware/Hyper-V).
Administer and tune monitoring platforms (e.g., SolarWinds, Nagios, Datadog, Zabbix, LogicMonitor, Elastic, Splunk).
Refine alert thresholds to reduce "alert fatigue" and ensure the team focuses on actionable signals.
Design and maintain real-time dashboards for leadership, visualizing uptime, latency, and system health.
Ensure patching schedules are executed on time and compliant with security policies.
Requirements
12+ years of experience in IT Operations, Infrastructure Support, or NOC environments.
2+ years of experience in a leadership or team lead role.
Deep understanding of the ITIL Framework (Certification is highly preferred).
Hands-on experience with monitoring tools: proficiency in configuring and managing tools such as LogicMonitor, Elastic, SolarWinds, PRTG, Nagios, Datadog, or New Relic.
Solid technical background in Server Administration (Windows/Linux) and basic Networking concepts (DNS, TCP/IP, Firewalls).
Crisis Management: Ability to stay calm and decisive during high-pressure outages.
Communication: Ability to translate complex technical issues into clear business updates for executives.
Analytical Thinking: A data-driven approach to identifying trends and inefficiencies.
ITIL v3 or v4 Foundation/Intermediate Certification.
Experience with ITSM tools such as ServiceNow, Jira Service Management, or BMC Remedy.
Basic scripting skills (PowerShell, Bash, or Python) for automation.
Experience in a Hybrid Cloud environment (On-prem + Azure/AWS).