NOC Engineer responsible for monitoring and troubleshooting cloud infrastructure. Join the Technology Operations Center team at Operative managing performance and service availability.
Responsibilities
Monitor cloud environments (AWS, GCP, etc.) for resource performance, availability, and security
Detect and report network and system anomalies in cloud environments such as downtime, latency issues, or performance degradation
Escalate critical issues related to cloud services, resources, and applications to senior technical teams for prompt resolution
Assist in monitoring and managing cloud resources (servers, virtual machines, storage, databases, etc.), ensuring proper allocation and optimization
Review and analyse logs from cloud services and platforms (e.g., CloudWatch, ELK) to identify patterns or issues that need resolution
Perform regular health checks on cloud infrastructure, services, and applications to ensure uptime and prevent issues
Set up monitoring and perform automation tasks, such as auto-scaling, load balancing, and resource provisioning in the cloud
Maintain and update records of cloud infrastructure status, incidents, troubleshooting steps, and resolutions
Provide status updates to internal stakeholders or customers regarding cloud-related incidents or maintenance schedules
Work with senior cloud engineers and IT teams to resolve cloud infrastructure issues and optimize performance
Requirements
Bachelor’s degree in Computer Science, Information Technology, Cloud Computing, or a related field (or equivalent)
Minimum of 1 to 2 years of experience.
Must-have skills: knowledge of monitoring and observability tools (Grafana, New Relic, Zabbix, ELK, AWS CloudWatch, etc.)
Familiarity with cloud platforms (AWS, GCP, etc.) and the ability to monitor, manage, and troubleshoot cloud infrastructure and services.
Working knowledge of AWS CloudWatch including creating monitors, setting up alerts, and analysing logs to detect and troubleshoot infrastructure issues.
Familiarity with networking concepts (TCP/IP, DNS, DHCP, etc.) and cloud networking configurations.
Understanding of virtual machines, cloud storage, and cloud databases.
Must have Python/Shell scripting knowledge (at least a working knowledge is desirable).
Good knowledge and understanding of operating systems (Linux, Windows).