NOC Engineer responsible for monitoring and troubleshooting cloud infrastructure. Join the Technology Operations Center team at Operative managing performance and service availability.
Responsibilities
Monitor cloud environments (AWS, GCP, etc.) for resource performance, availability, and security
Detect and report network and system anomalies in cloud environments such as downtime, latency issues, or performance degradation
Escalate critical issues related to cloud services, resources, and applications to senior technical teams for prompt resolution
Assist in monitoring and managing cloud resources (servers, virtual machines, storage, databases, etc.), ensuring proper allocation and optimization
Review and analyse logs from cloud services and platforms (e.g., CloudWatch, ELK) to identify patterns or issues that need resolution
Perform regular health checks on cloud infrastructure, services, and applications to ensure uptime and prevent issues
Set up monitoring and perform automation tasks, such as auto-scaling, load balancing, and resource provisioning in the cloud
Maintain and update records of cloud infrastructure status, incidents, troubleshooting steps, and resolutions
Provide status updates to internal stakeholders or customers regarding cloud-related incidents or maintenance schedules
Work with senior cloud engineers and IT teams to resolve cloud infrastructure issues and optimize performance
Requirements
Bachelor’s degree in Computer Science, Information Technology, Cloud Computing, or a related field (or equivalent)
Minimum of 1 to 2 years of experience.
Must-have skills: knowledge of monitoring and observability tools (Grafana, New Relic, Zabbix, ELK, AWS CloudWatch, etc.).
Familiarity with cloud platforms (AWS, GCP, etc.) and ability to monitor, manage, and troubleshoot cloud infrastructure and services.
Working knowledge of AWS CloudWatch including creating monitors, setting up alerts, and analysing logs to detect and troubleshoot infrastructure issues.
Familiarity with networking concepts (TCP/IP, DNS, DHCP, etc.) and cloud networking configurations.
Understanding of virtual machines, cloud storage, and cloud databases.
Must have Python/Shell scripting knowledge (at least a working knowledge).
Good knowledge and understanding of operating systems (Linux, Windows).