Monitor cloud environments (AWS, GCP etc.) for resource performance, availability, and security
Detect and report network and system anomalies in cloud environments such as downtime, latency issues, or performance degradation
Escalate critical issues related to cloud services, resources, and applications to senior technical teams for prompt resolution
Assist in monitoring and managing cloud resources (servers, virtual machines, storage, databases, etc.), ensuring proper allocation and optimization
Review and analyse logs from cloud services and platforms (e.g., CloudWatch, ELK) to identify patterns or issues that need resolution
Perform regular health checks on cloud infrastructure, services, and applications to ensure uptime and prevent issues
Setup monitoring and perform automation tasks, such as auto-scaling, load balancing, and resource provisioning in the cloud
Maintain and update records of cloud infrastructure status, incidents, troubleshooting steps, and resolutions
Provide status updates to internal stakeholders or customers regarding cloud-related incidents or maintenance schedules
Work with senior cloud engineers and IT teams to resolve cloud infrastructure issues and optimize performance
Requirements
Bachelor’s degree in computer science, Information Technology, Cloud Computing, or related field (or equivalent)
Minimum 1 to 2 Years.
Must have skills: Monitoring & Observability Tools knowledge (Grafana, New Relic, Zabbix, ELK, AWS CloudWatch etc.)
Familiarity with Cloud platforms (AWS, GCP etc.) and ability to monitor, manage, and troubleshoot cloud infrastructure and services.
Working knowledge of AWS CloudWatch including creating monitors, setting up alerts, and analysing logs to detect and troubleshoot infrastructure issues.
Familiarity with Networking concepts (TCP/IP, DNS, DHCP, etc.) and cloud networking configurations.
Understanding of virtual machines, cloud storage, and cloud databases.
Must have Python/Shell scripting knowledge. (Atleast working knowledge is desirable).
Good knowledge & understanding of Operating Systems (Linux, Windows).
NOC - Techniker:in ensuring the operation of the internet network at TNG Stadtnetz GmbH. Collaborating with team members to handle incidents and optimize network performance.
NOC Analyst responsible for monitoring applications and ensuring service connectivity across offices. Requires technical skills to maintain optimal performance and resolve disruptions.
Designing and planning service tools for performance monitoring at BT, a leader in secure connectivity. Collaborating across teams to develop solutions for business requirements.
Network Operations Engineer managing implementation and support of network technologies across Data Centres. Collaborating with various vendors on operational network support.
Junior Engineer in Network Operations supporting Organon's global network continuity and assisting with network - related issues. Requires a Bachelor's degree and a desire to learn networking technologies.
NOC Technician providing first - line and second - line technical support for data protection services at Verinext. Collaborating with teams to resolve incidents and maintain service continuity.
Network Control Technician role in Control Operations at Affinity Water, providing technical support during water network incidents. Assisting in managing operations on a 24/7 basis.
Network Operations Manager for TIM ensuring high performance in fiber optic networks operations. Managing teams and improving operational metrics in a complex telecommunications environment.
NOC Shift Lead overseeing network engineers and technicians at CACI. Ensuring network uptime and operational efficiency in a 24/7 environment with strong leadership skills.