About the role

The CloudOps SRE ensures the reliability and efficiency of Cloud Services hosted in AWS and Azure, collaborating closely with development, operations, and infrastructure teams on cloud infrastructure management.

Responsibilities

We are seeking a talented CloudOps SRE to join our team and help ensure the reliability, scalability, and efficiency of our Cloud Services. You will work closely with our development, operations, and infrastructure teams to design, implement, and maintain our Cloud infrastructure hosted within AWS and Azure. You will be responsible for monitoring the health and performance of the Cloud infrastructure and customer environments, optimizing resource utilization, and implementing automation to streamline operations. This is an exciting opportunity to be part of a dynamic team and contribute to the success of our cloud-native applications.

5+ years’ experience managing Azure/AWS IaaS and SaaS
Strong experience in deploying and managing Azure or AWS infrastructure in a Production environment
Ability to work independently and as part of a team, and to multitask to a high degree
In-depth knowledge of AWS and Azure Cloud Infrastructure administration
Experience with infrastructure-as-code tools such as Terraform
Experience with monitoring and logging tools such as Prometheus, Grafana, and Graylog
Solid understanding of networking concepts and protocols (TCP/IP)
Strong scripting and automation skills (e.g., Bash, Python)
Experience with CI/CD tools like Jenkins or Azure Pipelines
Excellent problem-solving and troubleshooting skills
Strong communication and collaboration skills to work effectively in cross-functional teams
Defining and implementing Service Level Indicators and Service Level Objectives
Building strong observability practices within end customers' platforms
Creating dashboards and configuring alerts to provide real-time visibility into system health
Experience in designing, analysing, and troubleshooting distributed systems
Knowledge of Linux/Unix fundamentals and TCP/IP networking
Excellent communication skills when dealing with both technical and non-technical stakeholders
**Preferred Qualifications/Skills:**
AWS/Azure certified
Experience with configuration management tools like Ansible and Chef

Professional growth and development opportunities
Working within a team of friendly, skilled people where help is always within reach
Flexible working hours
4 recharge days per year: 1 day each quarter on which the entire company pauses across all geographies. The day can be spent in whatever way helps you recharge, regain energy, and dive back into the next workday
High-end laptop (Dell or Mac)
Competitive pay and bonus
18 vacation days per calendar year, in addition to 15 days of sick/casual leave
16 hours of paid volunteer time off per year
26 weeks of paid maternity leave and one week of paid paternity leave.
Health Insurance of up to 7 lacs covering self, spouse, 4 dependent children, and parents. 100% of the premium is paid by Vendavo.
Group Term Insurance coverage of up to three times Annual CTC. Dependents are not covered.
Group Personal Accident coverage of up to three times Annual CTC. Dependents are not covered.
Provident fund contributions