CloudOps SRE ensuring reliability and efficiency of Cloud Services hosted in AWS and Azure. Collaborating closely with development, operations, and infrastructure teams for cloud infrastructure management.
Responsibilities
We are seeking a talented CloudOps SRE to join our team and help ensure the reliability, scalability, and efficiency of our Cloud Services. You will work closely with our development, operations, and infrastructure teams to design, implement, and maintain our Cloud infrastructure hosted within AWS and Azure. You will be responsible formonitoring the health and performance of the Cloud Infrastructure, Customer environments, optimizing resource utilization, and implementing automation to streamline operations. This is an exciting opportunity to be part of a dynamic team and contribute to the success of our cloud-native applications.
Strong experience in deploying, and managing Azure or AWS infrastructure in a Production environment
Ability to work independently and as a team. Multi-task to a high degree
In-depth knowledge of AWS, Azure Cloud Infrastructure administration
Experience with infrastructure-as-code tools such as Terraform
Experience with monitoring and logging tools Prometheus, Grafana, and Graylog
Solid understanding of networking concepts and protocols (TCP/IP)
Strong scripting and automation skills (e.g., Bash, Python)
Experience with CI/CD tools like Jenkins or Azure Pipelines
Excellent problem-solving and troubleshooting skills
Strong communication and collaboration skills to work effectively in cross-functional teams
Defining and implementing Service Level Indicators and Service Level Objectives
Building strong observability practices within end customers platforms
Create dashboards and configuring alerts to provide real-time visibility into system health
Experience in designing, analysing, and troubleshooting distributed systems
Knowledge of Linux/Unix fundamentals and TCP/IP networking
Excellent communication skills when dealing with both technical and non-technical stakeholders
**Preferred Qualifications/Skills:**
AWS/Azure certified
Experience with configuration management tools like Ansible and Chef
Benefits
Professional growth and Development opportunities.
Working within a team of friendly, skilled people where help is always within reach
Flexible working hours
4 recharge days, where the entire company goes on a brief pause in all geographies for 1 day each quarter. This day can be spent in whatever way helps you recharge, to regain energy, and dive back into the next workday
High-end laptop (Dell or Mac)
Competitive pay and bonus
18 vacation days in a year in addition to 15 days Sick Leave/ Casual leave per calendar year.
16 hours of paid volunteer time off per year
26 weeks of paid maternity leave and one week of paid paternity leave.
Health Insurance of up to 7 lacs for self, spouse, 4 dependent children, and parents. 100% of the premium is paid by Vendavo and it covers the employee, spouse, children, and their parents.
Group Term Insurance coverage up to three times of their Annual CTC . Dependents are not covered.
Group Personal Accident coverage up to three times of Annual CTC. Dependents are not covered.
Cloud Engineer specializing in hybrid - cloud platform design and operation at Dun & Bradstreet. Collaborating closely with team members to enhance developer self - service and automation capabilities.
DevOps Engineer II evolving cloud infrastructure and CI/CD pipelines at HackerRank. Collaborating with teams to design, build, and optimize systems for developer productivity.
DevOps Engineer managing CI/CD pipelines and cloud infrastructure for mobile apps at Air Apps. Collaborating with teams to ensure app performance and reliability.
DevOps Engineer at Vodafone Romania delivering resilient infrastructure for software development lifecycle. Collaborating with Digital Squads and optimizing CI/CD pipelines for efficient deployments.
Mechanical/Reliability Engineer responsible for mechanical installations in Bergen op Zoom. Analyzing maintenance strategies and leading projects to enhance reliability.
Senior DevOps Engineer responsible for cloud infrastructure and deployments. Optimizing AWS services and ensuring system security and reliability for Verizon.
Senior DevOps Engineer responsible for automating infrastructure and building CI/CD pipelines for collaborative robotics company. Collaborating with global engineering teams from the Bangalore office.
Site Reliability Engineer Intern at Tencent working on gaming services and cloud native solutions. Collaborating with global teams to eliminate toil and enhance reliability.
Cloud/DevOps Specialist at N5X managing and optimizing critical cloud infrastructures for Brazilian energy trading. Collaborating with a multidisciplinary team to ensure high availability and performance.
Cloud/Devops Specialist responsible for designing a hybrid architecture combining cloud and on - premises infrastructure for energy trading systems. Collaborating with a multidisciplinary team in a dynamic environment.