CloudOps SRE ensuring reliability and efficiency of Cloud Services hosted in AWS and Azure. Collaborating closely with development, operations, and infrastructure teams for cloud infrastructure management.
Responsibilities
We are seeking a talented CloudOps SRE to join our team and help ensure the reliability, scalability, and efficiency of our Cloud Services. You will work closely with our development, operations, and infrastructure teams to design, implement, and maintain our Cloud infrastructure hosted within AWS and Azure. You will be responsible formonitoring the health and performance of the Cloud Infrastructure, Customer environments, optimizing resource utilization, and implementing automation to streamline operations. This is an exciting opportunity to be part of a dynamic team and contribute to the success of our cloud-native applications.
Strong experience in deploying, and managing Azure or AWS infrastructure in a Production environment
Ability to work independently and as a team. Multi-task to a high degree
In-depth knowledge of AWS, Azure Cloud Infrastructure administration
Experience with infrastructure-as-code tools such as Terraform
Experience with monitoring and logging tools Prometheus, Grafana, and Graylog
Solid understanding of networking concepts and protocols (TCP/IP)
Strong scripting and automation skills (e.g., Bash, Python)
Experience with CI/CD tools like Jenkins or Azure Pipelines
Excellent problem-solving and troubleshooting skills
Strong communication and collaboration skills to work effectively in cross-functional teams
Defining and implementing Service Level Indicators and Service Level Objectives
Building strong observability practices within end customers platforms
Create dashboards and configuring alerts to provide real-time visibility into system health
Experience in designing, analysing, and troubleshooting distributed systems
Knowledge of Linux/Unix fundamentals and TCP/IP networking
Excellent communication skills when dealing with both technical and non-technical stakeholders
**Preferred Qualifications/Skills:**
AWS/Azure certified
Experience with configuration management tools like Ansible and Chef
Benefits
Professional growth and Development opportunities.
Working within a team of friendly, skilled people where help is always within reach
Flexible working hours
4 recharge days, where the entire company goes on a brief pause in all geographies for 1 day each quarter. This day can be spent in whatever way helps you recharge, to regain energy, and dive back into the next workday
High-end laptop (Dell or Mac)
Competitive pay and bonus
18 vacation days in a year in addition to 15 days Sick Leave/ Casual leave per calendar year.
16 hours of paid volunteer time off per year
26 weeks of paid maternity leave and one week of paid paternity leave.
Health Insurance of up to 7 lacs for self, spouse, 4 dependent children, and parents. 100% of the premium is paid by Vendavo and it covers the employee, spouse, children, and their parents.
Group Term Insurance coverage up to three times of their Annual CTC . Dependents are not covered.
Group Personal Accident coverage up to three times of Annual CTC. Dependents are not covered.
DevOps Engineer working closely with engineering and security teams to optimize CI/CD pipelines and manage infrastructure. Ensuring security and compliance for mission - critical financial applications.
Build and scale cloud infrastructure that powers Heidi's healthcare AI platform. Work with AWS and Azure while enhancing automation and reliability in an innovative healthtech startup.
Infrastructure - as - Code DevOps Engineer designing and managing cloud - native platforms at Vodafone. Collaborating with agile teams for digital transformation and business success.
Director of Data Engineering leading a strategic DevOps team within Enterprise AI. Balancing leadership with hands - on expertise to enable AI technology adoption.
Join a Data Engineering Team as a Senior DevOps to support multiple Data & AI initiatives. Utilize cloud technologies and enhance data pipelines in a collaborative environment.
Principal Site Reliability Engineer at Early Warning designing performance and resiliency patterns for applications and infrastructure. Collaborating with development teams to improve systems and data integrity.
DevOps Engineer contributing to CI/CD setup and Azure services management. Collaborates with teams to ensure efficient project delivery in a hybrid environment.
IT DevOps Specialist at BMW responsible for analyzing requirements and implementing software solutions in AWS cloud environments. Collaborating internationally within agile teams for digital transformation projects.
DevOps Engineer at Vistra designing, implementing, and maintaining robust CI/CD pipelines and cloud infrastructure. Enabling software delivery across multiple technology stacks with a focus on AWS.
Manage complex customer rollouts and initial system deployments at Talex.ai. Bridging technical development with real - world application in robotics and AI systems.