Cloud Operations Engineer designing and implementing highly reliable cloud solutions. Leading cloud infrastructure initiatives for production operations and customer success in a growing team.
Responsibilities
Provide technical leadership in cloud architecture, operational excellence, reliability, and cost optimization across large-scale production environments.
Stay current with industry trends and best practices, and leverage AI technologies and cloud service provider platforms (AWS, Google Cloud, and Azure) to improve operational efficiency, scalability, security, and resiliency.
Design and ensure secure, reliable, and high-performance communication across multiple regions and cloud service providers.
Configure, tune, and operate middleware services, including SQL and NoSQL databases, messaging and streaming platforms, and related infrastructure components.
Evaluate, recommend, and lead the adoption of CloudOps and DevOps tools, platforms, and automation solutions.
Troubleshoot complex production infrastructure and application issues, providing deep technical expertise and hands-on support when required.
Drive root cause analysis (RCA), implement corrective actions, and establish preventive measures to avoid recurrence.
Collaborate closely with engineering teams and cloud architects in system design discussions, architecture reviews, and whiteboard sessions.
Partner with Development, QA, SRE, and external service providers or carriers to resolve issues and improve system reliability.
Design, implement, and evolve deployment automation platforms for Kubernetes-based microservices.
Improve service availability, performance, and scalability through automation, tooling, capacity planning, and process improvements.
Analyze system and service performance, identify bottlenecks, and deliver actionable recommendations to improve efficiency and resilience.
Requirements
8+ years of experience in a CloudOps / DevOps role.
Hands-on experience with AWS or another public cloud (Azure, GCP, etc.).
Knowledge of Linux, security, and networking fundamentals.
Working knowledge of container-based architecture and deployment (Docker, Kubernetes).
Working knowledge of deployment automation development (Terraform, Helm, ArgoCD).
Experience in diagnosing and resolving complex application problems.
Working knowledge of Elasticsearch, PostgreSQL, Redis, Ignite, Flink, Kafka, and RabbitMQ.
Experience with monitoring tools (Nagios, Grafana, Prometheus).
Experience with cloud security and compliance implementation is a plus.
Strong follow-through and initiative to stay with issues until they are resolved.
Comfortable working within a distributed team located in multiple time zones.
Benefits
Inclusion is one of our core values and in our DNA. We are committed to fostering an inclusive workplace that embraces our differences and creates an atmosphere where all our employees thrive because of their differences, not in spite of them.