Design, automate and manage a highly available and scalable cloud deployment that allows development teams to deploy and run their services.
Collaborating with engineering and Architects teams to evaluate and identify optimal cloud solutions, also leveraging scalability, high-performance and security.
Design and implement sustainable cloud and platform services.
Build a robust, scalable and stable infrastructure.
Manage hosting external containers in Private cloud.
Extensively automated deployments and managed applications in GCP.
Developing and maintaining cloud solutions in accordance with best practices.
Ensuring efficient functioning of data storage and processing functions in accordance with company security policies and best practices in cloud security.
Collaborate with Engineering teams to identify optimization strategies, help develop self-healing capabilities
Experience in developing a strong observability capabilities
Identifying, analysing, and resolving infrastructure vulnerabilities and application deployment issues.
Regularly reviewing existing systems and making recommendations for improvements.
Requirements
Proven work experience in designing, deploying and operating mid to large scale public cloud environments.
Proven work experience in Docker/Kubernetes (image building, k8s schedule)
Experience in package, config and deployment management via Helm, Kustomize, ArgoCD.
Proven working experience in onboarding and troubleshooting Cloud Services.
Proven work experience in provisioning Infrastructure as Code (IaC) using Terraform Enterprise or community edition.
Proven work experience in writing custom terraform providers/plug-ins with Sentinel Policy as Code
Professional Certification is an advantage
Public Cloud >> GCP is a good to have.
Strong knowledge in Github, DevOps (Cloud Build is an advantage)
Should be proficient in scripting and coding, that include traditional languages like Python, PowerShell, GoLang,Java, JS and Node.js.
Proven working experience in Messaging Middleware - Apache Kafka, RabbitMQ, Apache ActiveMQ
Proven working experience in API gateway, Apigee is an advantage.
Proven working experience in API development, REST.
Proven working experience in Sec and IAM, SSL/TLS, OAuth and JWT.
Extensive knowledge and hands-on experience in Grafana and Prometheus micro libraries.
Exposure to Cloud Monitoring and logging.
Experience with distributed storage technologies like NFS, HDFS, Ceph, S3 as well as dynamic resource management frameworks (Mesos, Kubernetes, Yarn)
Experience with automation tools should be a priority
Previous success in technical engineering
Must have > 5 overall experience
Must have >3 years of experience in public cloud
Must have >3 years of experience in Cloud Infrastructure provisioning
Must have >3 years of experience in Cloud Engineering
Must have >3 years of coding/automation experience(with Python/golang/shell)
Senior DevOps Engineer leading cloud - native solutions at Sparksoft Corporation. Driving automation and system reliability within a fast - paced Agile team.
Platform Engineer focusing on supporting CI/CD pipelines and Kubernetes at PCCW. Responsible for ensuring platform services' reliability and performance, with night - time support as needed.
Site Reliability Engineer at Bumble optimizing large - scale Linux environments and ensuring system stability. Focusing on troubleshooting, incident recovery, and performance tuning in complex infrastructures.
Senior DevOps Manager overseeing CI/CD processes for NVIDIA Networking products. Leading a team and collaborating with global teams to enhance R&D efficiency and infrastructure.
DevOps Manager overseeing engineering team developing scalable CI/CD processes for NVIDIA Networking products. Enhancing global R&D efficiency in a technology - focused company.
Join Operations Team as Senior Site Reliability Engineer driving operational excellence for cybersecurity solutions. Collaborate across teams to manage production platforms and optimize infrastructure.
Software Developer - DevOps System Administrator working within the SCMT team to enhance software application efficiency. Collaborating on tools and scripts for application lifecycle management.
DevOps Engineer managing CI/CD pipelines and Kubernetes deployments at Stefanini. Collaborating with teams to optimize application health and deployment processes.
DevOps Engineer working with development teams for seamless feature integration and deployment automation. Focus on CI/CD pipelines, monitoring solutions, and continuous process optimization.