Director of Platform and Infrastructure Engineering leading Katalon’s cloud-based infrastructure operations and CloudOps strategy. Ensuring high availability, reliability, and security for global users.
Responsibilities
Lead the architecture, design, and end-to-end management of Katalon’s cloud infrastructure (AWS/GCP), ensuring security, scalability, and high availability.
Drive CloudOps strategy and roadmap aligned with CTO direction; oversee provisioning, monitoring, observability, incident response, and disaster recovery.
Build and optimize CI/CD pipelines, improve deployment frequency, system reliability, and MTTR, and promote SRE best practices.
Manage cloud budgets, optimize hosting costs, reduce the impact of cloud spend on COGS, and identify cost-saving opportunities.
Lead and develop CloudOps/SRE/DevOps teams, building strong capabilities in automation, reliability, cost optimization, and operational excellence.
Collaborate with Engineering, Security, and Finance to ensure platform performance, security readiness, and accurate cost planning.
Ensure compliance with cloud security standards and regulations, lead audits and remediation efforts, and maintain CloudOps policies, runbooks, and SLAs.
Requirements
Bachelor’s degree in Computer Science, Engineering, or a related field; advanced degree (MBA/MSc) preferred.
10+ years of experience in CloudOps/DevOps/SRE or infrastructure engineering, including 5+ years in senior leadership roles.
Proven success leading cloud or platform operations teams in a SaaS/product environment, with experience managing distributed or global teams.
Strong expertise in AWS/GCP (Azure optional), cloud architecture, networking, security, CI/CD (e.g., Argo, GitHub CI/CD), and Infrastructure as Code (Terraform, CloudFormation).
Deep hands-on knowledge of containers and orchestration (Docker, Kubernetes), observability tools, and incident management practices.
Demonstrated ability to optimize cloud costs, partner with Finance on budgeting/forecasting, and improve operational efficiency.
Exceptional problem-solving, communication, stakeholder management, and executive influencing skills.
Ability to build and scale high-performing teams and drive operational excellence in fast-paced environments.
Nice to Have
Experience in regulated industries with a strong focus on security and compliance.
Familiarity with Agile, DevOps, and SRE best practices.
Experience with modern monitoring and logging tools (Prometheus, Grafana, ELK).
Benefits
Competitive Pay & Bonuses: We believe in rewarding great work! You'll receive an attractive salary package plus performance bonuses to help you meet your financial goals.
Your Health & Happiness Matter: Take care of yourself with our comprehensive health coverage, flexible work options, and generous time off. We understand that life happens outside of work too!
Location-Tailored Benefits: Enjoy a complete benefits package designed specifically for your country, giving you the best coverage where you live.
Everything You Need to Succeed: Work with top-of-the-line equipment and enjoy modern facilities, plus helpful allowances to support your work setup.
A Place Where You Belong: Join our worldwide family where we celebrate what makes each of us unique. Here, everyone has a voice and equal opportunities to shine.
Room to Grow & Thrive: Your success is our success! We foster a trust-based culture where you can develop your skills, take on new challenges, and be recognized for your achievements.
Site Infrastructure Engineer managing HVAC and utility systems at SABIC. Overseeing maintenance, project activities, and long-term asset strategies for operational efficiency.
Key engineer developing and operating Web Application Firewall (WAF) platforms at Lloyds Banking Group. Enhancing security and performance while working with modern engineering practices.
Lead Infrastructure Engineer driving Edge Security capabilities for Lloyds Banking Group. Focusing on web access protection, Zero Trust architectures, and modern security engineering approaches.
Senior System Administrator & Infrastructure Engineer managing reliable infrastructure and driving DevOps practices at IMAGO. Collaborating with development teams and providing technical guidance to ensure best practices.
Infrastructure Engineer maintaining high availability of systems at mortgage platform provider Pylon. Focusing on developer productivity and codebase quality with instant feedback from peers.
Infrastructure Systems Engineer II managing production application support for Conduent. Collaborating on ITIL processes and incident management while working in a 24/7 environment.
OT Cybersecurity Specialist responsible for secure IT-OT infrastructures in industrial operations. Engaging in secure deployments, integrating cybersecurity frameworks, and providing expert support.
Infrastructure and Security Engineer collaborating on the design of secure architectures at CRG Solutions. Integrating cybersecurity best practices and managing incidents in Windows and Linux environments.
Senior Infrastructure Engineer managing global IT infrastructure for aviation solutions, focusing on VMware, Nutanix, and Windows Server environments. Collaborating with teams to ensure high availability and optimal performance in a hybrid work model.
Cloud Support Engineer maintaining operational stability and automation for Azure cloud platforms. Working collaboratively across IT teams to ensure infrastructure reliability and security.