Director of Platform and Infrastructure Engineering leading Katalon’s cloud-based infrastructure operations and CloudOps strategy. Ensuring high availability, reliability, and security for global users.
Responsibilities
Lead the architecture, design, and end-to-end management of Katalon’s cloud infrastructure (AWS/GCP), ensuring security, scalability, and high availability.
Drive CloudOps strategy and roadmap aligned with CTO direction; oversee provisioning, monitoring, observability, incident response, and disaster recovery.
Build and optimize CI/CD pipelines, improve deployment frequency, system reliability, and MTTR, and promote SRE best practices.
Manage cloud budgets, optimize hosting costs, reduce cloud impact on COGS, and identify cost-saving opportunities.
Lead and develop CloudOps/SRE/DevOps teams, building strong capabilities in automation, reliability, cost optimization, and operational excellence.
Collaborate with Engineering, Security and Finance to ensure platform performance, security readiness, and accurate cost planning.
Ensure compliance with cloud security standards and regulations, lead audits and remediation efforts, and maintain CloudOps policies, runbooks, and SLAs.
Requirements
Bachelor’s degree in Computer Science, Engineering, or related field; advanced degree (MBA/MSc) preferred.
10+ years of experience in CloudOps/DevOps/SRE or infrastructure engineering, including 5+ years in senior leadership roles.
Proven success leading cloud or platform operations teams in a SaaS/product environment, with experience managing distributed or global teams.
Strong expertise in AWS/GCP (Azure optional), cloud architecture, networking, security, CI/CD (e.g., Argo, GitHub CI/CD), and Infrastructure as Code (Terraform, CloudFormation).
Deep hands-on knowledge of containers and orchestration (Docker, Kubernetes), observability tools, and incident management practices.
Demonstrated ability to optimize cloud costs, partner with Finance on budgeting/forecasting, and improve operational efficiency.
Exceptional problem-solving, communication, stakeholder management, and executive influencing skills.
Ability to build and scale high-performing teams and drive operational excellence in fast-paced environments.
Nice to Have: Experience in regulated industries with a strong focus on security and compliance.
Familiarity with Agile, DevOps, and SRE best practices.
Experience with modern monitoring and logging tools (Prometheus, Grafana, ELK).
Benefits
Competitive Pay & Bonuses: We believe in rewarding great work! You'll receive an attractive salary package plus performance bonuses to help you meet your financial goals.
Your Health & Happiness Matter: Take care of yourself with our comprehensive health coverage, flexible work options, and generous time off. We understand that life happens outside of work too!
Location-Tailored Benefits: Enjoy a complete benefits package designed specifically for your country, giving you the best coverage where you live.
Everything You Need to Succeed: Work with top-of-the-line equipment and enjoy modern facilities, plus helpful allowances to support your work setup.
A Place Where You Belong: Join our worldwide family where we celebrate what makes each of us unique. Here, everyone has a voice and equal opportunities to shine.
Room to Grow & Thrive: Your success is our success! We foster a trust-based culture where you can develop your skills, take on new challenges, and be recognized for your achievements.
Infrastructure Engineer building an AI - powered platform for crisis management in Sweden. Collaborating with cross - functional teams to save lives through innovative tools and solutions.
Infrastructure Architect at Pague Menos responsible for designing secure, scalable IT infrastructure architectures. Collaborating with teams to implement and optimize both on - premise and cloud solutions.
IT Infrastructure Specialist managing physical and virtual server environments for Premier League Studios. Ensuring robust workflows and high - performance infrastructure in a hybrid work setting.
Manager of Platform Engineering at a leading insurance company shaping the future of API platforms. Fostering innovation and collaboration while driving platform stability and resiliency.
Infrastructure Engineer responsible for building, monitoring, and securing IT infrastructure for NLACRC. Collaborates with IT personnel and external support to ensure robust infrastructure.
Infrastructure Engineering Intern working on cloud solutions at a global growth engine for commerce. Collaborating on secure, scalable systems and contributing to performance optimization.
Infrastructure Engineer supporting IT service management and implementing complex system solutions. Collaborating with business units and training junior team members in a hybrid environment.
Infrastructure Engineering Lead overseeing edge security initiatives for Lloyds Banking Group. Driving the development of security capabilities and mentoring engineering teams.
Lead Infrastructure Engineer focusing on web access protection and security strategies at Lloyds Banking Group. Managing infrastructure improvements and team leadership in enterprise environments.
Senior Infrastructure Engineer maintaining IT infrastructure and datacentre operations for Walkers Global. Installing, configuring, and troubleshooting various hardware and cloud services in a hands - on role.