Engineering Manager leading cloud and site reliability engineering teams. Championing AI workflows and platform evolution at Taxfix with an international team.
Responsibilities
Lead and grow the team
Hire, coach, and develop cloud/SRE engineers - run meaningful 1:1s, set development goals, and actively manage performance
Build a high-performance team culture rooted in psychological safety, ownership, and continuous improvement
Champion AI adoption within the team - encourage AI-assisted workflows and continuously raise the bar on how AI is used to improve productivity
Evaluate capacity, balance workloads, and advocate for the resources your team needs
Own delivery and reliability
Own the team's outcomes against OKRs - prioritise effectively, track progress with metrics, and delegate without micromanaging
Ensure rigorous deployment pipelines, production readiness, and incident management practices
Drive SLO adherence and operational excellence across production infrastructure
Shape platform strategy
Partner with Technical Leadership and Architecture to align infrastructure work with the technology strategy
Lead your team's contribution to platform consolidation and evolution across the Taxfix group
Support AI/ML platform needs - including agent observability, AI workload infrastructure, and AI development lifecycle tooling
Evaluate technical trade-offs - balancing reliability, cost, security, and delivery speed - and communicate them clearly to stakeholders
Partner across the org
Bridge your team and its stakeholders: Product Engineering, AI Engineering, Security, Data, and Architecture
Align priorities with peer EMs across Platform Engineering
Proactively surface blockers, manage dependencies, and keep information flowing
Requirements
3+ years leading an engineering team (hiring, coaching, performance management, career development)
Strong technical background in cloud infrastructure - you can guide architectural decisions and hold a high quality bar
Experience with at least one major cloud provider (GCP preferred; AWS or Azure transferable)
Familiarity with Kubernetes in production, CI/CD pipelines, and Infrastructure as Code (Terraform or similar)
Active user of AI-assisted development tools (Claude, Copilot, Cursor, or similar)
Exposure to AI/ML supporting infrastructure such as agent observability, model serving, AI development lifecycle, or ML pipeline operations
Track record of driving team outcomes using metrics, OKRs, or KPIs
Effective communicator across engineering, product, and leadership audiences
Experience with Agile/Scrum
Nice to have
Experience with platform consolidation, multi-cloud environments, or infrastructure migrations
Experience with GCP Landing Zones, GitOps (ArgoCD/Kargo), or service mesh
Cloud cost optimization experience
Experience supporting AI/ML workloads at scale (GPU scheduling, model deployment infrastructure, vector databases)
Background managing platform/infrastructure teams specifically
Benefits
A chance to do meaningful, people-centric work with an international team of passionate professionals.
Holistic well-being with free mental health coaching sessions and yoga.
A monthly allowance to spend on an extensive range of services that you can use and roll over as flexibly as you like.
Employee stock options for all employees—because everyone deserves to benefit from the success they help to create.
30 annual vacation days and flexible working hours.
Work from abroad for up to six weeks every year. Just align with your team, and then enjoy your trip.
Plenty of opportunities to socialise as a team. In addition to internal tech meetups, our international team hosts regular get-togethers—virtually and in person when possible.
Free tax declaration filing, of course, through the Taxfix app—and internal support for all personal tax-related questions.
Have a four-legged friend in your life? We’re happy to have dogs join us in the office.