Observability Platform Engineer focusing on uptime and developer empowerment at YouLend. Building a world-class observability function with meaningful alerts and dashboards.
Responsibilities
**The Role**
We’re building a world-class Observability function, and we’re looking for someone who lives for uptime, meaningful alerts, and elegant dashboards. If you’ve ever been on-call, silenced a noisy monitor, or traced a ghost bug across microservices outside core hour - we want to hear from you!
This isn’t a generic “Platform Engineer” role. You’ll be *laser-focused* on **observability, reliability, and developer empowerment**, working closely with teams to make sure we don’t just know when things break - but *why*.
Requirements
**Responsibilities:**
Designing and scaling **on-call systems** that engineers don’t dread being part of.
Building out **Datadog monitoring, alerting, dashboards, and log pipelines** for our Kubernetes-based environments.
Defining and managing **SLOs, SLIs**, and **error budgets** — and helping teams stick to them.
Creating **scorecards** and **software catalogs** so engineers know what’s healthy, what’s broken, and who owns what.
Training and enabling dev teams to own their own **observability**, **alerts**, and **incident response**.
Introducing **chaos engineering practices** (yes, we want to break things… on purpose).
Driving a culture of reliability, with **incident reviews**, shared learnings, and transparency.
**The ideal candidate will have the following skillset: **
Have **production experience** with observability tools (especially **Datadog**) in cloud-native environments.
Have set up **monitoring and alerting** across Kubernetes services.
Have built or scaled **on-call systems** in startups or large-scale environments.
Know how to reduce **alert fatigue** and love a good **MTTR** chart.
Have experience with **infrastructure as code** (Terraform preferred).
Believe that great developer experience includes **clear visibility and ownership**.
Are curious about — or already practicing — **chaos engineering**.
Have knowledge of our stack: AWS (EKS, Lambda, etc.), Datadog, OpenTelemetry, Terraform, Kubernetes (EKS), Fluent Bit, FireLens, Backstage (or custom)
**Desirable: **
Experience with **OpenTelemetry**, **Fluent Bit**, or similar.
Familiarity with **service catalog tooling** (e.g., Backstage).
Comfortable running or facilitating **game days** or **failure drills**.
Prior involvement in setting up **scorecards** for service health.
Benefits
**Why join YouLend?**
Award-Winning Workplace: YouLend has been recognised as one of the “Best Places to Work in 2024 and 2025” by the Sunday Times for being a supportive, diverse, and rewarding workplace.
Award-Winning Fintech: YouLend has been recognised as a “Top 250 Fintech Worldwide” company by CNBC.
**It’s just getting fun: **
We have developed powerful solutions, won some significant partnerships, and are growing at a rapid pace.
But the global opportunity is still massive, and YouLend is a raw organisation where we are only just getting started.
**Lots of upsides: **
High-growth (>100% growth during 2022 and 2023), so clear outlook to compensation (bonus or share option appreciation) and career growth (through growth with business).
Well-capitalised with supportive private equity backing.
Part of Banking Circle Group with a fully licensed Luxembourg bank, which can provide a balance sheet and support European expansion in otherwise complex regulated markets.
**Motivating work environment: **
A high-quality team that pushes each other to succeed through direct feedback and aligned incentives.
Strong and transparent team culture, we have each other’s backs.
Independent work environment where results matter.
Data-driven culture and emphasis on speed (anti-red tape).
**We offer a comprehensive benefits package that includes: **
Stock Options
Private Medical insurance via Vitality and Dental Insurance with BUPA
EAP with Health Assured
Enhanced Maternity and Paternity Leave
Modern and sophisticated office space in Central London
Free Gym in office building in Holborn
Subsidised Lunch via Feedr
Deliveroo Allowance if working late in office
Monthly in office Masseuse
Team and Company Socials
Football Power League / Paddle and Yoga Club
At YouLend, we champion diversity and embrace equal opportunity employment practices. Our hiring, transfer, and promotion decisions are exclusively based on qualifications, merit, and business requirements, free from any discrimination based on race, gender, age, disability, religion, nationality, or any other protected basis under applicable law.
Senior Site Reliability DevOps Specialist for Boeing, focusing on cloud technology and automation in GCP environments. Collaborate globally to enhance system reliability and performance with a diverse tech stack.
SRE Team Lead in charge of reliability strategy and operational maturity for a cybersecurity SaaS platform. Leading a specialized team to enhance system performance and incident management.
Junior DevOps Engineer implementing continuous integration and deployment architecture for the Defense Logistics Agency. Debugging cluster - based computing while using various configuration management tools.
Mobile DevOps Engineer developing hybrid applications with Ionic for a global organization. Collaborate across teams to optimize development practices and maintain mobile environment.
Site Reliability Engineer improving system reliability and performance in production environments with a focus on automation and operational efficiency. Collaborating with engineering and infrastructure teams on deliverable - focused projects.
Lead Virtualisation Engineer at Mastercard focused on service quality and performance of platform virtualisation technologies. Collaborate with teams to ensure availability, scalability, and resilience across the network in Singapore.
DevOps Engineer at LRQA, optimizing deployments and driving process improvements in a global assurance provider. Focusing on CI/CD pipelines, security best practices, and team collaboration.
Senior DevOps Engineer in a technology consulting firm connecting tech talents to impactful projects. Involves working in healthy environments with growth opportunities.
DevOps Engineer at Booz Allen enhancing critical systems for space operations. Modernizing architectures and collaborating with teams to solve complex challenges.