Tech Lead responsible for reliability and system performance of Almabase's products in India. Leading technical execution while collaborating across teams for system stability and scalability.
Responsibilities
You are responsible for reliability metrics across the engineering org:
Uptime & SLA adherence
P95 latency for pages and APIs
Sentry error trends & bug leakage
Distributed system correctness (sync, async, batching, retries)
Queue depth & worker throughput
Redis performance and stability
Event/Giving day high-traffic readiness
Systemic root-cause analysis & permanent fixes
Work closely with the Sr. Technical Lead to shape architecture across all products
Improve data models and API design
Introduce distributed system patterns (idempotent flows, orchestration, fan-out)
Build scalable async pipelines
Design fail-safes, timeouts, and circuit breakers
Lead design reviews across teams
Unblock engineers on complex issues
Own cross-product refactors
Drive clean code, testing standards, and observability-first development
Define SLIs/SLOs across products
Build dashboards and alerting (Datadog, Sentry, logs, traces)
Ensure issues are detected before customers notice
Work with Core Systems to instrument distributed flows deeply
Ensure product work aligns with SOC 2 Type II requirements
Secure coding practices and proper secrets handling
Requirements
3–4+ years of backend engineering experience (Python preferred)
Strong understanding of distributed systems
Deep understanding of SQL databases and query optimization
Deep experience with Redis, queues, async jobs, retries, and fan-out flows
Strong debugging skills across infra + app layers
Ability to lead design decisions across multiple teams
Solid data modeling & performance optimization experience
Experience improving system reliability at scale
Excellent communication & collaboration skills
Nice to Have
Experience with Datadog, Sentry, or Elasticsearch
Experience with RE NXT, Salesforce, or large CRM systems
Prior reliability ownership for multi-product SaaS