Tech Lead responsible for reliability and system performance of Almabase's products in India. Leading technical execution while collaborating across teams for system stability and scalability.
Responsibilities
You are responsible for reliability metrics across the engineering org
Uptime & SLA adherence
P95 latency for pages and APIs
Sentry error trends & bug leakage
Distributed system correctness (sync, async, batching, retries)
Queue depth & worker throughput
Redis performance and stability
Event/Giving day high-traffic readiness
Systemic root-cause analysis & permanent fixes
Work closely with the Sr. Technical Lead to shape architecture across all products
Improve data models and API design
Introduce distributed system patterns (idempotent flows, orchestration, fan-out)
Build scalable async pipelines
Design fail-safes, timeouts, and circuit breakers
Lead design reviews across teams
Unblock engineers on complex issues
Own cross-product refactors
Drive clean code, testing standards, and observability-first development
Define SLIs/SLOs across products
Build dashboards and alerting (Datadog, Sentry, logs, traces)
Ensure issues are detected before customers notice
Work with Core Systems to instrument distributed flows deeply
Ensure product work aligns with SOC 2 Type II requirements
Secure coding practices and proper secrets handling
Requirements
3–4+ years backend engineering experience (Python preferred)
Strong understanding of distributed systems
Deep understanding of sql databases and query optimisation
Deep experience with Redis, queues, async jobs, retries, and fan-out flows
Strong debugging skills across infra + app layers
Ability to lead design decisions across multiple teams
Solid data modeling & performance optimization experience
Experience improving system reliability at scale
Excellent communication & collaboration skills
Nice to Have: Datadog / Sentry / Elasticsearch
Experience with RE NXT, Salesforce, or large CRM systems
Prior reliability ownership for multi-product SaaS
Responsable Technique R&D sur des innovations dans le domaine des hautes tensions. SuperGrid Institute facilite la transition énergétique avec des solutions avancées en collaboration avec des acteurs industriels.
Software Engineer designing scalable information retrieval infrastructure for Slack. Collaborating with teams to maintain high availability and build new features.
Software Engineer developing scalable, resilient offline indexing pipelines for Slack's search infrastructure. Collaborating with product engineering to build new features and ensure system reliability.
Senior Systems/Software Engineer designing and developing complex software solutions for HPE's edge - to - cloud offerings. Leading project teams and managing internal and outsourced development partners.
ETL/Data Validation QA professional responsible for validating Informatica - to - Oracle PL/SQL migrations and data accuracy in SAP Commissions. Execute manual and automated tests and manage test cases efficiently.
Senior Software Engineer responsible for designing scalable systems at GEICO. Collaborating across teams while guiding quality practices in a fast - paced environment.
Staff Software Engineer developing reliability software for GM Autonomous Vehicles, collaborating across teams to enhance multi - sensor systems and improve data quality.
Senior Software Engineer developing and implementing vehicle simulation components for General Motors. Collaborating with technical experts to optimize performance and maintainability in vehicle modeling.
Senior Software Engineer developing and maintaining datapath software components for F5’s cybersecurity innovations. Collaborating across teams to optimize hardware and software integration.
Software Engineer building tools that shape how Homebase engineers ship software. Contributing to AWS infrastructure while improving internal developer experience as part of a collaborative team.