Hybrid Senior Software Engineer – SRE Focused

Posted 2 weeks ago

Apply now

About the role

  • Senior Software Engineer responsible for production incident response and system reliability for a B2B WealthTech startup. Managing Go and Node.js services in a hybrid tech environment.

Responsibilities

  • Lead and execute production incident response: triage, mitigation, stakeholder communication, and coordination across teams
  • Debug and fix issues across Go services (mandatory) and the broader stack (Node.js services where relevant)
  • Work across service boundaries: GraphQL/RPC, distributed tracing, dependency failures, performance bottlenecks, and safe degradation patterns
  • Troubleshoot Kubernetes workloads and deployments
  • Diagnose PostgreSQL/CNPG issues
  • Handle production bugs that span application + data pipelines (ETL/Snowflake mappings), including backfills/replays and data-quality validation
  • Build prevention: add regression tests, improve observability , and maintain runbooks/service passports
  • Drive reliability improvements: SLOs/SLIs, alert quality, release readiness checks, and operational standards across teams

Requirements

  • 7+ years in SRE / Production Engineering / Platform Engineering (reliability-focused)
  • Strong Go (mandatory): ability to read, debug, and ship production fixes in Go codebases
  • Proven experience debugging distributed systems in production (latency, error rates, timeouts, retries, cascading failures)
  • Strong hands-on experience with Kubernetes in production environments
  • Experience with Helm and GitOps workflows (FluxCD preferred; ArgoCD acceptable)
  • Solid PostgreSQL troubleshooting experience (performance, incident patterns, migrations)
  • Observability experience (metrics/logging/tracing; Datadog/Grafana/Tempo/Loki experience is a plus)
  • Strong incident leadership: calm under pressure, clear communication, structured problem-solving
  • Engineering hygiene: PR discipline, reviews, testing mindset, safe rollouts/rollbacks
  • Comfortable with IAM/security fundamentals in real production systems: OAuth2/OIDC basics, RBAC/least privilege, and safe secrets handling
  • Good to Have
  • Node.js backend experience in production
  • Experience in FinTech / regulated environments / high-availability systems (auditability, change control, incident rigor)
  • Data reliability experience: ETL monitoring, reconciliation, Snowflake operations, schema/mapping drift handling
  • Reliability patterns common to trading/fintech platforms: correctness and data integrity mindset (idempotency, reconciliation), resilient partner integrations, and strong observability for critical user journeys

Job title

Senior Software Engineer – SRE Focused

Job type

Experience level

Senior

Salary

Not specified

Degree requirement

No Education Requirement

Location requirements

Report this job

See something inaccurate? Let us know and we'll update the listing.

Report job