Software Engineer focusing on backend development and SRE in a B2B WealthTech startup. Collaborating on application reliability, debugging issues, and enhancing system observability.
Responsibilities
Debug and resolve production issues in APIs, workers, and data processors.
Read and understand existing Node.js / Go codebases to trace errors.
Contribute small features, bug fixes, and configs to improve application reliability.
Add observability hooks (metrics, logging, tracing) with OpenTelemetry.
Work closely with engineers to deploy fixes and enhancements via GitOps.
Participate in on-call rotation for production support.
Document and build runbooks for recurring production issues.
Requirements
Proficiency in Node.js, Go, or Python (at least one strong).
Understanding of REST/GraphQL APIs, microservices architecture.
Ability to debug stack traces, logs, runtime errors, SQL queries.
Familiarity with Kubernetes, Docker.
Exposure to CI/CD (GitHub Actions, Jenkins) and GitOps (FluxCD, ArgoCD).
Knowledge of PostgreSQL / MySQL.
Strong debugging and problem-solving skills.
Good to Have
Experience with Temporal workflows or distributed systems.
Prior exposure to observability stacks (Prometheus, Grafana, Loki, Tempo).
Interest in transitioning towards SRE/Platform engineering.
DevOps Engineer improving reliability and stability of cloud services at Madhive. Responsibilities include CI/CD tooling, monitoring, and cloud infrastructure management.
Site Reliability Engineer contributing to platform reliability at Trainline, Europe's leading rail ticketing platform. Collaborating with product engineering to ensure operational readiness and incident response.
Senior DevOps Analyst at Stefanini managing Azure DevOps for build and deploy automation. Collaborating with development squads and ensuring code quality with validation tools.
Senior DevOps Engineer leading design and management of CI/CD pipelines at Neuron7.ai. Collaborating on cloud infrastructure for scalable applications in an innovative tech environment.
Backend Software Engineer responsible for building robust backend systems for AI and analytics products. Collaborating with various teams to enhance platform reliability and performance.
Senior DevOps Engineer responsible for cloud ecosystem architecture at health - tech startup. Building HIPAA/GDPR - compliant foundations and mentoring developers.
Senior Backend Engineer building product features and maintaining infrastructure for insurance platform. Employing tools like Terraform, Kafka, Datadog and Qovery with a strong DevOps focus.
DevOps Systems Engineer supporting customer operations in Annapolis Junction, MD. Responsible for creating, sustaining, and troubleshooting complex operational data flows.
OpenShift Fresher assisting Cloud team in managing containerized applications using Red Hat OpenShift. Supporting CI/CD, deployment automation, and cloud - native application environments.
Site Reliability Engineer for Leidos ensuring reliability, performance, and scalability of complex distributed systems for the Navy - Marine Corps Intranet. Collaborating with teams to maintain and optimize network operations and services.