Software Engineer building reliable distributed systems and services for Whatnot’s hybrid workforce. Focused on improving system reliability while collaborating with cross-functional teams.
Responsibilities
Build distributed systems, services, and frameworks that improve the reliability of the entire platform
Focus on making reliability a built-in property of our systems as scale, traffic, and complexity continue to grow
Design, build, and operate reliability-focused components, services, and frameworks
Shape the standards and practices that guide how software is built and run across Whatnot
Partner closely with product, platform, and infrastructure teams to embed reliability concerns into system design, development workflows, and runtime behavior
Design and operate traffic control mechanisms, including circuit breakers, rate limiting, backpressure, and graceful degradation
Build and evolve load testing frameworks that validate system behavior under sustained, burst, and peak event traffic patterns
Build chaos and resilience testing frameworks to proactively surface failure modes and validate recovery behavior
Define and implement SLOs, SLIs, and error budgets that guide engineering teams toward the right reliability tradeoffs
Develop reliability tooling and services that improve incident detection, response, and automated mitigation
Review service architectures and designs with a focus on failure modes, scalability limits, and operational safety
Participate in incident response and drive post-incident follow-ups that reduce repeated failure patterns through systemic fixes
Requirements
5+ years of experience as a software engineer working on large-scale distributed systems
Strong fundamentals in designing, building, and operating shared production services and frameworks
Experience with traffic control mechanisms such as circuit breakers and rate limiting
Experience building or operating load testing and chaos testing frameworks
Hands-on experience with observability, monitoring, and debugging production systems
Experience working with SLOs, error budgets, and incident response processes
Comfortable in cloud-native environments such as AWS or GCP with Kubernetes and infrastructure as code
Strong collaborator with clear written and verbal communication skills
Bonus: experience with high-traffic, real-time, or event-driven systems
Benefits
Flexibility to work from home or from one of our global office hubs
In-person time for planning, problem-solving, and connection
Staff Engineer at URBN developing AI-powered visual experiences. Integrating generative AI solutions with creative tools and workflows to build robust image and video generation systems.
Senior Software Engineer at Sotheby’s responsible for building and maintaining systems for art and luxury sales. Engaging in both backend engineering and client-facing product development.
Perception Architect designing sensory systems for next-generation autonomous products. Evolving perception architecture, liaising between sensing teams and core ML systems.
Staff Software Engineer designing scalable microservices and APIs for Small Business Banking at CommBank. Collaborating with a squad to shape architecture and mentor other engineers.
Technical Lead guiding discovery-driven IT initiatives and AI projects at a biopharmaceutical company. Mentoring teams and ensuring successful project delivery while leveraging cutting-edge science.
Senior Full-Stack Engineer IV responsible for developing and maintaining Synthetic Monitoring solutions. Working at ING in Poland, focusing on reliable and scalable Golang and Grafana systems.
Global Technical Leader for sterile biotech products at Chiesi ensuring technical robustness throughout product lifecycle. Leading industrialization strategy and lifecycle management projects in a collaborative environment.
Software Developer responsible for the design, development, and implementation of software projects for Langan. Collaborating with teams to support company initiatives through tech solutions.
Senior Product Engineer building cutting-edge software solutions for nonprofits at Goodstack, a fast-growing impact startup. Join a team dedicated to creating scalable products that make a difference.
Field Software Engineer configuring and troubleshooting maritime sensor technologies for a global intelligence network. Collaborating remotely with field technicians and backend engineers.