Hybrid Distributed Systems Engineer – Secure Sandboxes

Posted 3 days ago

Apply now

About the role

  • Build highly scalable, highly performant, software that facilitates arbitrary code execution with strong isolation guarantees.
  • Design and build systems that allow our AI models to interface with machines in various modes, interactive terminal, GUI applications, etc.
  • Provision and operate high density compute and storage nodes (NVMe, high IOPS SSDs, high bandwidth networks), and build software that performs efficient load balancing, and resource utilization across them.
  • Instrument and optimize end to end performance including storage IO, network bandwidth, CPU, memory, and endurance constraints.
  • Develop APIs, self service platforms, and automation and tools so researchers and engineers can deploy and monitor workloads at scale.
  • Troubleshoot complex infrastructure issues across OS, drivers, hardware, storage systems (local NVMe, block storage, NFS), networking, namespace isolation, and cloud or hybrid environments.
  • Produce clean, documented code and developer workflows, and collaborate with SRE and security teams to ensure safe, reliable, and self serviceable compute offerings.

Requirements

  • Strong software engineering background (C, C++, Go, Rust, or similar systems languages).
  • Experience designing or operating sandboxed or isolated execution environments (namespaces, cgroups, container runtime internals), or strong interest in this area.
  • Experience building or operating distributed systems or parallel processing frameworks (scatter aggregate processing, worker pools, multi thread and multi process coordination, shared memory, atomics, merging strategies).
  • Solid understanding of storage and IO subsystems (NVMe, SSD endurance, write amplification), network performance, CPU and memory resource constraints in high performance compute clusters.
  • Comfortable working on low level systems (OS, threading, memory management, synchronization) as well as higher level orchestration or automation.
  • Experience with cloud infrastructure (GCP, AWS, Azure, etc.) including IaC tools such as OpenTofu, Terraform, Pulumi, or CDK is a plus.
  • Intellectual curiosity, strong ownership, and the ability to make tradeoffs in ambiguous environments such as latency versus throughput and isolation versus performance.

Benefits

  • Significant equity component
  • 401(k) with matching
  • Comprehensive health, dental, and vision insurance
  • Unlimited paid time off
  • Visa sponsorship and relocation support
  • Fast paced, mission driven environment focused on safely advancing AGI for humanity

Job title

Distributed Systems Engineer – Secure Sandboxes

Job type

Experience level

Mid levelSenior

Salary

$225,000 - $550,000 per year

Degree requirement

Bachelor's Degree

Location requirements

Report this job

See something inaccurate? Let us know and we'll update the listing.

Report job