
Location: On Site in San Francisco, California, United States
Employment type: Full-time
Salary: <p>$170K – $300K</p>
Posted: a year ago
As a distributed systems software engineer, you’ll be working on our in-house resource orchestration system. This system coordinates state and access to hundreds (soon thousands) of GPU compute nodes in multi-tenant clusters spanning across multiple data centers.
Design of distributed system architectures that enable high availability fault tolerant state management
Deployment automation and performance optimization of virtual machines running on bare metal that utilize GPU passthrough
Design and deployment of multi-tier high performance network attached storage systems
You have built fault tolerant distributed systems before that can manage hardware resources at scale
You enjoy creating self-correcting systems that contribute to hardware health and reliability
You have experience with Linux virtualization (Cloud Hypervisor, QEMU, libvirt, virtiofs, sr-iov, PCIe passthrough)
You appreciate and value good documentation
Some Nice to Haves
Experience with Rust (our VM orchestrator is written in Rust)
Experience with etcd
Experience with high performance storage systems (WEKA, VAST, Ceph, etc.)
Unlimited office book budget: You can buy as many books for the office as you want. You’re encouraged to spend time during the workday reading!
Generous equity grant: Team members are offered a competitive salary along with equity in the company
Retirement matching: We match 401(k) plans up to 4%
Medical, dental & vision: We offer competitive medical, dental, vision insurance for employees and dependents and cover 100% of premiums
Time off: We offer unlimited paid time off as well as 10+ observed holidays
Parental leave: We offer biological, adoptive, and foster parents paid time off to spend quality time with family
Daily lunch: We cover lunch daily for employees
Visa Sponsorships: Yes, we sponsor visas and work permits