
Show HN: Brokkr - Scalable cluster management for GPU/HPC workloads

Tags: distributed computing, rust, cluster management, open source, gpu computing, Brokkr, Bazel, Buck2, Pants, Linux, REAPI, gRPC, CAS
⚡ TL;DR · AI summary

Brokkr is a self-hosted, open-source compute platform written in Rust that distributes tasks such as builds, tests, and ML training across a cluster of Linux machines. It implements the Bazel Remote Execution API v2, so existing tools work with it, and it builds its core distributed-systems components from scratch for educational purposes. Brokkr is in active development: it supports end-to-end job execution with caching and sandboxing but is not yet production-ready.


Brokkr

A distributed build & compute grid, written in Rust. Many hammers. One forge.

Brokkr is a self-hosted, open-source compute platform that turns a fleet of Linux machines into a single, coherent grid for executing arbitrary jobs: builds, tests, ML training, transcoding, anything that fits inside a sandbox. It speaks the Bazel Remote Execution API v2 so existing tooling (bazel, buck2, pants, custom REAPI clients) works unchanged.

The interesting parts of distributed computing (content-addressable storage, hermetic sandboxing, scheduling, and consensus) are implemented from scratch as the project's educational core. There is no Docker, no runc, no embedded etcd, no third-party Raft.

Status: Phase 1 complete; Phases 2 and 3 in flight.

Excerpt limited to ~120 words for fair-use compliance. The full article is at GitHub.
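
For readers unfamiliar with content-addressable storage (CAS), here is a minimal sketch of the idea in Python. It is not Brokkr's code, which is written in Rust and not shown in the excerpt. The class `InMemoryCAS` and its methods are hypothetical names chosen for illustration. The `Digest` shape (hex hash plus `size_bytes`) follows the Remote Execution API v2 `Digest` message, and SHA-256 is assumed as the digest function.

```python
import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class Digest:
    """REAPI-style digest: lowercase hex hash plus blob size in bytes."""
    hash: str
    size_bytes: int


class InMemoryCAS:
    """Toy content-addressable store (hypothetical; for illustration only)."""

    def __init__(self):
        self._blobs = {}  # maps Digest -> bytes

    def put(self, data: bytes) -> Digest:
        # The key is derived from the content itself (SHA-256 assumed here).
        digest = Digest(hashlib.sha256(data).hexdigest(), len(data))
        # Identical content always yields the same key, so storing is idempotent.
        self._blobs.setdefault(digest, data)
        return digest

    def get(self, digest: Digest) -> bytes:
        data = self._blobs[digest]  # raises KeyError if the blob is absent
        # Re-hash on read so corrupted data is detected instead of returned.
        if (hashlib.sha256(data).hexdigest() != digest.hash
                or len(data) != digest.size_bytes):
            raise ValueError("blob does not match its digest")
        return data

    def find_missing(self, digests):
        # Mirrors the idea of REAPI's FindMissingBlobs: a client asks which
        # blobs the store lacks, then uploads only those.
        return [d for d in digests if d not in self._blobs]


if __name__ == "__main__":
    cas = InMemoryCAS()
    first = cas.put(b"fn main() {}")
    second = cas.put(b"fn main() {}")
    print(first == second)  # same content, same digest
    print(cas.get(first))
```

Because the key is a function of the bytes, two workers that produce the same output store it once, and a cache lookup needs nothing more than a hash comparison. That property is what lets REAPI clients skip re-uploading inputs the grid already holds.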
