Tenstorrent’s Galaxy Blackhole AI servers escape the event horizon
Tenstorrent has launched its Galaxy Blackhole AI compute platform, a RISC-V-based system with 32 Blackhole accelerators per 6U chassis, offering 23 petaFLOPS of FP8 performance at a price of $110,000. The systems feature high-bandwidth memory and a scalable mesh network, enabling clustering up to 32 nodes for larger AI workloads. Performance claims include sub-four-second processing of 100,000-token prompts on a four-node cluster and real-time 720p video generation. The software stack has improved significantly since earlier hardware evaluations, with broader model support and optimized performance.
- ▪Each Galaxy Blackhole system integrates 32 Blackhole accelerators, 1 TB of GDDR6 memory, and delivers 23 petaFLOPS of FP8 performance in a 6U form factor priced at $110,000.
- ▪The accelerators are connected via a 100 Tbps Ethernet mesh, allowing scalability across multiple nodes for large language models and high-throughput AI tasks.
- ▪A four-node Galaxy Supercluster can process a 100,000-token prompt in under four seconds and generate 720p video faster than real time.
- ▪Tenstorrent claims 90% of Hugging Face models run on its platform, supported by a Python-based interface for kernel optimization.
- ▪The hardware is available through providers like Cirrascale, Equinix, and ai&, with further details expected at the TT-Deploy event on May 1.
Opening excerpt (first ~120 words) tap to expand
AI + ML 2 Tenstorrent’s Galaxy Blackhole AI servers escape the event horizon 2 RISC-V-based systems pack 32 Blackhole accelerators in a 6U, $110K chassis Tobias Mann Tue 28 Apr 2026 // 13:00 UTC Tenstorrent on Tuesday announced the general availability of its Galaxy Blackhole AI compute platform. Each of the startup's 6U systems is packed with 32 of the Blackhole accelerators we looked at last fall. The chips are interconnected in a dense Ethernet mesh by 100 Tbps of aggregate bandwidth. Combined, Tenstorrent says each Galaxy system features 1 TB of GDDR6, 16 TB/s of memory bandwidth, and 23 petaFLOPS of dense FP8 performance, all in a system that'll set you back only $110,000.
…
Excerpt limited to ~120 words for fair-use compliance. The full article is at The Register.