xAI’s Colossus supercomputing cluster uses 100,000 Nvidia Hopper GPUs – and it was all made possible using Nvidia’s Spectrum-X Ethernet networking platform


  • Nvidia and xAI are working together on the development of Colossus
  • xAI has significantly reduced ‘flow collisions’ during AI model training
  • Spectrum-X has been crucial in training the Grok AI model family

Nvidia has shed light on how xAI’s ‘Colossus’ supercomputer cluster can handle 100,000 Hopper GPUs – and it’s all thanks to the chipmaker’s Spectrum-X Ethernet networking platform.

Spectrum-X, the company revealed, is designed to deliver massive performance capabilities to multi-tenant, hyperscale AI factories using Remote Direct Memory Access (RDMA) networking.