NVIDIA Rubin Platform: Everything We Know So Far
Blog post from Vast.ai
NVIDIA has introduced its Rubin AI platform, designed to revolutionize large-scale AI economics by reducing training time and inference token costs through a rack-scale architecture featuring six new chips. This platform integrates GPUs, CPUs, networking, security, software, power delivery, and cooling, all co-designed to optimize performance across distributed systems. Key components include the NVIDIA Rubin GPU with a third-generation Transformer engine and the NVIDIA Vera CPU with 88 Olympus cores, alongside innovations like the sixth-gen NVLink interconnect and ConnectX-9 SuperNIC for efficient data movement. The Vera Rubin NVL72 rack-scale system stands out for its NVIDIA Confidential Computing feature, maintaining data security across various domains. The Rubin platform promises up to 50 petaFLOPS of NVFP4 compute and 260 TB/s per rack, with a modular, cable-free design allowing for faster assembly and servicing. While unofficial reports hint at a potential TDP increase to 2.3 kW per GPU, enhancing performance under stress, NVIDIA's Rubin platform is set to define the future of AI factories, offering high-performance computing capabilities that cater to evolving AI infrastructure needs.
No tracked trend matches for this post yet.