Start your day with intelligence. Get The OODA Daily Pulse.

Subscribe Sign In

Home > Briefs > Technology > Another Giant Leap: The Rubin CPX Specialized Accelerator & Rack

Another Giant Leap: The Rubin CPX Specialized Accelerator & Rack

09/15/2025

Nvidia announced the Rubin CPX, a solution that is specifically designed to be optimized for the prefill phase, with the single-die Rubin CPX heavily emphasizing compute FLOPS over memory bandwidth. This is a game changer for inference, and its significance is surpassed only by the March 2024 announcement of the GB200 NVL72 Oberon rack-scale form factor. Only with hardware specialized to the very different phases of inference, prefill and decode, can disaggregated serving achieve its full potential. As a result, the rack system design gap between Nvidia and its competitors has become canyon-sized. AMD and custom silicon competitors may have made a small step forward in emulating Nvidia’s 72-GPU rack scale design, but Nvidia has just made another Giant Leap, again leaving competitors very distant objects in the rear-view mirror. AMD and ASIC providers have already been investing heavily to catch up in terms of their own rack-scale solutions.

Full analysis : A deep dive into Nvidia’s Rubin CPX chip architecture, which is optimized for the prefill phase of inference, emphasizing compute FLOPS over memory bandwidth.

Tagged: AI Chip NVIDIA

Subscribe Sign In

Related Posts