It looks like Nvidia's AI inference GPU won't see the light of day this year, which could actually be good news for PC gamers

A promotional image for the Nvidia NVL144 CPX platform, with Nvidia Vera CPUs, Rubin GPUs, and Rubin CPX processors clearly shown — (Image credit: Nvidia)

Last September, Nvidia announced a new product to join its all-conquering collection of AI offerings, Rubin CPX. What made this GPU notable was that it was the company's first attempt at making a processor purely for AI inference. However, while Nvidia's Vera CPU and the other Rubin GPU are seemingly racking up the orders, it's all gone very quiet on the CPX front, and that's potentially a brief respite for PC gamers.

Korean publication The Elec drew my attention to this via a short report on the matter, claiming that sources have told it that the whole thing has ground to a halt. "The industry views the project as effectively cancelled," it writes.

The evidence for this is the apparent lack of printed circuit board (PCB) and DRAM module orders, specifically for Rubin CPX installations. Where Vera Rubin, Blackwell, and older AI superchips all use high bandwidth memory (HBM) for the GPU's local storage, because of the bandwidth demands of AI training, CPX uses 128 GB of GDDR7 mounted directly on the PCB, rather than the processor's package.

Latest Videos From

Watch full video here:

In any case, no PCB or GDDR7 orders equivalent to those required for a CPX setup strongly suggests that Nvidia has either sidelined the project for the moment or abandoned it entirely. Rubin CPX wasn't expected to be available until the end of this year, but while we're still a good number of months away from the start of the final quarter, you'd think that production would be scaling now, if demand were high.

So what's happened? The most likely cause was Nvidia entering into a $20 billion non-exclusive technology licensing deal with Groq (not to be confused with Elon Musk's Grok). That company's language processing unit (LPU) design is purely for inference, more so than Rubin CPX, and Nvidia said earlier this year that it was working on integrating the LPU into its Rubin platform.

A Palit RTX 5060 Ti Infinity 3 with three fans and 16 GB of memory on a computer. — Nvidia's RTX 50-series graphics cards all use GDDR7, bar the RTX 5050. (Image credit: Future)

If you're wondering why I'm even mentioning any of this, when it seems to have nothing to do with PC gaming, it's down to the fact that Groq's LPU doesn't use GDDR7 or even DRAM in general: it's packed with 500 MB of SRAM inside.

Nvidia's Rubin GPU still uses mountains of HBM, of course, but the removal of the need for equally copious amounts of GDDR7 could mean that GeForce RTX 50-series graphics cards won't balloon in price due to VRAM supply constraints.

We'll still be suffering the RAMpocalypse, naturally, because AI and the enormous demand for DRAM and SSDs isn't going anywhere just yet. But in these grim times, I'll take any sliver of hope as good news—simply because the alternative isn't worth contemplating.

Best graphics card 2026

1. Best overall: AMD Radeon RX 9070

2. Best value: AMD Radeon RX 9060 XT 16 GB

3. Best budget: Nvidia RTX 5050

4. Best mid-range: AMD Radeon RX 9070 XT

5. Best high-end: Nvidia GeForce RTX 5090

👉Check out our full graphics card guide👈

TOPICS

Nick, gaming, and computers all first met in the early 1980s. After leaving university, he became a physics and IT teacher and started writing about tech in the late 1990s. That resulted in him working with MadOnion to write the help files for 3DMark and PCMark. After a short stint working at Beyond3D.com, Nick joined Futuremark (MadOnion rebranded) full-time, as editor-in-chief for its PC gaming section, YouGamers. After the site shutdown, he became an engineering and computing lecturer for many years, but missed the writing bug. Cue four years at TechSpot.com covering everything and anything to do with tech and PCs. He freely admits to being far too obsessed with GPUs and open-world grindy RPGs, but who isn't these days?