AMD Navi 31 GPUs (Radeon RX 7900 XT?) Could Boast Multi-Chip-Module Design with 10,240 Cores and Much Improved Ray Tracing

Tsing · Jan 23, 2021

Image: AMD

3DCenter.org has shared a rumor suggesting that AMD’s Radeon RX 6000 Series successors could feature a multi-chip-module (MCM) design, which is similar to what NVIDIA is reportedly planning for its “Hopper” family of next-generation graphics cards. The speculation stems from Kepler_L2, who claims that red team has had a working version of Navi 31 since early 2020. He also revealed that the top SKU, which may very well turn out to be the Radeon RX 7900 XT, boasts a pair of 80 CU chiplets for a potential total of 10,240 Stream Processors—5,120 more than the current Radeon RX 6900 XT flagship.

“Navi31 working silicon exists...

Brian_B · Jan 23, 2021

Sure it could.

they “could” also ship Navi2’s.

Grimlakin · Jan 24, 2021

Yea they need better market penetration of the current cards before releasing these new chiplet cards. Though really if they can produce those faster... as long as the drivers are not too divergent it shouldn't be an issue.

Zarathustra · Jan 24, 2021

I could be interested in this, but only if they are doing it chioket style and have found a way to make all of those CU's appear to the operating system as if they are on the same die.

I am done with Crossfire and SLI implementations, even if they are on one board.

Riccochet · Jan 24, 2021

Zarathustra said:
I could be interested in this, but only if they are doing it chioket style and have found a way to make all of those CU's appear to the operating system as if they are on the same die.

I am done with Crossfire and SLI implementations, even if they are on one board.

I don't see why they couldn't have the OS see it as one die. Their CPU's are seen as one die. The controller is what the OS sees.

Denpepe · Jan 24, 2021

so that's 2x the amount of chips needed, that should result in an increase of cards right? right?

LazyGamer · Jan 24, 2021

Denpepe said:
so that's 2x the amount of chips needed, that should result in an increase of cards right? right?

If they're smaller and easier to fab, increasing yields?

That's the hope!

LazyGamer · Jan 24, 2021

Riccochet said:
I don't see why they couldn't have the OS see it as one die. Their CPU's are seen as one die. The controller is what the OS sees.

The OS sees chiplets and so on; this was one of the issues that Microsoft and the Linux kernel developers had to solve before Zen could really stretch its legs.

Beyond that, exposing the configuration through the driver can easily have benefits for software tuning. Perhaps it runs great untuned, and can be made to sing if the application is aware of the layout?

Riccochet · Jan 24, 2021

smaller chiplets results in increased yields. At least that's what I've been told.

Riccochet · Jan 24, 2021

LazyGamer said:
The OS sees chiplets and so on; this was one of the issues that Microsoft and the Linux kernel developers had to solve before Zen could really stretch its legs.

Beyond that, exposing the configuration through the driver can easily have benefits for software tuning. Perhaps it runs great untuned, and can be made to sing if the application is aware of the layout?

The OS sees sockets and cores. That information is being provided by the BIOS and CPU. What the CPU presents as far as cores has nothing to do with how many chiplets are on a substrate. The controller presents it as a single CPU with X number of cores.

LazyGamer · Jan 24, 2021

Riccochet said:
The controller presents it as a single CPU with X number of cores.

The OS definitely sees separate CCXs. Again, this was a big problem with Zen, particularly with the latency difference between the caches. Zen 3 addresses that a bit by simply upping the size of the outermost cache.

DrezKill · Jan 25, 2021

Zarathustra said:
I am done with Crossfire and SLI implementations, even if they are on one board.

The last one I ever f*cked with was the GTX 690, with two 680 GPUs on one board. That thing was... temperamental. Multi-monitor output was kind of a pain in the @ss to deal with too.

Uvilla · Jan 25, 2021

The io die will be the infinity fabric itself, it will be basically a 2 story chip. That would be my guess anyway

Eduardo_Domingot · Jan 25, 2021

At least the rumors are interesting again, maybe not reliable but interesting.

Denpepe · Jan 25, 2021

LazyGamer said:
If they're smaller and easier to fab, increasing yields?

That's the hope!

Riccochet said:
smaller chiplets results in increased yields. At least that's what I've been told.

While in theory I agree, they talk about 2 times the amount of stream processors, so that would be 2 of the current high end chips, even if fabbed on a smaller node that might improve yields, that's no guarantee they can pump those out, maybe the lesser models but why make the effort there if you have to make the more complicated ones anyways.

Riccochet · Jan 25, 2021

Denpepe said:
While in theory I agree, they talk about 2 times the amount of stream processors, so that would be 2 of the current high end chips, even if fabbed on a smaller node that might improve yields, that's no guarantee they can pump those out, maybe the lesser models but why make the effort there if you have to make the more complicated ones anyways.

My guess would be multiple chiplets of smaller amounts of stream processors. That way the product stack can be scaled accordingly. Which would mean higher yields.

Brian_B · Jan 25, 2021

Looking at 6800/6900 power levels - AMD may need a good jump in efficiency before they can get to 10k cores. The 6900 is already at 300W with just half that number.

I mean, sure you could throw that many cores and kneecap the TDP so it fits in a PCI slot form factor, but the 6900 is already doing that. The 6800 have a higher power budget per core than 6900 already - and that kinda plays out in the performance delta.

LazyGamer · Jan 25, 2021

Brian_B said:
Looking at 6800/6900 power levels - AMD may need a good jump in efficiency before they can get to 10k cores. The 6900 is already at 300W with just half that number.

I mean, sure you could throw that many cores and kneecap the TDP so it fits in a PCI slot form factor, but the 6900 is already doing that. The 6800 have a higher power budget per core than 6900 already - and that kinda plays out in the performance delta.

That is certainly an interesting point. I haven't really looked at AMDs power draw this gen, since they've also kneecapped RT and haven't shown much initiative in terms of software support either yet.

But even if they keep it from overheating, which is certainly possible in a PCIe form-factor even if drawing beyond say 500W, they run into so very many problems in terms of actually supporting the product. Chief among them being such a product would have a market limited by the extremes they'd need to go to in order to keep it cool!

Riccochet said:
My guess would be multiple chiplets of smaller amounts of stream processors. That way the product stack can be scaled accordingly. Which would mean higher yields.

Even as everyone is more or less expecting this to be the route they take, one important thing to keep in mind is that they're likely going to need an interposer in there. Possibly more than one, if they decide to do some local HBM per compute chiplet. And interposers are something that AMD has failed at pretty excruciatingly in the past, see Vega and Radeon VII. The potential saving grace is that if they keep the chiplets small and the interconnects well-designed, perhaps they can keep the interposers small too, which would keep their yields up.

AMD Navi 31 GPUs (Radeon RX 7900 XT?) Could Boast Multi-Chip-Module Design with 10,240 Cores and Much Improved Ray Tracing

Tsing

The FPS Review

Brian_B

Forum Posting Supreme

Grimlakin

Forum Posting Supreme

Zarathustra

Cloudless

Riccochet

FPS Junkie

Denpepe

FPS Junkie

LazyGamer

FPS Junkie

LazyGamer

FPS Junkie

Riccochet

FPS Junkie

Riccochet

FPS Junkie

LazyGamer

FPS Junkie

DrezKill

FPS Junkie

Uvilla

FPS Regular

Eduardo_Domingot

{NG}Fidel

Denpepe

FPS Junkie

Riccochet

FPS Junkie

Brian_B

Forum Posting Supreme

LazyGamer

FPS Junkie