Discussion RDNA 5 / UDNA (CDNA Next) speculation

Kepler_L2 · Jul 24, 2025

SolidQ said:
he saying one of AMD sources leak. instead 184 is 144CU?

Not by that much

SolidQ · Jul 24, 2025

Kepler_L2 said:
Not by that much

i feel there would be AT1? because too big gap between 64CU and 154CU

Tuna-Fish · Jul 24, 2025

Is MALL gone and folded into L2? It's not listed.

That's a huge hole in the lineup between a 18GB 64CU model and the 36GB 154CU one.

SolidQ · Jul 24, 2025

Tuna-Fish said:
hat's a huge hole in the lineup between a 18GB 64CU model and the 36GB 154CU one.

Yeah i think there should be AT1, because there no 256bus card.
P.S Mlid saying he have document, where 64CU target below 550$ (estimate 5080-4090 perf)

gdansk · Jul 24, 2025

Market segment - CGVDI?

adroc_thurston · Jul 24, 2025

gdansk said:
Market segment - CGVDI?

GPU virtualization farms with SRIOV.

gdansk · Jul 24, 2025

And as usual I suspect everything interesting will be killed.

adroc_thurston · Jul 24, 2025

gdansk said:
And as usual I suspect everything interesting will be killed.

No, they actually moved a pile of senior technical staff to Radeon recently.

marees · Jul 24, 2025

Kepler_L2 said:
It's real but some of the numbers like CU count are slightly wrong (maybe intentionally to find out who leaks this).

Welp! What does Lisa Su do to the leakers ???

luro · Jul 24, 2025

adroc_thurston said:
No, they actually moved a pile of senior technical staff to Radeon recently.

Nice

gdansk · Jul 24, 2025

marees said:
Welp! What does Lisa Su do to the leakers ???

Terminate. With prejudice.

adroc_thurston · Jul 24, 2025

marees said:
What does Lisa Su do to the leakers ???

Kuvaldirovka or perhaps obnuleniye.

branch_suggestion · Jul 24, 2025

Kepler_L2 said:
It's real but some of the numbers like CU count are slightly wrong (maybe intentionally to find out who leaks this).

Yeah AT0 should be 192CU and AT2 72CU.
Both only make sense with 3SA/SE.
If we plebs don't get the full fat AT0 for gaming it will be the first time since Vega20, though that was a prosumer part. I think we will depending on where NV ends up.
Not shocked to see a probable H2'27 launch, all those console parts don't tape out or validate themselves.
N3P is whatever, but considering they are going for mono compute+MID N2 would be too yield/cost sensitive. Also 512b for the flagship is absolutely necessary to match NV on memory as otherwise they have easy wins in ML et al.

SolidQ said:
i feel there would be AT1? because too big gap between 64CU and 154CU

Nope, it is great. There is no such thing as a $1k market for dGPU, you either have your halo parts or your $500-$600 market.
AT2 is the exact part I was pushing for with RDNA4 had they used GDDR7, great config, the extra 2GB is a big deal.
NV only has that market because of lack of comp pressure, GB203 could be sold for <$700 no problem.
AT0 is a fairly modest halo compared to the moonshot halo's they could've built. Still ~600mm^2 on N3P is a lotta silicon.

Tuna-Fish said:
Is MALL gone and folded into L2? It's not listed.

Now this is interesting, I think MALL outside of APUs will only be used for 3D stacked parts as only they have enough compute density to overwhelm GDDR7.
Still it appears they are increasing L2 by 4x, so the dynamics will end up somewhere between current AMD L2 and NV L2.
Potentially 2x or more bandwidth with 8x the capacity vs N48 for a moest latency hit would be good enough with 512b GDDR7 for an 8SE/192CU part.
MALL becomes necessary beyond 250CU or so.

Kepler_L2 · Jul 24, 2025

branch_suggestion said:
Yeah AT0 should be 192CU and AT2 72CU.
Both only make sense with 3SA/SE.
If we plebs don't get the full fat AT0 for gaming it will be the first time since Vega20, though that was a prosumer part. I think we will depending on where NV ends up.
Not shocked to see a probable H2'27 launch, all those console parts don't tape out or validate themselves.
N3P is whatever, but considering they are going for mono compute+MID N2 would be too yield/cost sensitive. Also 512b for the flagship is absolutely necessary to match NV on memory as otherwise they have easy wins in ML et al.

Nope, it is great. There is no such thing as a $1k market for dGPU, you either have your halo parts or your $500-$600 market.
AT2 is the exact part I was pushing for with RDNA4 had they used GDDR7, great config, the extra 2GB is a big deal.
NV only has that market because of lack of comp pressure, GB203 could be sold for <$700 no problem.
AT0 is a fairly modest halo compared to the moonshot halo's they could've built. Still ~600mm^2 on N3P is a lotta silicon.

Now this is interesting, I think MALL outside of APUs will only be used for 3D stacked parts as only they have enough compute density to overwhelm GDDR7.
Still it appears they are increasing L2 by 4x, so the dynamics will end up somewhere between current AMD L2 and NV L2.
Potentially 2x or more bandwidth with 8x the capacity vs N48 for a moest latency hit would be good enough with 512b GDDR7 for an 8SE/192CU part.
MALL becomes necessary beyond 250CU or so.

I think they are gargantuan SAs instead, 4 SE x 2 SA x 12 WGP for AT0 and 2 SE x 2 SA x 9 WGP for AT2

adroc_thurston · Jul 24, 2025

branch_suggestion said:
Not shocked to see a probable H2'27 launch, all those console parts don't tape out or validate themselves.

Oh no it's not that late.

Kepler_L2 said:
I think they are gargantuan SAs instead, 4 SE x 2 SA x 12 WGP for AT0 and 2 SE x 2 SA x 9 WGP for AT2

God they sure love changing the ratios and engine org every generation.

gdansk · Jul 24, 2025

I dare not speculate on the performance of it. No one, not even AMD, can predict Radeon frequencies.

reaperrr3 · Jul 24, 2025

branch_suggestion said:
Now this is interesting

I mean, with how poorly SRAM scales since N5, reducing the combined amount of L2+MALL is one of the easiest area wins, if you can cushion the negative impact on latency and bandwidth efficiency through other means.

adroc_thurston · Jul 24, 2025

gdansk said:
No one, not even AMD, can predict Radeon frequencies.

they'll get closer to 3.46 this time.

reaperrr3 said:
reducing the combined amount of L2+MALL is one of the easiest area wins

L2 macro is far bigger, and far less dense.

branch_suggestion · Jul 24, 2025

If we take SVP comments verbatim, this should be phase 2/3 of the comeback plan, beat NV on value in mainstream, perform well vs NV across ~90% of the market, finally after that Lisa will hopefully allow them to build the hydrogen bomb.

Kepler_L2 said:
I think they are gargantuan SEs instead, 4 SE x 2 SA x 12 WGP for AT0 and 2 SE x 2 SA x 9 WGP for AT2

Is SE scheduling that much harder than WGP scheduling in each SE to fatten them up that much?
I though XSX was bad enough.
I much prefer 8SE/4SE unless each SE is 2x the stuff of before so the same effective throughput.

adroc_thurston said:
Oh no it's not that late.

I sure hope not but the comments are weird unless that is just obfuscation.
Actually duh, AT0 launches first Q4'26/CES'27 with AT2 at Computex/Gamescom.

adroc_thurston said:
God they sure love changing the ratios and engine org every generation.

Meanwhile NV in client is just doing mild variations of the same compute hierarchy since Pascal, just pushed further each time.

gdansk · Jul 24, 2025

branch_suggestion said:
finally after that Lisa will hopefully allow them to build the hydrogen bomb.

This line up has what would be easily a 100 tflops+ gaming GPU. That isn't a hydrogen bomb?

branch_suggestion · Jul 24, 2025

gdansk said:
This line up has what would be easily a 100 tflops+ gaming GPU. That isn't a hydrogen bomb?

No, 200CU+, 200TFLOPS+, 200b xtor+ is the hydrogen bomb.
Only possible with a 3D design with MALL.

gdansk · Jul 24, 2025

branch_suggestion said:
No, 200CU+, 200TFLOPS+, 200b xtor+ is the hydrogen bomb.
Only possible with a 3D design with MALL.

Oh, it may only be seen in Cocytus then. They must have their ML ducks in a row first before trying to sell gamers that sorcery.

branch_suggestion · Jul 24, 2025

gdansk said:
Oh, it may only be seen in Cocytus then. They must have their ML ducks in a row first before trying to sell gamers that sorcery.

For example, one possible config is 256CU/4 or 8 SE which would need a little over 3Ghz to get to 200TF, give it 512b G7 and 256MB of MALL below along with Matrix cores in memory and that would be around 200b xtors using ~1k mm^2 of N2/N3 class.
That is the halo config I've been thinking about for some time, would need around 800W or so to hit those clocks I think.

adroc_thurston · Jul 24, 2025

gdansk said:
That isn't a hydrogen bomb?

No that's babymode, not even remotely flexing their 3D integration expertise.

branch_suggestion · Jul 24, 2025

adroc_thurston said:
they'll get closer to 3.46 this time.

FinFlex my beloved.

adroc_thurston said:
L2 macro is far bigger, and far less dense.

Yeah the overall area dedicated to cache won't change that much, but this seems to mean a return to de facto 2 cache levels, not a huge shock with RDNA4 being de facto 3 levels down from 4.
GDDR7 is not bad after all, Blackwell does not do it justice.

Discussion RDNA 5 / UDNA (CDNA Next) speculation

Golden Member

Golden Member

Golden Member

Golden Member

Diamond Member

Diamond Member

Diamond Member

Diamond Member

Platinum Member

Member

Diamond Member

Diamond Member

Senior member

Golden Member

Diamond Member

Diamond Member

Member

Diamond Member

Senior member

Diamond Member

Senior member

Diamond Member

Senior member

Diamond Member

Senior member