Discussion RDNA 5 / UDNA (CDNA Next) speculation

Page 90 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

adroc_thurston

Diamond Member
Jul 2, 2023
8,476
11,197
106
Now with -8CU (-25%), +50% size SE, halved RB and Rasterizer (-50%), no MALL, L2 ?<32MB, LPDDR5X/?LPDDR6 weak memory interface.
1/(-25% WGP (0.75) x +15% freq (1.15)) = +16% IPC when compute bound. Higher in raster games with -50% RB + Rasterizer offset (unless bigger HW).
see assumptions like these are hard for a veeeeery different uarch
 

MrMPFR

Senior member
Aug 9, 2025
213
407
96
see assumptions like these are hard for a veeeeery different uarch
I'm just listing the things the weak points (on paper specs) of AT4 based on what has been said so far. Obviously clean slate approach is mandatory.

+15% clk is high end of TSMC node scaling for N3P. +10% clk= 21% higher raster IPC.

Are the number of vector ALUs per WGP/CU vs RDNA4 WGP different? Any other assumptions that need to be changed?
 

Kepler_L2

Golden Member
Sep 6, 2020
1,081
4,665
136
I'm just listing the things the weak points (on paper specs) of AT4 based on what has been said so far. Obviously clean slate approach is mandatory.

+15% clk is high end of TSMC node scaling for N3P. +10% clk= 21% higher raster IPC.

Are the number of vector ALUs per WGP/CU vs RDNA4 WGP different? Any other assumptions that need to be changed?
256 for both but RDNA5 has much fewer dual-issue restrictions.
 
  • Like
Reactions: MrMPFR

adroc_thurston

Diamond Member
Jul 2, 2023
8,476
11,197
106

reaperrr3

Member
May 31, 2024
172
495
96
So it'd be the same as Strix Halo (and much less than the 9060 XT)
Don't underestimate how inferior RDNA3(.5) is in terms of bw efficiency vs. RDNA4, let alone RDNA5.

I mean, the 7900 XT already scales worse at 4K than the XTX, meaning even 320bit / 80MB IF$ aren't enough to fully saturate bw at that perf level for RDNA3.

Meanwhile, N48 does more with even less IF$/mem bw, and RDNA5 appears to have several more bw/cache efficiency improvements.

I'd wager the main challenge for desktop AT3/4 will be per-GB-cost of LP5X/LP6 at the planned release window, not bandwidth.
 

branch_suggestion

Senior member
Aug 4, 2023
887
1,933
106
It's going to suffocate from lack of bandwidth. Pretty obvious that the marketing's going to entirely be around VRAM. Well, depending on how expensive the 8 GB LPDDR6 is then.
Just for the record, AT4 LP6 minimum is 16GB, AT3 is 32GB.
MDSP minimum is realistically 32GB.
LP6 is gonna be a fair bit cheaper per GB than G7.
 
  • Like
Reactions: lightmanek

basix

Senior member
Oct 4, 2024
313
611
96
16GB for AT4 cards would be really good. 32GB for AT3 looks like overkill and really weird when AT2 will feature only 18/24 GByte of GDDR7 memory.

If using LPDDR5X with 12.7 GT/s (Samsung has announced such memory modules):
- 508 GByte/s should be enough for AT3. AT2 with 192bit GDDR7 running at 32 Gbps will have 768 GB/s of bandwidth, so +50% CU and +50% bandwidth would match perfectly
- And then use 16 or 24 GByte for AT3
 

MrMPFR

Senior member
Aug 9, 2025
213
407
96
You can't.
DRAM densities too high to get that on an octachannel.
Yeah not happening. Samsung only lists LPDDR5X-10677 x64 at 96-192Gb densities. So 12-24GB x 4 = 48-96GB.

Cadence says LPDDR6 starts at 4GB device densities (x24). 16 of those in AT3 = 64GB. If this is wrong at x48 is device bus width then 32GB is still a lot.
 

T2098

Junior Member
Oct 10, 2024
2
3
41
They certainly do make 512Mx64 (32gbit / 4GiB) LPDDR5X packages though. I've found up to 9600MT/s from Micron, so it's not a huge stretch to think that the 10677 MT/s grade might exist by the time RDNA5 releases.1769808340374.png
 
  • Like
Reactions: MrMPFR