Discussion RDNA4 + CDNA3 Architectures Thread

Page 476 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

DisEnchantment

Golden Member
Mar 3, 2017
1,777
6,791
136
1655034287489.png
1655034259690.png

1655034485504.png

With the GFX940 patches in full swing since first week of March, it is looking like MI300 is not far in the distant future!
Usually AMD takes around 3Qs to get the support in LLVM and amdgpu. Lately, since RDNA2 the window they push to add support for new devices is much reduced to prevent leaks.
But looking at the flurry of code in LLVM, it is a lot of commits. Maybe because US Govt is starting to prepare the SW environment for El Capitan (Maybe to avoid slow bring up situation like Frontier for example)

See here for the GFX940 specific commits
Or Phoronix

There is a lot more if you know whom to follow in LLVM review chains (before getting merged to github), but I am not going to link AMD employees.

I am starting to think MI300 will launch around the same time like Hopper probably only a couple of months later!
Although I believe Hopper had problems not having a host CPU capable of doing PCIe 5 in the very near future therefore it might have gotten pushed back a bit until SPR and Genoa arrives later in 2022.
If PVC slips again I believe MI300 could launch before it :grimacing:

This is nuts, MI100/200/300 cadence is impressive.

1655034362046.png

Previous thread on CDNA2 and RDNA3 here

 
Last edited:

basix

Senior member
Oct 4, 2024
239
489
96
Yeah, I do not expect any refresh either. N48 gets already pushed towards its max. frequency anyways (9070 XT).

Stuff like a 9070 GRE 12 / 16 GByte is also already there to fill up some lineup holes and maximize salvaged yield of N48.
Maybe we'll see a 9060 16 GByte in the future. That will be it for RDNA4.
 
  • Like
Reactions: Tlh97 and marees

marees

Golden Member
Apr 28, 2024
1,727
2,362
96
Maybe we'll see a 9060 16 GByte in the future. That will be it for RDNA4
I see possibility of N44 derivatives extending into RDNA 5 era as the AT3 / 10060xt is lpddr based (& hence could suffer some performance regressions)
 

ToTTenTranz

Senior member
Feb 4, 2021
686
1,145
136
Maybe we'll see a 9060 16 GByte in the future. That will be it for RDNA4.
Some rumor mills keep insisting on the Q1 26 release of a N48 with GDDR7 and increased power budget for 10-15% higher core clocks.



I see possibility of N44 derivatives extending into RDNA 5 era as the AT3 / 10060xt is lpddr based (& hence could suffer some performance regressions)
I think / hope AMD isn't going to raise many waves around the LPDDR5X versions of AT3 and AT4, unless these chips can take on Samsung's (still hypothetical) 12.7MT/s memory.
406GB/s and 203GB/s sound like the bare minimum bandwidth AT3 and AT4 should have.
 

basix

Senior member
Oct 4, 2024
239
489
96
I see possibility of N44 derivatives extending into RDNA 5 era as the AT3 / 10060xt is lpddr based (& hence could suffer some performance regressions)
N44 could live on the bottom end of the lineup. But mainly due to N4P. A 9060 non-XT with 16 GByte would be perfect for that (28 CU salvage, only 3.0 GHz boost, cost optimal 18 Gbps GDDR6).

Memory bandwidth should not be an issue for AT3 and AT4. Quad-Channel LPDDR6 delivers up to 690 GByte/s (14.4 Gbps which is advertised by Cadence already). A 9060 XT delivers 640 GByte/s.
Sure, RDNA5 needs to deliver enhanced bandwidth efficiency but I think that will happen. Why?
- AT4 = 256...345 GByte/s with 12 CU (192bit LPDDR6 10.67...14.4 Gbps)
- AT3 = 512...690 GByte/s with 24 CU (384bit LPDDR6 10.67...14.4 Gbps)
- AT2 = 768...864 GByte/s with 36 CU (192bit GDDR7 32...36 Gbps)

If AT4 and AT3 are bandwidth limited, AT2 would it be as well.

Salvage or mobile variants of AT3 and AT4 could use (fast) LPDDR5X instead of LPDDR6. This will be more of a power efficiency and cost question. As ToTTenTranz mentioned, 200 & 400 GB/s sound like a minimum bandwidth requirement. But 12.7 Gbps LPDDR5X could be hard to buy, maybe mobile variants will go down to 10.7 Gbps (which is available from all LPDDR5X manufacturers). 171 & 342 GByte/s is not much. But if salvaged (e.g. 10 & 20 CU for AT4 and AT3 respectively) and lowered clockrates for mobile (e.g. 1.2x lower) we get into the right range relatively to a full-chip and higher clockrate variant for desktop cards.
 
Last edited: