Discussion RDNA4 + CDNA3 Architectures Thread

Page 466 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

DisEnchantment

Golden Member
Mar 3, 2017
1,777
6,787
136
1655034287489.png
1655034259690.png

1655034485504.png

With the GFX940 patches in full swing since first week of March, it is looking like MI300 is not far in the distant future!
Usually AMD takes around 3Qs to get the support in LLVM and amdgpu. Lately, since RDNA2 the window they push to add support for new devices is much reduced to prevent leaks.
But looking at the flurry of code in LLVM, it is a lot of commits. Maybe because US Govt is starting to prepare the SW environment for El Capitan (Maybe to avoid slow bring up situation like Frontier for example)

See here for the GFX940 specific commits
Or Phoronix

There is a lot more if you know whom to follow in LLVM review chains (before getting merged to github), but I am not going to link AMD employees.

I am starting to think MI300 will launch around the same time like Hopper probably only a couple of months later!
Although I believe Hopper had problems not having a host CPU capable of doing PCIe 5 in the very near future therefore it might have gotten pushed back a bit until SPR and Genoa arrives later in 2022.
If PVC slips again I believe MI300 could launch before it :grimacing:

This is nuts, MI100/200/300 cadence is impressive.

1655034362046.png

Previous thread on CDNA2 and RDNA3 here

 
Last edited:

marees

Golden Member
Apr 28, 2024
1,345
1,943
96

Keller_TT

Member
Jun 2, 2024
146
167
76
Nor sure if this will interest many, but I'm definitely excited. Modular, after announcing top-tier support for CDNA 3 with their Max platform and Mojo, have announced that they want to bring it to consumer GPUs too (RDNA 3 & 4) for the hobbyists at home. We're now this close to getting rid of the CUDA swamp.

The catch is that CDNA uses true Matrix cores with MFMA intrinsics, while RDNA does Matrix multiplication through WMMA instructions, which have been overhauled in RDNA4 and not backwards-compatible with RDNA 3 (also a main reason why FSR 4 isn't for RDNA 3). At the moment, they are relying on eager volunteers to help along.
I'm keeping an eye on it, and will offer what little I can do. I'm just genuinely chuffed that I can soon write GPU kernels on my RDNA4 that'll be fun, and not an experience in hell.
 

ToTTenTranz

Senior member
Feb 4, 2021
501
910
136
The catch is that CDNA uses true Matrix cores with MFMA intrinsics, while RDNA does Matrix multiplication through WMMA instructions, which have been overhauled in RDNA4 and not backwards-compatible with RDNA 3 (also a main reason why FSR 4 isn't for RDNA 3). At the moment, they are relying on eager volunteers to help along.

My only problem with this big departure from RDNA3's ML capabilities is the fact that AMD is getting stuck with that architecture in their APUs for another 2 years.

As it stands, FSR4 is useless for all Phoenix, Strix and Medusa APUs, while Intel already has XMX cores in their Lunar Lake and Arrow Lake H SoCs that run Xess 2.
 
  • Like
Reactions: Tlh97 and marees

marees

Golden Member
Apr 28, 2024
1,345
1,943
96
My only problem with this big departure from RDNA3's ML capabilities is the fact that AMD is getting stuck with that architecture in their APUs for another 2 years.

As it stands, FSR4 is useless for all Phoenix, Strix and Medusa APUs, while Intel already has XMX cores in their Lunar Lake and Arrow Lake H SoCs that run Xess 2.
Microsoft partnership should net better APUs from 2027 onwards, maybe with MFMA too 🤔
 

DAPUNISHER

Super Moderator CPU Forum Mod and Elite Member
Super Moderator
Aug 22, 2001
31,779
31,790
146
My only problem with this big departure from RDNA3's ML capabilities is the fact that AMD is getting stuck with that architecture in their APUs for another 2 years.

As it stands, FSR4 is useless for all Phoenix, Strix and Medusa APUs, while Intel already has XMX cores in their Lunar Lake and Arrow Lake H SoCs that run Xess 2.
Man, this is one of those situations where as a gamer, my criteria would have me choosing AMD. Here's why: The Intel software is a weak point. No integrated recording or streaming. Driver issues are more prevalent, and some engines just don't seem to play well with the hardware. The CPU overhead is another weak point that needs to be taken into consideration.

Also, if the APUs are in handhelds or thin and lights with small screens, the DP4a version of XeSS will do a great job on the AMD hardware. I find myself choosing XeSS on AMD cards in almost every game that has it. The performance is usually not significantly lower and the IQ is predominantly better. AMD in Linux is outperforming windows in quite a few games now, sometimes drastically so, with much better 1% lows too.

The Intel offering would have to be substantially faster in both windows and Linux at the same price to win me over.
 
  • Like
Reactions: marees