Discussion RDNA4 + CDNA3 Architectures Thread

Page 190 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

DisEnchantment

Golden Member
Mar 3, 2017
1,777
6,785
136
1655034287489.png
1655034259690.png

1655034485504.png

With the GFX940 patches in full swing since first week of March, it is looking like MI300 is not far in the distant future!
Usually AMD takes around 3Qs to get the support in LLVM and amdgpu. Lately, since RDNA2 the window they push to add support for new devices is much reduced to prevent leaks.
But looking at the flurry of code in LLVM, it is a lot of commits. Maybe because US Govt is starting to prepare the SW environment for El Capitan (Maybe to avoid slow bring up situation like Frontier for example)

See here for the GFX940 specific commits
Or Phoronix

There is a lot more if you know whom to follow in LLVM review chains (before getting merged to github), but I am not going to link AMD employees.

I am starting to think MI300 will launch around the same time like Hopper probably only a couple of months later!
Although I believe Hopper had problems not having a host CPU capable of doing PCIe 5 in the very near future therefore it might have gotten pushed back a bit until SPR and Genoa arrives later in 2022.
If PVC slips again I believe MI300 could launch before it :grimacing:

This is nuts, MI100/200/300 cadence is impressive.

1655034362046.png

Previous thread on CDNA2 and RDNA3 here

 
Last edited:

Saylick

Diamond Member
Sep 10, 2012
3,925
9,151
136
Level 0 – Legacy Solutions
Level 1 – Software on Traditional GPUs
Level 2 – Ray/Box and Ray/Tri Testers in Hardware
Level 3 – Bounding Volume Hierarchy (BVH) Processing in Hardware
Level 4 – BVH Processing with Coherency Sorting in Hardware
Level 5 – Coherent BVH Processing with Scene Hierarchy Generator in Hardware

Interesting what level is RDNA4 lvl3?
Probably Level 3 at worst, Level 4 at best. RDNA2/3 was already Level 2. ARC is Level 4, Ada is Level 4.
 
  • Like
Reactions: Tlh97 and Elfear

SolidQ

Golden Member
Jul 13, 2023
1,452
2,366
106
from PS5pro deep dive about RDNA4 RT
386f2a809ecef8d243f1c1897e20a8ff.png


c390c1b1c2f6e34a5c16c46d62847a3a.png


PS5pro AI
f06150c523c6cdb40e687d53c5804789.png
 
Last edited:

Saylick

Diamond Member
Sep 10, 2012
3,925
9,151
136
Looks like RDNA 4 finally gets something for thread coherency, which means it's Level 4.

Also, Amethyst is not about proprietary features and is a collaboration between Sony and AMD. This is good, if it means FSR4 gets to leverage what's going on under-the-hood in PSSR.
 

Mopetar

Diamond Member
Jan 31, 2011
8,438
7,633
136
I just want an efficient architecture with no major bottlenecks that can run my games at reasonable frame rates without needing to rely on upsampling or other trickery to make bigger bars solely for the purpose of making me feel better about my purchase.

I get that some people are excited about these technologies (and if supported they do give the card longer legs down the road) but I mainly view them as taking resources away from what I want. I don't care how good FSR4 is. I hope to never have to turn it on in the first place.
 

Win2012R2

Senior member
Dec 5, 2024
984
980
96
Very interesting lack of dual issue (which works bad, far worse than Nvidia's solution) in PS5 Pro, AMD needs to rethink it for RDNA 5.
 

Win2012R2

Senior member
Dec 5, 2024
984
980
96
gfx13 probably uses MI300's method of dual FP32.
Is it more like NVIDIA's implementation, which isn't exactly double speed, but it seems to be a lot better than AMD's? I guess it can only be better than current sad state of affairs.
 

Saylick

Diamond Member
Sep 10, 2012
3,925
9,151
136
Is it more like NVIDIA's implementation, which isn't exactly double speed, but it seems to be a lot better than AMD's? I guess it can only be better than current sad state of affairs.
I tried to look into how CDNA 3's dual issue works and based on what I can understand from C&C's article, it still has its limitations, especially since it appears more suited to how GCN works vs RDNA. Nvidia's approach appears to be more flexible overall, imo.
 
  • Like
Reactions: Tlh97 and Win2012R2

Bigos

Member
Jun 2, 2019
198
514
136
I am still waiting for a day when the game engine developer job - the upscaling - is actually done by game engine developers and not ISV... This stupid precedent is made by Nvidia having enough developers to create a vendor lock in...
 

marees

Golden Member
Apr 28, 2024
1,258
1,814
96
Also, Amethyst is not about proprietary features and is a collaboration between Sony and AMD. This is good, if it means FSR4 gets to leverage what's going on under-the-hood in PSSR.
Looks like Amethyst is a generalized architecture (like Vulkan) for using CNNs. What nvidia is using in Tensor Cores

Sony said this is a 4 year project. So probably something for RDNA 5, 6 (or UDNA 1, 2) & PS6

For PS5 pro, sony has mentioned that they are using register memory as RAM to avoid latency for PSSR (Again no Tensor cores). I wonder if RDNA 4 uses same architecture for FSR 4 ?
 
  • Like
Reactions: Tlh97 and Win2012R2