Discussion RDNA4 + CDNA3 Architectures Thread

DisEnchantment

Golden Member
Mar 3, 2017
1,777
6,791
136
1655034287489.png
1655034259690.png

1655034485504.png

With the GFX940 patches in full swing since the first week of March, it is looking like MI300 is not far off!
Usually AMD takes around 3 quarters to get support into LLVM and amdgpu. Lately, since RDNA2, the window in which they push support for new devices has been much reduced to prevent leaks.
But looking at the flurry of code in LLVM, there are a lot of commits. Maybe that is because the US Govt is starting to prepare the SW environment for El Capitan (maybe to avoid a slow bring-up situation like Frontier's, for example).

See here for the GFX940 specific commits
Or Phoronix

There is a lot more if you know whom to follow in LLVM review chains (before getting merged to github), but I am not going to link AMD employees.

I am starting to think MI300 will launch around the same time as Hopper, probably only a couple of months later!
Although I believe Hopper had the problem of not having a host CPU capable of PCIe 5 in the very near future, so it might have been pushed back a bit until SPR and Genoa arrive later in 2022.
If PVC slips again, I believe MI300 could launch before it :grimacing:

This is nuts, MI100/200/300 cadence is impressive.

1655034362046.png

Previous thread on CDNA2 and RDNA3 here

 

SolidQ

Golden Member
Jul 13, 2023
1,504
2,473
106
Would be cool if RDNA4 can beat at least the 5070.
Based on this, maybe 5070 = 4080, so RDNA4 may end up competing with the RTX 5060 Ti.
7bf0bb98c9ff91fcf9564cdd7a72f068.png
 

TESKATLIPOKA

Platinum Member
May 1, 2020
2,696
3,260
136
So the specs look like this for now:
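| RDNA4 | Size | SE | Clocks | WGP | CU | Dual-issue Shaders | TMU | ROPs | Infinity Cache | Bus width | GDDR6 | Mem BW | IC BW (originally labeled Effective BW) | VRAM |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N48 | ~240 mm² | 4 | ? | 32 | 64 | 4096 ? | 256 ? | 96 ? | 64MB ? | 256-bit | 21.65gbps | 693 GB/s | 2770 GB/s | 16GB |
| N44 | ~130 mm² | 2 ? | ? | 16 | 32 | 2048 ? | 128 ? | 64 ? | 32MB ? | 128-bit | 18gbps | 288 GB/s | 515 GB/s | 8-16GB ? |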
WGP is supposedly new so I don't know what changes that means for the specs.

N44 doesn't look that good for higher resolutions thanks to the underwhelming memory subsystem.

edit: Ok, it's probably just IC BW and not effective BW.

edit2: also included memory BW
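For anyone wanting to check the Mem BW figures, here is a minimal sketch of the usual peak-bandwidth arithmetic: bus width in bits divided by 8, times the per-pin data rate in Gbps. The function name `gddr_bandwidth_gb_s` is just an illustrative helper, and the N48/N44 inputs are the rumored values from the table, not confirmed specs.

```python
def gddr_bandwidth_gb_s(bus_width_bits: int, data_rate_gbps: float) -> float:
    """Peak memory bandwidth in GB/s: (bus width in bits / 8) * per-pin rate in Gbps."""
    return bus_width_bits / 8 * data_rate_gbps


# (bus width in bits, per-pin data rate in Gbps)
# N48/N44 values are the rumored numbers from this thread, not confirmed specs.
configs = {
    "N48 (rumored)": (256, 21.65),
    "N44 (rumored)": (128, 18.0),
    "RTX 4090": (384, 21.0),
}

for name, (bus, rate) in configs.items():
    print(f"{name}: {gddr_bandwidth_gb_s(bus, rate):.1f} GB/s")
```

This only covers raw memory controller bandwidth; it says nothing about Infinity Cache bandwidth or hit rates, which is what the last BW column is about.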
 

Mahboi

Golden Member
Apr 4, 2024
1,058
1,969
96
Are you sure about what you wrote?
RTX 4090: 384-bit paired with 21gbps GDDR6X = 1008 GB/s

Effective BW is Infinity cache hitrate + BW from memory controller.
Nope, not sure. That's why it's a question.
The number just seems very high compared to N44. It's more than five times higher.
I'd be more willing to bet that it's the estimated clocks at 2770 MHz and 2515 MHz, but the leading 2 was missed or something.
 

Timorous

Golden Member
Oct 27, 2008
1,977
3,861
136
So the specs look like this for now:
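| RDNA4 | Size | SE | Clocks | WGP | CU | Dual-issue Shaders | TMU | ROPs | Infinity Cache | Bus width | GDDR6 | Effective BW | VRAM |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N48 | ~240 mm² | 4 | ? | 32 | 64 | 4096 ? | 256 ? | 96 ? | 64MB ? | 256-bit | 21.65gbps | 2770 GB/s | 16GB |
| N44 | ~130 mm² | 2 ? | ? | 16 | 32 | 2048 ? | 128 ? | 64 ? | 32MB ? | 128-bit | 18gbps | 515 GB/s | 8-16GB ? |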
WGP is supposedly new so I don't know what changes that means for the specs.

N44 doesn't look that good for higher resolutions thanks to the underwhelming memory subsystem.

It will be fine at 1080p / 1440p with compromises or upscaling and N48 will be fine for 1440p and 4k with compromises or upscaling.
 

TESKATLIPOKA

Nope, not sure. That's why it's a question.
The number just seems very high compared to N44. It's more than five times higher.
I'd be more willing to bet that it's the estimated clocks at 2770 MHz and 2515 MHz, but the leading 2 was missed or something.
You were too fast. I changed that post of mine, check it out.
I think it's very unlikely he was talking about clockspeed.
 

Mahboi

It will be fine at 1080p / 1440p with compromises or upscaling and N48 will be fine for 1440p and 4k with compromises or upscaling.
32 CUs of non-broken RDNA 3/RDNA 3.5 would probably already handle 1440p quite well. The current broken 7600 is already very close. Assuming 20% extra perf, it would honestly surprise me if it couldn't reach 6700 XT or so performance. And with the expected RDNA 4 improvements, maybe it'll be closer to 7700 XT performance...except it has that tiny bus.
Same with 64 CUs of non-broken RDNA 3: that would already gain a solid 15% extra, which would put the 7800 XT around 4070 Ti performance? And then RDNA 4 improvements on top of it. I don't think there's going to be a lot of compromises compared to the current top range. Unless you think a 7900 XT is too weak for 4K.
 

Abwx

Lifer
Apr 2, 2011
11,885
4,873
136
So the specs look like this for now:
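| RDNA4 | Size | SE | Clocks | WGP | CU | Dual-issue Shaders | TMU | ROPs | Infinity Cache | Bus width | GDDR6 | IC BW (originally labeled Effective BW) | VRAM |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N48 | ~240 mm² | 4 | ? | 32 | 64 | 4096 ? | 256 ? | 96 ? | 64MB ? | 256-bit | 21.65gbps | 2770 GB/s | 16GB |
| N44 | ~130 mm² | 2 ? | ? | 16 | 32 | 2048 ? | 128 ? | 64 ? | 32MB ? | 128-bit | 18gbps | 515 GB/s | 8-16GB ? |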
WGP is supposedly new so I don't know what changes that means for the specs.

N44 doesn't look that good for higher resolutions thanks to the underwhelming memory subsystem.

edit: Ok, it's probably just IC BW and not effective BW.
256-bit bus: 693 GB/s and 2770 MHz with 64 CUs
128-bit bus: 288 GB/s and perhaps 3515 MHz along with 32 CUs; it would be odd if the frequency were only 2515 MHz for the smaller GPU.
 

TESKATLIPOKA

32 CUs of non-broken RDNA 3/RDNA 3.5 would probably already handle 1440p quite well. The current broken 7600 is already very close. Assuming 20% extra perf, it would honestly surprise me if it couldn't reach 6700 XT or so performance. And with the expected RDNA 4 improvements, maybe it'll be closer to 7700 XT performance...except it has that tiny bus.
Same with 64 CUs of non-broken RDNA 3: that would already gain a solid 15% extra, which would put the 7800 XT around 4070 Ti performance? And then RDNA 4 improvements on top of it. I don't think there's going to be a lot of compromises compared to the current top range. Unless you think a 7900 XT is too weak for 4K.
That depends on what you mean by "handle quite well". >60 FPS?
average-fps-per-game-2560-1440.png

In raster most likely yes, unless it's a game like Cities: Skylines II. :p
RT is another story.

P.S. I am talking about N44
 

TESKATLIPOKA

256-bit bus: 693 GB/s and 2770 MHz with 64 CUs
128-bit bus: 288 GB/s and perhaps 3515 MHz along with 32 CUs; it would be odd if the frequency were only 2515 MHz for the smaller GPU.
What? I have question marks for clocks.
I don't think this was meant for me.