Discussion RDNA4 + CDNA3 Architectures Thread

Page 175 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

DisEnchantment

Golden Member
Mar 3, 2017
1,777
6,785
136
1655034287489.png
1655034259690.png

1655034485504.png

With the GFX940 patches in full swing since first week of March, it is looking like MI300 is not far in the distant future!
Usually AMD takes around 3Qs to get the support in LLVM and amdgpu. Lately, since RDNA2 the window they push to add support for new devices is much reduced to prevent leaks.
But looking at the flurry of code in LLVM, it is a lot of commits. Maybe because US Govt is starting to prepare the SW environment for El Capitan (Maybe to avoid slow bring up situation like Frontier for example)

See here for the GFX940 specific commits
Or Phoronix

There is a lot more if you know whom to follow in LLVM review chains (before getting merged to github), but I am not going to link AMD employees.

I am starting to think MI300 will launch around the same time like Hopper probably only a couple of months later!
Although I believe Hopper had problems not having a host CPU capable of doing PCIe 5 in the very near future therefore it might have gotten pushed back a bit until SPR and Genoa arrives later in 2022.
If PVC slips again I believe MI300 could launch before it :grimacing:

This is nuts, MI100/200/300 cadence is impressive.

1655034362046.png

Previous thread on CDNA2 and RDNA3 here

 
Last edited:

Tuna-Fish

Golden Member
Mar 4, 2011
1,649
2,474
136
Let me rephrase that, they are probably still making RDNA3 cards. The economics for AIB manufacture are much more favorable for slow continuous production. But the chips they are putting in those cards were all made a long time ago, and they are probably not making any more.
 

marees

Golden Member
Apr 28, 2024
1,259
1,817
96
Let me rephrase that, they are probably still making RDNA3 cards. The economics for AIB manufacture are much more favorable for slow continuous production. But the chips they are putting in those cards were all made a long time ago, and they are probably not making any more.
You mean the chips were baked in the oven, a long time ago.
But these cold chips are being packed into fresh GPU cards

Why not bake less RDNA 3 ?
Or did they place RDNA 3 orders for the chips when RDNA 2 was at its peak due to covid shortage
 

Tuna-Fish

Golden Member
Mar 4, 2011
1,649
2,474
136
You mean the chips were baked in the oven, a long time ago.
But these cold chips are being packed into fresh GPU cards
Yep.
Why not bake less RDNA 3 ?
Or did they place RDNA 3 orders for the chips when RDNA 2 was at its peak due to covid shortage
Yep. It's all based on estimation of demand. They overestimated, so they have too much of them. They may have underestimated some other product, and now have too little of it. The fundamental problem here is that the timeline from ordering chips from TSMC to delivery of actual product to consumers is way too long for product cycles. You cannot really respond to demand, you just estimate it and hope you got it right. If you guessed too high, you get stuck with piles of rapidly depreciating chips that cost you a pretty penny, if you guessed too low you run out of product to sell. Both situations suck, but the former sucks more. They are not really trading off between products, as the relevant process has slack at TSMC.
 

Mahboi

Golden Member
Apr 4, 2024
1,058
1,969
96
Wait I just realised, that means that on CES they'll advertise both Halo and RDNA 4?
Sounds like mixed signals already.

"this is our new top tier amazing APU with our new-old RDNA 3.5 that has meh raytracing that's still amazing raytracing, and this is our real new architecture that has actual raytracing hardware that makes even more amazing raytracing"
 

jpiniero

Lifer
Oct 1, 2010
16,499
7,003
136
"this is our new top tier amazing APU with our new-old RDNA 3.5 that has meh raytracing that's still amazing raytracing, and this is our real new architecture that has actual raytracing hardware that makes even more amazing raytracing"

I imagine Lisa will spend the entire time on Strix Halo talking about AI.
 

SolidQ

Golden Member
Jul 13, 2023
1,452
2,368
106
5085849039236c9d0a33375d37c87db2.png


RDNA4 is 8x2? Based on Kepler_l2 info
 

soresu

Diamond Member
Dec 19, 2014
3,899
3,331
136
"this is our new top tier amazing APU with our new-old RDNA 3.5 that has meh raytracing that's still amazing raytracing, and this is our real new architecture that has actual raytracing hardware that makes even more amazing raytracing"
They'll probably just downplay Halo's RT to focus on overall perf.
 

Mahboi

Golden Member
Apr 4, 2024
1,058
1,969
96
Networking is kind of a sore spot for me so I'm really hoping for someone clever to do a good Pensando P4/NVLink competitor review. Real curious to see how it compares.
 

itsmydamnation

Diamond Member
Feb 6, 2011
3,047
3,842
136
Makes more sense, I was thinking of Daft Punk or something for a second there...
i dont know NVlink well , but wiki has 25gbps per lane for NVlink5.0 , so im assuming based off SFP28 targeted standards , so not even using PAM4. this would mean that at the MAC level NV is a technology generation behind. Not that it really matters, because ultimately what will drive scale is overall network fabric architecture vs number of PCI lanes attached/allocated.

DPU are a bit of a wank , really all they are doing is moving switch fabric logic to the NIC , this way the edge intelligence can be a physical hop closer to the CPU/GPU. Modern high end Carrier protocols ( Segment Routing ) push alot of capabilities for multipathing , traffic engineering , affinity / anti-affinity to the edge device. So i expect all of forest's comments to be in effect , lets pick up the current carrier MPLS stack ( IS-IS / BGP , SR , PCE ,PCC ) and extend the boundary one hop. Nothing that he talked about is new from a high end high performance carrier network perspective.

But i can see how it would help AI workloads that are push 95% fabric bandwidth , normally in networking your ringing major alarms for upgrades at => 70%
 
Last edited:

Saylick

Diamond Member
Sep 10, 2012
3,925
9,154
136
5085849039236c9d0a33375d37c87db2.png


RDNA4 is 8x2? Based on Kepler_l2 info
There’s 2 Ray Accelerators in a RDNA2/3/3.5 WGP and each RA can do 4 box tests per clock or 1 triangle tests per clock.

RDNA4 either doubles the number of RAs or each RA can do 2x the throughput. Not sure of which it will be but RDNA4 should be twice as performant at minimum.
 

ToTTenTranz

Senior member
Feb 4, 2021
463
853
136
Package size is influenced not just by die size but also I/O capabilities. N44 is very N23-like in this regard with 128-bit bus and (I think) 8x PCIe.
Is RDNA4 PCIe 4 or 5?

8GB VRAM + 8x PCIe 4.0 doesn't bode well for a GPU that is supposedly replacing Navi 22 in raster performance.
 

marees

Golden Member
Apr 28, 2024
1,259
1,817
96
Was the N41 the halo radeon. Now N48 is the strongest Radeon and is not here to outperform Nvidia halo now.
My understanding:

N44 = 5060 / 4060 ti
N43/N48 = 5070 / 4070 super / 4070 ti
N42 = 5080 / 4080 super
N41 = 4090

Was there a N40c on top of N41 ??
 

static shock

Member
May 25, 2024
133
61
61
Wow. 399 is possible of there's not much power consumption to package it. Developing the cards costs much more, 3NE is not cheap but hey if they can get this performance at 2.5 times less power consumption is a solid win for AMD. $399 for such a strong card is a winner.
 

static shock

Member
May 25, 2024
133
61
61
My understanding:

N44 = 5060 / 4060 ti
N43/N48 = 5070 / 4070 super / 4070 ti
N42 = 5080 / 4080 super
N41 = 4090

Was there a N40c on top of N41 ??
Its more about HD 5870/5850(7800 xt sucessor) being N48 and 5770/5750 being N44(RX 7600 sucessor). No GTX 570/580 competitor. Got it?


Disclaimer: i'm not a tech journalist and have no NDA to hide anything, these rumor posts are just Guesses of what can happen. They did Zen1 they did Zen5 they did 4850 why they can't do another marvel like these?
 
Last edited: