• We’re currently investigating an issue related to the forum theme and styling that is impacting page layout and visual formatting. The problem has been identified, and we are actively working on a resolution. There is no impact to user data or functionality, this is strictly a front-end display issue. We’ll post an update once the fix has been deployed. Thanks for your patience while we get this sorted.

Discussion RDNA4 + CDNA3 Architectures Thread

Page 235 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

DisEnchantment

Golden Member
1655034287489.png
1655034259690.png

1655034485504.png

With the GFX940 patches in full swing since first week of March, it is looking like MI300 is not far in the distant future!
Usually AMD takes around 3Qs to get the support in LLVM and amdgpu. Lately, since RDNA2 the window they push to add support for new devices is much reduced to prevent leaks.
But looking at the flurry of code in LLVM, it is a lot of commits. Maybe because US Govt is starting to prepare the SW environment for El Capitan (Maybe to avoid slow bring up situation like Frontier for example)

See here for the GFX940 specific commits
Or Phoronix

There is a lot more if you know whom to follow in LLVM review chains (before getting merged to github), but I am not going to link AMD employees.

I am starting to think MI300 will launch around the same time like Hopper probably only a couple of months later!
Although I believe Hopper had problems not having a host CPU capable of doing PCIe 5 in the very near future therefore it might have gotten pushed back a bit until SPR and Genoa arrives later in 2022.
If PVC slips again I believe MI300 could launch before it :grimacing:

This is nuts, MI100/200/300 cadence is impressive.

1655034362046.png

Previous thread on CDNA2 and RDNA3 here

 
Last edited:
Well i still think rdna4 simds can access other simds beoynd the CU vs rdna3 (within cu) adroc already laughed at me for this ~1year ago.
This would be true dual issue for true 50Tflops of single precision = 4080s (has 50)
 
Well i still think rdna4 simds can access other simds beoynd the CU vs rdna3 (within cu) adroc already laughed at me for this ~1year ago.
This would be true dual issue for true 50Tflops of single precision = 4080s (has 50)
shader oomph is whatever. look elsewhere.
 
shader oomph is whatever. look elsewhere.
well there are few things pointing to this.
1. they changed the instructions
2. better prefetch / better identify and optimize dual issue opportunities
3. more flexible instruction scheduling

But this would req some global memory
 
well there are few things pointing to this.
1. they changed the instructions
2. better prefetch / better identify and optimize dual issue opportunities
3. more flexible instruction scheduling
Again, flops are whatever. look elsewhere.
They're getting a ton of perf outta 256b 20Gbps GDDR6 so try to imagine how and why.
 
Again, flops are whatever
Cerny!
Mark-Cerny-PlayStation-5-Pro-Flopflation.jpg
 
I hope someone posts a video / comments about it

Went there right now.

It's running on an engineering sample GPU so I'm assuming it's the 9070/XT.

Frame rate seems to be around 30FPS (there's no frame counter) but for some reason it's set with VSync off nor freesync, so lots of tearing when you pan around.

As for how the demo looks, I really can't say it looks transformative. I guess it's more interesting to know what's happening behind it especially when you turn denoising off and then you definitely notice it.

1000066520.jpg

1000066521.jpg
 
Yeah, + need check it's real 9070 or XT
If the sticker on that GPU is the same as the engineering sample posted above good luck finding what GPU is in there.
I don't think anyone can walk in there and run CoD on that PC. If that was the case we would have seen all tech media posting CoD benchmarks today :tearsofjoy:
That benchmark provided by IGN was most likely allowed by AMD. With was purpose ? IDK. Will it bite them in the back ? IDK.
 
Here's to hoping. AMD and FSR is what gives me the option for Frame Gen in CP2077 on my 3080. Thanks Nvidia!
You actually want fake frames???

I mean Nvidia also gave improvements to 30 series with DLSS4 that are way more useful than fake frames especially for a 3080 user.
 
Back
Top