Question Speculation: RDNA3 + CDNA2 Architectures Thread

Obviously a CG representation, but looks like two dies, each having four HBM memory stacks. Not quite the chiplet design I had imagined. Then a question is, does each module show up as a single GPU?
 
MI200 has 95TF of fp32 performance. 🤯
I think it's TF32, not FP32

Seems like it appears as 2 GPUs and isn't all that connected really. For sure wouldn't work for gaming.
Which is to be expected. It's clearly a compute-only part.
I believe RDNA3 is going to use an embedded bridge to connect the GCDs, rather than regular IF links as MI200 does. The silicon bridges in MI200 sit between the GCD and the HBM modules, which makes sense given the bandwidth required there. The same goes for RDNA3 between the GCDs.
 
Some of the numbers are getting downright nutty. If you look at the BF/FP16 matrix numbers, it's only a few more generations before we have to start measuring performance in PFLOPs.
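For a rough feel of what "a few more generations" could mean, here is a back-of-the-envelope sketch. The starting value is the 95 TF figure quoted earlier in the thread, used only as a placeholder since the BF/FP16 matrix numbers aren't listed here. The per-generation growth factors are pure assumptions, not anything AMD has stated, and the function name is made up for illustration.

```python
def generations_to_reach(start_tflops, target_tflops, growth_per_gen):
    """Count generations until throughput reaches the target.

    growth_per_gen is an assumed multiplier per generation (e.g. 2.0),
    not a vendor figure.
    """
    if growth_per_gen <= 1.0:
        raise ValueError("growth_per_gen must be greater than 1")
    generations = 0
    perf = start_tflops
    while perf < target_tflops:
        perf *= growth_per_gen
        generations += 1
    return generations


# 1 PFLOP = 1000 TFLOPs; 95 TF is the number quoted in the thread.
print(generations_to_reach(95, 1000, 2.0))  # assumed 2x per generation
print(generations_to_reach(95, 1000, 1.5))  # assumed 1.5x per generation
```

A higher starting number, like the matrix figures being discussed, would shorten the count accordingly.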
 
Another downright nutty figure is the TDP. In a few generations (maybe just one?) it'll be measured in kilowatts.
An unavoidable side effect of packing more and more silicon into the same package. It used to be impossible to jam this much silicon onto a package because of reticle limits, but MCM and other advanced packaging techniques eliminate that constraint. As long as perf/W and perf/socket increase, rising package power is of little consequence.
 
Meh, it is on an older node, and with 2 GPUs no less. My RTX 3090 peaks at around 420W, and can’t come close to these numbers, though the Instinct doesn’t use CUDA, so many would opt for NVIDIA anyway.
Isn't Ponte Vecchio actually rumored to be close to that?
I'm not saying it's AMD specific. It's an industry wide trend. Intel, Nvidia, AMD are all doing it in response to some demand.
 