Krteq
Senior member
- May 22, 2015
- 989
- 670
- 136
This one is interposer based only, it is like a high dandwidth version of EPYC Naples chiplets. The full 3D stacked SoIC versions will come with RDNA3 and MI300Obviously a CG representation, but looks like two dies, each having four HBM memory stacks. Not quite the chiplet design I had imagined. Then a question is, does each module show up as a single GPU?
That's for matrix ops. Even more crazy is that it has the same for double precision matrix ops. I wonder what workloads need such matrix throughput. Most probably not AI-type workloads.MI200 has 95TF of fp32 performance. 🤯
It is packed FP32 peak throughput.I think it's TF32, not FP32
![]()
Seems like it appears as 2 GPUs and isn't all that connected really. For sure wouldn't work for gaming.Then a question is, does each module show up as a single GPU?
Which is to be expected. Its clearly a compute only part.Seems like it appears as 2 GPUs and isn't all that connected really. For sure wouldn't work for gaming.
Seems like it appears as 2 GPUs and isn't all that connected really. For sure wouldn't work for gaming.
I believe RDNA3 is going to use an embedded bridge to connect the GCDs, and not using regular IF links like how MI200 does. The silicon bridges in MI200 are between the GCD and HBM modules, which makes sense given the bandwidth required there. Same goes for RDNA3 between the GCDs.Which is to be expected. Its clearly a compute only part.
Another downright nutty figure is the TDP. In a few generations (maybe just one?) it'll be measured in kilowatts.Some of the numbers are getting to the point of downright nutty. If you look at the BF/FP16 matrix numbers we're getting to the point where it's only a few more generations before we start having to measure the performance numbers in PFLOPs.
An unavoidable side effect of packing more and more silicon in the same package. It used to not be possible to jam this much silicon onto the package due to reticle limits, but MCM and other advanced packaging techniques eliminates that. As long as perf/W and perf/socket increases, increasing package power is of little consequence.Another downright nutty figure is the TDP. In a few generations (maybe just one?) it'll be measured in kilowatts.
Meh, it is on an older node, and with 2 GPUs no less. My RTX 3090 peaks at around 420W, and can’t come close to these numbers, though the Instinct doesn’t use CUDA, so many would opt for NVIDIA anyway.Another downright nutty figure is the TDP. In a few generations (maybe just one?) it'll be measured in kilowatts.
Ins't actually ponte vecchio rumored to be close to that?Another downright nutty figure is the TDP. In a few generations (maybe just one?) it'll be measured in kilowatts.
600W, allegedly. Water-cooled.Ins't actually ponte vecchio rumored to be close to that?
Meh, it is on an older node, and with 2 GPUs no less. My RTX 3090 peaks at around 420W, and can’t come close to these numbers, though the Instinct doesn’t use CUDA, so many would opt for NVIDIA anyway.
I'm not saying it's AMD specific. It's an industry wide trend. Intel, Nvidia, AMD are all doing it in response to some demand.Isn't actually ponte vecchio rumored to be close to that?
We've already been there with 3 and 4 way cf /sli.Another downright nutty figure is the TDP. In a few generations (maybe just one?) it'll be measured in kilowatts.
Thread starter | Similar threads | Forum | Replies | Date |
---|---|---|---|---|
![]() |
Question RTX 4000/RX 7000 price speculation thread | Graphics Cards | 501 |
Similar threads |
---|
Question RTX 4000/RX 7000 price speculation thread |