Question HBM Genoa

LightningZ71 · Nov 20, 2024

Who could forget Kid Icky?!?!

I do think that Nintendo does do a better job of making sure that their first part IP products aren't hot garbage on release. That's more than most of their competition these days.

Bigos · Nov 20, 2024

LightningZ71 said:
I do think that Nintendo does do a better job of making sure that their first part IP products aren't hot garbage on release.

Pokemon Scarlet/Violet has entered the channel.

LightningZ71 · Nov 20, 2024

Better... Not perfect...

StefanR5R · Jan 19, 2025

While this thread was meant to be about MI300C specifically, here is a closely related article:

Chips and Cheese, Inside the AMD Radeon Instinct MI300A's Giant Memory Subsystem, January 18, 2025

(How MI300A's Infinity Fabric is structured, and latency and bandwidth measurements. Edit, also touches on topics such as: pros and cons of memory-side cache compared to cache in core complexes; why Genoa crams many cores into dualsocket nodes rather than scaling up to quadsocket nodes; the noisy neighbor problem; SPEC 1T performance of Zen 4 in MI300A compared with Zen 4 desktop and Zen 2 desktop; MI300A's CPU–GPU memory sharing compared to some desktop and mobile implementations; Infinity Fabric as a tool to manage hardware design development complexity…)

Chester Lam said:
The Radeon Instinct MI300A’s memory subsystem may not be kind to its Zen 4 cores from a latency perspective. From the bandwidth side though, it’s an all-you-can-eat buffet where Infinity Fabric links between each CCD and the rest of the system is your plate. […] Unlike desktop Zen 4, hitting Infinity Fabric or DRAM bandwidth limits with the CPU cores is simply impossible.

Hmm, doesn't this make MI300C a somewhat unbalanced product?

DrMrLordX · Jan 19, 2025

@StefanR5R

Sounds like Zen5 would love the HBM hookup even more than Zen4.

igor_kavinski · Jan 19, 2025

StefanR5R said:
Hmm, doesn't this make MI300C a somewhat unbalanced product?

Balanced more towards AI. They sacrificed their Zen 4 cores to essentially act like Zen 2 cores in CPU performance just so they could have better AI performance.

igor_kavinski · Jan 19, 2025

DrMrLordX said:
Sounds like Zen5 would love the HBM hookup even more than Zen4.

The latency hit would kill any love Zen 5 may have for HBM's bandwidth.

DrMrLordX · Jan 19, 2025

igor_kavinski said:
The latency hit would kill any love Zen 5 may have for HBM's bandwidth.

No more than it does for Zen4.

StefanR5R · Jan 20, 2025

Chester Lam said:
Unlike desktop Zen 4, hitting Infinity Fabric or DRAM bandwidth limits with the CPU cores is simply impossible.

StefanR5R said:
Hmm, doesn't this make MI300C a somewhat unbalanced product?

AFAIU, the SERDES based interface between CCD and IOD was improved somewhat in Turin over Genoa, but not substantially. In MI300*A*, CPU bandwidth tests show the CCD 2 IOD interface to be the bottleneck before IOD x IOD IF and memory bandwidth. That's natural though for such a configuration in which the more bandwidth hungry part are the GPUs, not the CPUs. Yet I was wondering how well the raw memory bandwidth can be translated into actual performance in MI300*C*, given the CCD 2 IOD IF limitation (Genoa-style GMI wide with 2×32 B/cycle × bidirectional).

But I should have simply looked up the figures which were published so far:
– The aggregated HBM3 bandwidth on MI300A is 5.3 TB/s, according to Chips and Cheese.
– AMD claim MI300C's performance in STREAM Triad to be 6.9 TB/s even.
– Chips and Cheese measured MI300A's per-CCD performance with 71.5 GB/s read and 60.7 GB/s write with their own microbenchmarking software.
– If the 12 CCDs of MI300C performed the same, that would be 858 GB/s read and 728 GB/s write.
This doesn't make sense to me. What am I missing?
Or are the 5.3 and 6.9 TB/s for four sockets together, not for a single MI300?

Edit: Looking back at Microsoft's announcement of Azure HBv5 virtual machines, the 6.9 TB/s appear to be the sum of four sockets indeed. If so, this would match well with Chips and Cheese's MI300A measurements.

DrMrLordX said:
Sounds like Zen5 would love the HBM hookup even more than Zen4.

A hypothetical Zen 5 based MI300A and/or MI300C successor (with Zen 5's considerably increased vector arithmetic execution width over Zen 4) would apparently profit from some sort of upgrade of the CCD 2 IOD IF. The existing Zen 5 CCD may not be prepared for such an upgrade.

DZero · Jan 20, 2025

adroc_thurston said:
Only half of the problem. Nintendo also sits on a pile of dead IPs.
Remember Kid Icarus?

Nintendo is far worse in that matter. Only SEGA is chill with that.

StefanR5R · Jan 20, 2025

Remind me, which ones are the MI300 based consoles — Xbox, NES, or SEGA? :-P

DrMrLordX · Jan 20, 2025

StefanR5R said:
Remind me, which ones are the MI300 based consoles — Xbox, NES, or SEGA? :-P

Atari Jaguar 2!

igor_kavinski · Jan 21, 2025

DrMrLordX said:
Atari Jaguar 2!

Give me the console. Take my $500!

Search

Question HBM Genoa

LightningZ71

Platinum Member

Bigos

Senior member

LightningZ71

Platinum Member

StefanR5R

Elite Member

DrMrLordX

Lifer

igor_kavinski

Lifer

igor_kavinski

Lifer

DrMrLordX

Lifer

StefanR5R

Elite Member

DZero

Golden Member

StefanR5R

Elite Member

DrMrLordX

Lifer

igor_kavinski

Lifer

TRENDING THREADS