Discussion Zen 5 Speculation (EPYC Turin and Strix Point/Granite Ridge - Ryzen 9000)

DisEnchantment · Sep 29, 2022

Speculate at will

Kryohi · Dec 11, 2024

Timmah! said:
Then the question should be why that's the case, if some other apps that utilize the 4090 can overflow to system RAM once the VRAM is filled. Naturally this tanks performance, but maybe that's still better than no performance at all?
Are LLMs latency-sensitive like games?

Afaik not to latency, but definitely to bandwidth. Besides small (<10B) models, no one uses system RAM for LLMs; once you offload some parts of a large model to it, performance tanks very quickly.
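
A rough way to see the bandwidth argument: if token generation is memory-bound, every generated token streams the model weights once, so the time per token is roughly the bytes read from each memory tier divided by that tier's bandwidth. The sketch below assumes exactly that. The function name and the example numbers (a 40 GB model, about 1000 GB/s for VRAM, about 100 GB/s for system RAM) are illustrative assumptions, not measurements.

```python
def tokens_per_second(vram_gb, ram_gb, vram_gbps, ram_gbps):
    """Upper-bound token rate for a memory-bound model split across two tiers.

    Assumes each token reads every weight once and that the two reads
    do not overlap. Real runtimes add compute and transfer overheads.
    """
    seconds_per_token = vram_gb / vram_gbps + ram_gb / ram_gbps
    return 1.0 / seconds_per_token


# Illustrative numbers only (assumptions, not benchmarks).
all_in_vram = tokens_per_second(40, 0, 1000, 100)
quarter_offloaded = tokens_per_second(30, 10, 1000, 100)

print(f"all in VRAM:       {all_in_vram:.1f} tok/s")
print(f"25% in system RAM: {quarter_offloaded:.1f} tok/s")
```

Under these assumptions, moving a quarter of the weights to system RAM cuts the token rate to less than a third, because the slow tier dominates the time per token.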

Markfw · Dec 11, 2024

wrong thread.

Timmah! · Dec 12, 2024

Kryohi said:
Afaik not to latency, but definitely to bandwidth. Besides small (<10B) models, no one uses system RAM for LLMs; once you offload some parts of a large model to it, performance tanks very quickly.

But it's system RAM that's gonna be used in the case of Halo, no?

FlameTail · Dec 12, 2024

Kraken Point appears in Geekbench

https://videocardz.com/newz/acer-swift-laptop-to-feature-8-core-ryzen-ai-7-350-amd-krackan-processor-faster-than-ryzen-7-8845hs

Joe NYC · Dec 12, 2024

FlameTail said:
Kraken Point appears in Geekbench

https://videocardz.com/newz/acer-swift-laptop-to-feature-8-core-ryzen-ai-7-350-amd-krackan-processor-faster-than-ryzen-7-8845hs

It looks like a very competitive chip for the mainstream market, for laptops less expensive than Strix Point and Lunar Lake ones.

Performance should be quite close to Strix Point in most typical ST-centric applications, and it looks like all 8 cores are on the same ring with 16MB L3, which would improve performance when running on 5c cores (due to larger L3), and the cost of switching to full cores should be lower. So overall power efficiency vs. Strix Point should go up.

Also, cheapest to manufacture (vs. Strix Point and Lunar Lake).

Josh128 · Dec 13, 2024

Joe NYC said:
and it looks like all 8 cores are on the same ring with 16MB L3, which would improve performance when running on 5c cores (due to larger L3), and the cost of switching to full cores should be lower. So overall power efficiency vs. Strix Point should go up.

Where do you deduce this from? I agree, it would help perf, but up until this point it didn't look that way. I still think 4C CCX + 4C CCX is more likely. That's what the Geekbench shows. L3 would not be unified between the two CCXs.

Kepler_L2 · Dec 13, 2024

Josh128 said:
Where do you deduce this from? I agree, it would help perf, but up until this point it didn't look that way. I still think 4C CCX + 4C CCX is more likely. That's what the Geekbench shows. L3 would not be unified between the two CCXs.

View attachment 113150

Geekbench shows Strix to have 16MB L3 (which is just the Zen5 CCX).

Joe NYC · Dec 13, 2024

Josh128 said:
Where do you deduce this from? I agree, it would help perf, but up until this point it didn't look that way. I still think 4C CCX + 4C CCX is more likely. That's what the Geekbench shows. L3 would not be unified between the two CCXs.

View attachment 113150

Just a speculation, based on the fact that AMD can do ring bus with 8 stops.

It would be wasteful if Kraken had 4x Zen 5c with a separate L3, and it would suck if the Zen 5c cores had no L3. So just connecting some dots (which may turn out to be wrong).

Josh128 · Dec 13, 2024

Kepler_L2 said:
Geekbench shows Strix to have 16MB L3 (which is just the Zen5 CCX).
View attachment 113154

Yes, but it shows two "Clusters", which is accurate. I'd be willing to bet Krackan's cluster setup is the same. High-latency comms between cores from differing clusters, just like Strix.

Josh128 · Dec 13, 2024

Joe NYC said:
Just a speculation, based on the fact that AMD can do ring bus with 8 stops.

It would be wasteful if Kraken had 4x Zen 5c with a separate L3, and it would suck if the Zen 5c cores had no L3. So just connecting some dots (which may turn out to be wrong).

Joe, this is AMD we are talking about, AND presumably their lowest profit SKU. They likely just took the Strix design and neutered the Zen 5C 8 core cluster down to 4, cut the GPU down, and otherwise retained the exact same design to save time and money.

jpiniero · Dec 13, 2024

adroc_thurston said:
No.
Just not gonna float in commercial et al.

AIOs with mobile parts seem to be pretty popular...

Joe NYC · Dec 13, 2024

Josh128 said:
Joe, this is AMD we are talking about, AND presumably their lowest profit SKU. They likely just took the Strix design and neutered the Zen 5C 8 core cluster down to 4, cut the GPU down, and otherwise retained the exact same design to save time and money.

It's a brand new die, and I think this will be a high volume die for Zen 5. So it would make sense, for area efficiency, to remove the 2nd ring bus and its L3 altogether, and then sell a very area-efficient CPU in order to achieve good margins.

So, some additional design cost in that, but lower die costs to offset it. AMD also took an extra 6 months for Krackan after Strix Point, which would also point in the direction that AMD took time to get this one right.

gdansk · Dec 13, 2024

It isn't small, apparently.
Rumor says it is slightly larger than Phoenix 1.

FlameTail · Dec 13, 2024

gdansk said:
It isn't small, apparently.
Rumor says it is slightly larger than Phoenix 1.

Which would make it bigger than the Hamoa die of X Elite.

AMD	Phoenix	178 mm²
Qualcomm	Hamoa	172 mm²
Qualcomm	Purwa	~130 mm²

Josh128 said:
Yes, but it shows two "Clusters", which is accurate. I'd be willing to bet Krackan's cluster setup is the same. High-latency comms between cores from differing clusters, just like Strix.

Geekbench reads that different cores are in different clusters, even if technically they are in the same cluster.

Joe NYC · Dec 13, 2024

If Strix Point is 232 mm2, Kraken can save maybe 15% of die area, so maybe ~198 mm2 ballpark?
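
The ballpark above is a single multiplication. A minimal sketch, where the 15% saving is the post's guess rather than a measured value, and the function name is just for illustration:

```python
STRIX_POINT_MM2 = 232   # die size quoted in the post
ASSUMED_SAVING = 0.15   # guessed fraction of area removed (assumption)


def scaled_area(base_mm2, saving):
    """Return the die area left after removing a fraction of the base area."""
    return base_mm2 * (1.0 - saving)


kraken_guess = scaled_area(STRIX_POINT_MM2, ASSUMED_SAVING)
print(f"~{kraken_guess:.0f} mm2")
```

That lands at roughly 197 mm2, in line with the ~198 mm2 ballpark.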

Abwx · Dec 13, 2024

Joe NYC said:
If Strix Point is 232 mm2, Kraken can save maybe 15% of die area, so maybe ~198 mm2 ballpark?

Smaller GPU than HPoint and also 4 Zen 5c cores, so it could eventually be smaller if it wasn't for the bigger NPU; we can expect a comparable 178mm2 size.

Joe NYC · Dec 13, 2024

Abwx said:
Smaller GPU than HPoint and also 4 Zen 5c cores, so it could eventually be smaller if it wasn't for the bigger NPU; we can expect a comparable 178mm2 size.

I don't know if this is the correct scale, but if you remove 8 MB L3 and 4 "c" cores, and also similar reduction to GPU, and if everything else stays the same, it would probably be around 15% reduction:

FlameTail · Dec 13, 2024

Strix Point dieshot from Nemez (GPUsAreMagic)

	Strix Point	Kraken Point
CPU	4 × Zen5 (16 MB L3) + 8 × Zen5C (8 MB L3), 59 mm²	4 × Zen5 + 4 × Zen5C (16 MB L3), 40 mm²
GPU	RDNA3.5, 8 WGP, 41 mm²	RDNA3.5, 4 WGP, 21 mm²
SoC	232 mm²	~190 mm²

*Leaked info
**Estimates
***Measured
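
The Kraken Point SoC figure can be sanity-checked by subtracting the CPU and GPU block savings from the Strix Point total. This is a sketch that assumes everything outside those two blocks (NPU, I/O, memory controllers) stays the same size, which the table does not confirm; the dictionary layout and function name are illustrative.

```python
# Areas in mm2, taken from the table above.
strix = {"cpu": 59, "gpu": 41, "soc": 232}
kraken = {"cpu": 40, "gpu": 21}


def estimate_soc(base, smaller):
    """Subtract per-block savings from the base SoC area.

    Assumes every block not listed in `smaller` is unchanged.
    """
    saved = sum(base[block] - smaller[block] for block in smaller)
    return base["soc"] - saved, saved


estimate, saved = estimate_soc(strix, kraken)
print(f"saved {saved} mm2 ({saved / strix['soc']:.1%}), estimate {estimate} mm2")
```

Under that assumption the CPU and GPU cuts remove 39 mm², about 17% of the die, giving 193 mm², which is consistent with the ~190 mm² in the table.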

LightningZ71 · Dec 14, 2024

I believe that we've seen a leak in the past pointing to Kraken having reduced PCIe and USB support as well. If they pulled or reduced some of the shoreline units, that could also reduce area.

SteinFG · Dec 14, 2024

It's gonna be a really awkward time for AMD - in 2025 dGPU designs will go with Ryzen 200 (hawk), and iGPU designs will go for Ryzen 300 (kraken). Considering their die sizes, the cost of Ryzen 200 8-core and Ryzen 300 8-core will be close.

DrMrLordX · Dec 14, 2024

SteinFG said:
It's gonna be a really awkward time for AMD - in 2025 dGPU designs will go with Ryzen 200 (hawk), and iGPU designs will go for Ryzen 300 (kraken). Considering their die sizes, the cost of Ryzen 200 8-core and Ryzen 300 8-core will be close.

They'll be happy selling units either way. It's up to the OEMs to decide what they want to deliver to customers.

JustViewing · Dec 15, 2024

Can't they reduce the font size a little? why force the e to overflow?

adroc_thurston · Dec 15, 2024

SteinFG said:
AMD - in 2025 dGPU designs will go with Ryzen 200 (hawk)

No they're KRK.

StefanR5R · Dec 15, 2024

JustViewing said:
Can't they reduce the font size a little? why force the e to overflow?
View attachment 113263

This was done on purpose, to highlight how tight the cache is for it to be shared between eight cores.

igor_kavinski · Dec 15, 2024

https://videocardz.com/pixel/msi-x870e-motherboards-now-support-up-to-192gb-of-ddr5-memory-at-6400-mt-s
