Question Zen 6 Speculation Thread

Page 180

Joe NYC

Diamond Member
Jun 26, 2021
3,264
4,749
136
https://www.anandtech.com/show/16026/tsmc-teases-12-high-3d-stacked-silicon The technology AMD uses to 3D-stack L3, SoIC, has been shown to support 12-hi stacks since 2019 with test chips. It shouldn't come as a surprise that AMD has the ability in 2025/2026 to go above 1-hi stacks.

There must be some cost to validate > 1 layer of V-Cache, and past sales of V-Cache CPUs probably did not justify the investment to turn it into a product.

Fast forward from 2022 to 2025-level sales of V-Cache processors, and the underlying assumptions have changed. The best gaming CPUs are now synonymous with V-Cache and sell in high numbers.

In addition to high profit margins, V-Cache CPUs are free advertisement for AMD - a brand-building product. Now that Intel is going to be releasing bLLC, I think Lisa will want to smack it down really hard.

An additional layer would do just that, but it would do even more. It would show AMD's technological superiority: AMD can just slap on an extra layer, while Intel has to redesign the entire CPU chiplet every time it wants to give it more L3.
 
  • Like
Reactions: Tlh97 and Win2012R2

adroc_thurston

Diamond Member
Jul 2, 2023
6,038
8,526
106
There must be some cost to validate > 1 layer of V-Cache
They already validated 4-Hi stacks on Milan-X.
4 years ago.
Fast forward from 2022 to 2025-level sales of V-Cache processors, and the underlying assumptions have changed. The best gaming CPUs are now synonymous with V-Cache and sell in high numbers.

In addition to high profit margins, V-Cache CPUs are free advertisement for AMD - a brand-building product. Now that Intel is going to be releasing bLLC, I think Lisa will want to smack it down really hard.

An additional layer would do just that, but it would do even more. It would show AMD's technological superiority: AMD can just slap on an extra layer, while Intel has to redesign the entire CPU chiplet every time it wants to give it more L3.
I get that fatter L3 piles give you a raging stiffy, but the ROI of 2-Hi SoIC is zero.
 

Joe NYC

Diamond Member
Jun 26, 2021
3,264
4,749
136
But how do you implement it? There is a problem: if you want to use 3D V-Cache not only from the CPU but also from other units, you have to think about where to put it.

You can look at how it is implemented in Strix Halo. The Strix Halo compute chiplet has its own L3, and the MALL cache is programmed to prioritize GPU requests.

But it is a small MALL: 32 MB of SRAM, vs. a Zen 6 12-core chiplet with its own V-Cache, which would be 144 MB of SRAM.

L3 can't be moved away from the CPU cores and the ring bus. The latency penalty would be too high; it would defeat the purpose.
 

Joe NYC

Diamond Member
Jun 26, 2021
3,264
4,749
136
They already validated 4-Hi stacks on Milan-X.
4 years ago.

I get that fatter L3 piles give you a raging stiffy, but the ROI of 2-Hi SoIC is zero.

My points were:
- ROI in 2026 >> ROI in 2022
- free advertisement worth $100s of millions

Your point was valid in 2022, but it is probably no longer valid in 2026.
 

adroc_thurston

Diamond Member
Jul 2, 2023
6,038
8,526
106
My points were
You had none.
It's all imaginary reasons for personal feefees.
- ROI in 2026 >> ROI in 2022
Nope, games aren't more cache-local than they used to be.
- free advertisement worth $100s of millions
the what.
Your point was valid in 2022, but it is probably no longer valid in 2026.
It's valid forever, until games neatly fit their working set into a 240M-sized slab of L3 (which is never).
 

Joe NYC

Diamond Member
Jun 26, 2021
3,264
4,749
136
Nope, games aren't more cache-local than they used to be.

It's valid forever, until games neatly fit their working set into a 240M-sized slab of L3 (which is never).

It's about cache miss rate.

If the cache size goes up by a factor of 2.5x (240 MB vs. 96 MB), then the cache miss rate would go down to about 63% of the original miss rate (by the square-root rule of thumb, 1/√2.5 ≈ 0.63).

You don't have to fit the entire instruction / data set into L3 to realize the benefit. Any increase provides a benefit.

The only time there would be no benefit is if the game / task already fits into L3 - which is rarely the case.
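
As a quick sanity check on that 63% figure - a minimal sketch of the power-law / square-root rule of thumb; the 0.5 exponent is an assumption, real workloads vary:

```python
# Rule-of-thumb power law: miss_rate scales as cache_size^(-0.5).
# The 0.5 exponent is an assumption; real games deviate from it.
def relative_miss_rate(new_mb: float, old_mb: float, exponent: float = 0.5) -> float:
    return (new_mb / old_mb) ** -exponent

print(relative_miss_rate(240, 96))  # ~0.632, i.e. about 63% of the original miss rate
```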
 

adroc_thurston

Diamond Member
Jul 2, 2023
6,038
8,526
106
It's about cache miss rate.
Well duh.
If the cache size goes up by a factor of 2.5x (240 MB vs. 96 MB), then the cache miss rate would go down to about 63% of the original miss rate.
The point is that 96M is already enough for hot data, and mem hits won't stop until games fit entirely into cache (which they won't).
Any increase provides a benefit.
Very minor.
Again, you should stop.
2-hi V$ is just not happening.
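
To put a number on "very minor" - a back-of-envelope average-memory-access-time (AMAT) sketch; the latencies and the 10% baseline miss rate are purely illustrative assumptions, not figures from anywhere in this thread:

```python
# AMAT = L3_hit_time + L3_miss_rate * DRAM_penalty (cycles; illustrative values only)
L3_HIT, DRAM_PENALTY = 50, 400           # assumed latencies, not measured figures
for miss_rate in (0.10, 0.10 * 0.63):    # baseline vs. 2.5x the cache under the sqrt rule
    amat = L3_HIT + miss_rate * DRAM_PENALTY
    print(f"L3 miss rate {miss_rate:.3f} -> AMAT {amat:.1f} cycles")
# 90.0 vs. 75.2 cycles here: a real but modest gain if the hot set already fits in 96 MB
```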
 

adroc_thurston

Diamond Member
Jul 2, 2023
6,038
8,526
106
Building a brand, marketing.
the what.
You may not be familiar with them, or may not like them, but like it or not, they have a big impact on whether a CPU sells or not.
You have a big impact when you have the fastest gaming CPU.
People dgaf about how much L3 it has or if it comes with a free redacted pony.
 
Last edited by a moderator:
  • Like
Reactions: Io Magnesso

Joe NYC

Diamond Member
Jun 26, 2021
3,264
4,749
136
You have a big impact when you have the fastest gaming CPU.
People dgaf about how much L3 it has or if it comes with a free redacted pony.

That's what I am saying. Having the fastest CPU is worth $100 million in marketing / shilling / bribing of OEMs.

With that (fastest CPU, brand loyalty) you can also sell at a higher price. And you can also up-sell - to, say, 2-hi V-Cache.
 
  • Haha
Reactions: Io Magnesso

Io Magnesso

Senior member
Jun 12, 2025
562
148
71
You can look at how it is implemented in Strix Halo. The Strix Halo compute chiplet has its own L3, and the MALL cache is programmed to prioritize GPU requests.

But it is a small MALL: 32 MB of SRAM, vs. a Zen 6 12-core chiplet with its own V-Cache, which would be 144 MB of SRAM.

L3 can't be moved away from the CPU cores and the ring bus. The latency penalty would be too high; it would defeat the purpose.
What are you talking about?
You were the one who brought up Infinity Cache in the first place, right?
 
Last edited:

LightningZ71

Platinum Member
Mar 10, 2017
2,322
2,915
136
b) no one actually uses iGPUs.
The what?!?!

Are you smoking the good stuff and not sharing? Quick, someone rush out and tell AMD that they're doing the Z series all wrong! Berate them for continuing to expand the iGPU on all their mobile processors. Call them absolutely ignorant for developing the processors for the major consoles and the Steam Deck! Roast them all over the investor calls for the fortune they spent on their Halo line.

ALL of those things use variations on a theme: iGPUs.

Now, if you just mean their desktop processors with their puny "makes a monitor light up" iGPUs, then the majority of their customers STILL use those iGPUs. They just don't give an excrement about the performance.
 

adroc_thurston

Diamond Member
Jul 2, 2023
6,038
8,526
106
The what?!?!
yeah.
Quick, someone rush out and tell AMD that they're doing the Z series all wrong!
Rebrands of existing parts?
Berate them for continuing to expand the iGPU on all their mobile processors.
MDS1 literally has half the iGFX of Strix1.
Call them absolutely ignorant for developing the processors for the major consoles and the Steam Deck!
Consoles are a distinct market with millions of units.
Gabeboy uses MS Surface salvage.
ALL of those things use variations on a theme: iGPUs.
Which is getting smaller with Medusa. Because no one actually uses the GFX where it matters (in ultrathin laptops, including commercial).
 
  • Like
Reactions: Io Magnesso

LightningZ71

Platinum Member
Mar 10, 2017
2,322
2,915
136
Rebrands of existing parts?
That all use the iGPU at max performance...
MDS1 literally has half the iGFX of Strix1.
You know full well that the iGPU of STP1 was an oversized relic from when it had MALL cache and no NPU in the early design. 16CU is WAY too much for dual-channel DDR5.
Consoles are a distinct market with millions of units.
Gabeboy uses MS Surface salvage.
Which all rely on the iGPU
Which is getting smaller with Medusa. Because no one actually really uses the GFX (where it matters, in ultrathin laptops. Including commercial).
You mean right-sized for the rest of the chip? The iGPU still gets used regularly, often even in dGPU-equipped configurations when on battery.
 

adroc_thurston

Diamond Member
Jul 2, 2023
6,038
8,526
106
That all use the iGPU at max performance...
In a tiny irrelevant market.
You know full well that the iGPU of STP1 was an oversized relic of when it had MALL cache and no NPU in early design. 16CU is WAY too much for dual DDR5.
idk chief, 8CU is smaller than even 12CU in Phoenix.
We're so back!
Which all rely on the iGPU
Calling that an iGPU is very dishonest.
You mean right sized for the rest of the chip?
Smaller than everything shipped since 2022?
8CUs, completely and utterly castrated versus the LPDDR speeds they'll be shipping for MDS1.
Too bad!
Still uses the iGPU regularly, often even when configured with a dGPU when on battery.
You don't need more than 4CUs for that anyway.
 
  • Like
Reactions: SteinFG

HurleyBird

Platinum Member
Apr 22, 2003
2,800
1,528
136
There would still be the problem of inter-CCD latency that way.

I'm not thinking about gaming (although it would help those scenarios where the 9950X3D lags the 9800X3D), but more the awkwardness of having two sets of cores that sometimes have next to no performance difference, and other times an enormous difference.

The dual-CCD parts are for productivity and gaming, but the second CCD is basically wasted on gaming. The frequency hit is so minimal now that it would be nicer if both CCDs had the same performance profile.
 

soresu

Diamond Member
Dec 19, 2014
3,898
3,331
136
They've only improved it just about now, in 5.6 - most big game devs use much older versions, as it takes 5-7 years to make a game these days, and upgrading isn't trivial, so they will almost certainly ship on older versions. And frankly, 5.6 isn't exactly solving the problem completely; they only hope to achieve that in UE6 - so that's for games a decade from now.
It's a big shift starting at the lowest levels of the engine code.

Like the similar effort on Firefox/Gecko (Project Electrolysis), it takes time.

It's like bringing in new features: they start as experimental in one version -> beta in a later version -> and finally production in an even later version.

Only this effort is rewriting fundamental parts of the engine rather than just adding parts on, so it's going to affect (and potentially break) everything sitting on top of it, which is going to require an insane amount of testing by comparison.
 
Last edited:

MS_AT

Senior member
Jul 15, 2024
742
1,502
96
The dual-CCD parts are for productivity and gaming, but the second CCD is basically wasted on gaming. The frequency hit is so minimal now that it would be nicer if both CCDs had the same performance profile.
The second X3D CCD would be less than ideal for gaming unless engine developers took CPU topology into account and tried to avoid placing threads that often talk to each other on different CCDs (see the sketch below).
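
For illustration, a minimal Linux-only sketch of what "taking CPU topology into account" could look like - pinning two chatty threads to the same CCD with sched_setaffinity; the core ranges are hypothetical for a dual-CCD part:

```python
import os
import threading

# Hypothetical dual-CCD layout: cores 0-7 on CCD0, cores 8-15 on CCD1.
CCD0 = set(range(0, 8))

def worker(name: str) -> None:
    # Pin the calling thread (pid 0) to CCD0 so the two chatty
    # threads never pay the inter-CCD latency to reach each other.
    os.sched_setaffinity(0, CCD0)
    print(name, "restricted to CPUs", sorted(os.sched_getaffinity(0)))

# Two threads that talk to each other often -> keep them on one CCD.
threads = [threading.Thread(target=worker, args=(n,)) for n in ("producer", "consumer")]
for t in threads:
    t.start()
for t in threads:
    t.join()
```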
 
  • Like
Reactions: Io Magnesso