Question Zen 6 Speculation Thread

OneEng2 · Nov 28, 2025

StefanR5R said:
No, that's not what this post says. This post shows "TDP scaling (Cinebench 2024 MC)", plotted by Computerbase.

C i n e b e n c h

It's something quite special. Better to not make general statements based on data like that.

3D CPU rendering
Video encoding (CPU-heavy)
Scientific/M&E multi-thread workloads
Compilers
Anything that scales perfectly with threads and uses FP math heavily

I'll give it to you that this isn't the bulk of most day to day usage for the vast majority of x86 users, but that is another subject all together.

The post I have been debating is that some still believe that somehow Zen 6 24c will beat NVL 52c in these kinds of apps. To me, the math just doesn't make sense. Seems like NVL will win easily in these applications.

adroc_thurston · Nov 28, 2025

OneEng2 said:
The post I have been debating is that some still believe that somehow Zen 6 24c will beat NVL 52c in these kinds of apps

Yeah it will.

OneEng2 said:
Seems like NVL will win easily in these applications.

Yeah man and a 288c Atom pile on 18A should easily clap Turin-De-NOPE.

poke01 · Nov 28, 2025

I think the confidence comes from the fact that arctic wolf e core will be amazing.

It’s still an E core doesn’t matter if it has AVX10.2. No e core or small core ever is substitute for an actual P core like Zen6 or Coyote cove.

So for me it’s 16P cores+ 32 Cinebench accelerators and 4LPE cores that don’t belong in desktop.

I rather have 24 zen6 cores. You all getting caught up the number of cores but what matters is how good 1t. That’s it.

adroc_thurston · Nov 28, 2025

poke01 said:
I think the confidence comes from the fact that arctic wolf e core will be amazing

Indeed.
Unfortunately, 288 DKTs on 18A struggle to compete with 192c Z5dense on N3e so back to reality they go.

poke01 said:
It’s still an E core doesn’t matter if it has AVX10.2. No e core or small core ever is substitute for an actual P core like Zen6 or Coyote cove.

So for me it’s 16P cores+ 32 Cinebench accelerators and 4LPE cores that don’t belong in desktop.

Really not the problem.
Atoms are competent all-around cores with good area and horrible power.
They just kinda suck at power-limited nT.

Fjodor2001 · Nov 28, 2025

poke01 said:
You all getting caught up the number of cores but what matters is how good 1t. That’s it.

1t is all that matter for MT. Oookey.

MS_AT · Nov 28, 2025

Fjodor2001 said:
1t is all that matter for MT. Oookey.

Ever heard of https://en.wikipedia.org/wiki/Amdahl's_law ?

OneEng2 said:
Video encoding (CPU-heavy)

Has trouble scaling unless you split it into parts and encode in parallel. At least what I have read on x265's enthusiast forums.

OneEng2 said:
Compilers

Real build systems hit https://en.wikipedia.org/wiki/Amdahl's_law If you want proof, go over servethehome.com Linux compile tests over the years and how they changed the methodology. Or better yet, try to build chromium with varying number of workers on your own.

OneEng2 said:
Anything that scales perfectly with threads and uses FP math heavily

Shouldn't these end up on GPUs?

adroc_thurston · Nov 28, 2025

MS_AT said:
Shouldn't these end up on GPUs?

GPUs suck.
There's a ton of usecases for CPU SIMD.
Unfortunately for the guy, Zen *excels* at CPU SIMD.

Fjodor2001 · Nov 28, 2025

MS_AT said:
Ever heard of https://en.wikipedia.org/wiki/Amdahl's_law ?

No need to spam the thread with such basics again and again. It’s already been posted and discussed numerous times.

We’re talking about a 48T scenario here. So it’s the same situation for both Zen6 24C/48T and NVL-S 48C/T.

Joe NYC · Nov 28, 2025

Fjodor2001 said:
No need to spam the thread with such basics again and again. It’s already been posted and discussed numerous times.

We’re talking about a 48T scenario here. So it’s the same situation for both Zen6 24C/48T and NVL-S 48C/T.

In PCs (Personal Computers) gains from the threads at thrdr thread counts are flatlined, meaning you are getting practically no gains, and 2nd CCD (in both cases of Zen 6 and NVL) delivers almost nothing.

I am not sure why you are making this argument, in mostly irrelevant (flat) part of the Amdahl curve, where you could make an argument instead in comparing 1 CCD Zen 6 and NVL, where, in both cases additional threads still may deliver performance increment that is > 0.

What's the obsession with 48+ threads?

HurleyBird · Nov 28, 2025

OneEng2 said:
The post I have been debating is that some still believe that somehow Zen 6 24c will beat NVL 52c in these kinds of apps.

If NVL is 52c, Zen6 is 26c. If Zen6 is 24c, NVL is 48c.

adroc_thurston · Nov 28, 2025

Joe NYC said:
What's the obsession with 48+ threads?

People wanna see NVL winning something after the trainwreck that is ARL-S.

HurleyBird said:
If NVL is 52c, Zen6 is 26c. If Zen6 is 24c, NVL is 48c.

They're both 48T parts and that's how they stack up against each other.

Joe NYC · Nov 28, 2025

adroc_thurston said:
People wanna see NVL winning something after the trainwreck that is ARL-S.

I guess somewhere between "moral victory" and "pyrrhic victory".

In the meantime, Lisa Su's Thanksgiving turkey size seems to be scaling proportionally with AMD profits. She may have an ostrich for Thanksgiving in coming years.

https://twitter.com/x/status/1994194473484443710

MS_AT · Nov 28, 2025

Fjodor2001 said:
It’s already been posted and discussed numerous times.

And so many times you failed to understand why this is a problem😉

Fjodor2001 said:
We’re talking about a 48T scenario here. So it’s the same situation for both Zen6 24C/48T and NVL-S 48C/T.

It's not. When the Amdahl's law hits you, you basically care that the longest subtask that others are waiting for is running on fastest core. It's easier to achieve with homogeneous cores. For example most builds systems are not aware of hybrid CPUs and are trusting the OS and Thread director will do a good job. Spoiler alert, they sometimes fail miserably. I mean people at work explicitly disable E cores, so the compiles go faster... (keep in mind it's a dev work, which is different than CI work). Disclaimer we don't have Arrow Lake to test with, only Raptor and Meteor Lake. But Windows still have problem with handling that so many years after Alder Lake.

adroc_thurston said:
There's a ton of usecases for CPU SIMD.

I don't deny it. But the conditions he specified are perfect for GPUs. I mean if something scales perfectly with increasing thread count it's probably embarrassingly parallel workload with little communication between workers, and since it's pure number crunching little need for branch prediction.

StefanR5R · Nov 28, 2025

OneEng2 said:
3D CPU rendering

If we pick just this one as an example MT application area and look around different renderers (and different test scenes on top maybe), we observe that Cinebench is not giving the full picture of perf/W by CPU vendor x vs. CPU vendor y. Not at all.

DrMrLordX · Nov 28, 2025

adroc_thurston said:
Indeed.
Unfortunately, 288 DKTs on 18A struggle to compete with 192c Z5dense on N3e so back to reality they go.

The worst part about Clearwater Forest is that it isn't even available yet, while Turin-dense has been on the market for awhile now.

Joe NYC said:
In the meantime, Lisa Su's Thanksgiving turkey size seems to be scaling proportionally with AMD profits.

She hasn't been showing off huge rings recently, so maybe she has to compete with JHH's leather jackets using roast turkeys.

Fjodor2001 · Nov 29, 2025

Joe NYC said:
In PCs (Personal Computers) gains from the threads at thrdr thread counts are flatlined, meaning you are getting practically no gains, and 2nd CCD (in both cases of Zen 6 and NVL) delivers almost nothing.

I am not sure why you are making this argument, in mostly irrelevant (flat) part of the Amdahl curve, where you could make an argument instead in comparing 1 CCD Zen 6 and NVL, where, in both cases additional threads still may deliver performance increment that is > 0.

What's the obsession with 48+ threads?

The context was OneEng2’s statement that NVL-S 48C/T is likely to win over Zen6 24C/48T in 48T MT scenarios. So both CPUs will be executing 48T and thus they are at the same point on Amdahl’s curve, so throwing that curve into the discussion does not add anything for that scenario.

Fjodor2001 · Nov 29, 2025

MS_AT said:
And so many times you failed to understand why this is a problem😉

It's not. When the Amdahl's law hits you, you basically care that the longest subtask that others are waiting for is running on fastest core. It's easier to achieve with homogeneous cores. For example most builds systems are not aware of hybrid CPUs and are trusting the OS and Thread director will do a good job. Spoiler alert, they sometimes fail miserably. I mean people at work explicitly disable E cores, so the compiles go faster... (keep in mind it's a dev work, which is different than CI work). Disclaimer we don't have Arrow Lake to test with, only Raptor and Meteor Lake. But Windows still have problem with handling that so many years after Alder Lake.

I don't deny it. But the conditions he specified are perfect for GPUs. I mean if something scales perfectly with increasing thread count it's probably embarrassingly parallel workload with little communication between workers, and since it's pure number crunching little need for branch prediction.

See my previous post above. Also, you have similar problems with fast vs slow threads on Zen6, due to SMT and some tasks/threads executing faster or slower due to that.

Then we also have cases where multiple apps are executing in parallel. Does not have to be a single app using all 48T.

The scenario discussed was when all 48T are actually being used, regardless of how that is done.

Kryohi · Nov 29, 2025

MS_AT said:
the conditions he specified are perfect for GPUs. I mean if something scales perfectly with increasing thread count it's probably embarrassingly parallel workload with little communication between workers, and since it's pure number crunching little need for branch prediction.

Eh in the real world it often doesn't work like this.
Maybe for big companies, but otherwise
1. Porting software to use gpgpu is a PITA, only worth for big and very reusable stuff
2. Abysmal FP64 performance on modern GPUs if you need that
3. Who says code with a lot of branches must necessarily have a lot of communication between threads?
4. Often you need more cores because you have a lot of data to work on in parallel (e.g. a 3 hour long 4k video to encode vs a 10m 1080p one, or a bionformatics pipeline), not because the actual algorithms used are particularly parallelizzabile (see again e.g. x265)

A lot of CPU cores are useful for a lot of people, though I personally do not like at all P+E configurations, and I know for a fact people have had trouble with them with a couple of different and widely used scientific software.

MS_AT · Nov 29, 2025

Kryohi said:
1. Porting software to use gpgpu is a PITA, only worth for big and very reusable stuff

That is generally true but this particular case (massively parallel, heavy math) should be easier to port relative to other things.

Kryohi said:
Abysmal FP64 performance on modern GPUs if you need that

That's the case only for consumer GPUs and even then except for iGPUs I am not sure 9950x has higher FP64 performance than mid class consumer GPU. Especially if you factor in the massive memory BW disadvantage.

Kryohi said:
3. Who says code with a lot of branches must necessarily have a lot of communication between threads?

I don't know. If you reread my message it said code with lots of number crunching will not have a lot of branches. FFT kernels, matmul kernels. The only branches come from loops or you have done something wrong.

Kryohi said:
Often you need more cores because you have a lot of data to work on in parallel (e.g. a 3 hour long 4k video to encode vs a 10m 1080p one, or a bionformatics pipeline), not because the actual algorithms used are particularly parallelizzabile (see again e.g. x265)

Sorry, but I am not sure where this came from. Anyway I was not saying that people shouldn't get more cores if their workflow demands it. Just that in a lot of MT cases 1T perf still matters.

Kryohi said:
I know for a fact people have had trouble with them with a couple of different and widely used scientific software.

Add virtualization software to the list.

Fjodor2001 · Nov 29, 2025

Is anything known about whether there will be any NPU on Zen6 DT?

NVL-S is expected to have NPU6 @ 74 TOPS INT8. I assume one of the reasons is to comply with the Microsoft Copilot+ PC requirement of 40+ TOPS. So will Zen6 DT follow the same path, or be declared non-compliant with that requirement?

adroc_thurston · Nov 29, 2025

Fjodor2001 said:
So will Zen6 DT follow the same path, or be declared non-compliant with that requirement?

Who cares

511 · Nov 30, 2025

adroc_thurston said:
Who cares

AMD

DrMrLordX · Nov 30, 2025

Desktop chips don't need NPUs. The little green sticker is more likely to sell laptops than desktops.

biostud · Nov 30, 2025

DrMrLordX said:
Desktop chips don't need NPUs. The little green sticker is more likely to sell laptops than desktops.

I wonder how large a percentage of users make their decisions on whether a NPU is present in their laptop.

poke01 · Nov 30, 2025

biostud said:
I wonder how large a percentage of users make their decisions on whether a NPU is present in their laptop.

Nobody does but some people will think more TOPs means a better overall CPU/system which is not true

Question Zen 6 Speculation Thread

Golden Member

Diamond Member

Diamond Member

Diamond Member

Diamond Member

Senior member

Diamond Member

Diamond Member

Diamond Member

Platinum Member

Diamond Member

Diamond Member

Senior member

Elite Member

Lifer

Diamond Member

Diamond Member

Member

Senior member

Diamond Member

Diamond Member

Diamond Member

Lifer

Lifer

Diamond Member