Win2012R2
> No, that one specifically sucks.

What's better? It's the only agreed game in town.
It's aa64 but Bad.
> What's better? It's the only agreed game in town.

Actually doing a cleanup ISA break, just like aa64 did.
> Intel says that iBOT currently requires extensive validation, a little over a quarter. They claim to be working on optimization for content creation software.

Wouldn't something like iBOT be very interesting to put into HW-accelerated IP in the CPU? iBOT seems to be some kind of high-level workflow arbiter.
> Wouldn't something like iBOT be very interesting to be put into HW accelerated IP in the CPU? IBOT seems to be some kind of high-level workflow arbiter.

You're 3 inches away from reinventing Transmeta.
> PGO is fantastic for any software that runs heavy stuff: it helps the compiler decide how to inline better, and that can make a huge difference vs a function call.

You missed my point. You collect the profile on a specific machine. The inlining decisions made for that specific machine might not carry over to another CPU, due to cache sizes etc. So the benefit can be limited, if not outright nullified. That's not a problem on a console, where for each generation the CPU stays exactly the same. Of course it would be nice if the OS could perform something like BOLT for you on first launch, but well, people are too impatient for that 😉
> Actually doing a cleanup ISA break, just like aa64 did.

Not happening in x86 - ever.
> You missed my point. You collect the profile on specific machine. The inlining decisions made for that specific machine might not carry over to another CPU.

PGO is driven by profile data (PG bit), not CPU features like cache size or ISA (that's a different optimisation type). There is also Dynamic PGO, used in .NET and I believe Java, to do such optimisations depending on the workload, so no static one-off is necessary.
> PGO is driven by profile data (PG bit), not CPU features like cache size or ISA

Yes, but the CPU features are an implicit dependency. You gather the profile on CPU X; based on that, the toolchain will alter code layout, branch hints, whatever it can to extract the most performance. Then you run on CPU Y, and some of those optimizations might not hold (for example, a smaller uOP cache might end up penalizing too-big functions, etc.).
> Not happening in x86 - ever.

Well, no, APX was a golden opportunity.
> Yes, but the CPU features are implicit dependency

That has got nothing to do with PGO, which is Profile Guided Optimisation; it is based solely on data captured during the profiling session.
> The facts are they have the best scores and the best PPW(A) in industry standard cross-platform tests.

Thank you for this. Re-coding an application and showing it runs better on Apple actually supports my "less baggage/newer ISA" advantage supposition.
> Well, no, APX was a golden opportunity.

What exactly could they have done, and what benefit would it have brought? Changing stuff in a small way won't give any meaningful benefits, and serious breaking changes would mean you compete directly against ARM.
> Re-coding an application and showing it runs better on Apple actually supports my "less baggage/newer ISA" advantage supposition.

brotha, spec2017 subtests are *ancient*.
> What exactly they could have done and what benefit it would have brought?

Remember amd64? Gotta do that again. They just extended opcode space instead. yuck.
> and serious breaking changes would mean you compete directly against ARM.

amd64 was already a "serious breaking change".
> brotha, spec2017 subtests are *ancient*.

hmm.... 2017-1979=38
> amd64 was already a "serious breaking change".

What exactly did it break in terms of backwards compatibility?
> hmm....2017-1979=38

Wut.
> What exactly did it break in terms of backwards compatibility?

A lot?
> A lot?

You can still run old 16/32 bit stuff on it, no?
amd64 addressing modes alone are uhhh.
> You can still run old 16/32 bit stuff on it, no?

Nothing stopped ARMv8 cores from running 32b stuff either.
> They just had a clean 64b ISA instead of what we have with APX.

Well, it's a bit too late for that; it's been 20+ years since amd64 got out, and ARM got 64 bits nearly 10 years later. Obviously it was easier to follow up with a better arch; nothing can be done about it.
> Well it's a bit too late for that, been 20+ years since amd64 got out, and ARM got 64-bits nearly 10 years later, obviously it was easier to follow up with better arch, nothing can be done about it.

You could've given APX a separate exec mode with a cleaned up opcode space.
> You could've given APX a separate exec mode with a cleaned up opcode space.

What exactly do you want to change there - a change to fixed rather than variable length opcodes?
> What exactly you want to change there - change to fixed rather than variable length opcodes?

Maybe.
> That support takes very few transistors; keeping it for the sake of 100% backwards compatibility is what made x86 successful. Once you start cutting "old stuff" it's a slippery slope and will result in fragmentation. Basically, neither Intel nor AMD will do such madness.

You're seriously underestimating the cost of design and, even more, that of validation, and the impact on how you can let an ISA progress with such dead 50-year-old weight.
But I do see Apple CPUs not being so fast in Windows, right? I mean, really, this kind of perfectly illustrates my point.
> The only way an Apple Silicon Mac can run Windows is under a VM. There are performance penalties for that, depending on what you're doing.

I'm done with this topic until Apple releases an x86 CPU. Meaning... I'm done.
I'm not sure what numbers you're referencing with "I do see Apple CPU's not being so fast in Windows", but the proper comparison would be a Mac running Windows in a VM measured against a PC running Windows in a VM.