• We’re currently investigating an issue related to the forum theme and styling that is impacting page layout and visual formatting. The problem has been identified, and we are actively working on a resolution. There is no impact to user data or functionality, this is strictly a front-end display issue. We’ll post an update once the fix has been deployed. Thanks for your patience while we get this sorted.

Question Zen 6 Speculation Thread

Page 396 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.
PGO is fantastic for any software that runs heavy stuff: it helps compiler decide how to inline better and that can make huge difference vs function call.
You missed my point. You collect the profile on specific machine. The inlining decisions made for that specific machine might not carry over to another CPU. Due to cache sizes etc. So the benefit can be limited if not outright nullified. That's not a problem on console, where for each generation the CPU stays exactly the same. Of course it would be nice if the OS could perform something like BOLT for you on first launch, but well, people are too impatient for that😉
 
Actually doing a cleanup ISA break, just like aa64 did.
Not happening in x86 - ever.
You missed my point. You collect the profile on specific machine. The inlining decisions made for that specific machine might not carry over to another CPU.
PGO is driven by profile data (PG bit), not CPU features like cache size or ISA (that's different optimisation type), there is also Dynamic PGO that is used in .NET and I believe Java to do such optimisations depending on work load, so no static one off necessary.
 
PGO is driven by profile data (PG bit), not CPU features like cache size or ISA
Yes, but the CPU features are implicit dependency. You gather profile on CPU X, based on that toolchain will alter code layout branch hints whatever it can to extract most performance. Then you run on CPU Y, some of those optimization might not hold (for example smaller uOP cache might end up penalizing too big functions etc).
 
Re-coding an application and showing it runs better on Apple actually supports my "less baggage/newer ISA" advantage supposition.
brotha, spec2017 subtests are *ancient*.
What exactly they could have done and what benefit it would have brought?
Remember amd64? Gotta do that again. They just extended opcode space instead. yuck.
and serious breaking changes would mean you compete directly against ARM.
amd64 was already a "serious breaking change".
 
That support takes very few transistors, keeping it for the sake of 100% backwards compatibility is what made x86 successful, once you start cutting "old stuff" it's a slipper slope and will result in fragmentation, basically neither Intel nor AMD will do such madness.
You're seriously underestimating the cost of desgin and more the one of validation, and the impacts on how you can let an ISA progress with such dead 50 years old weight.

And do you really think removing 16-bit code will impact anyone? Also as I previously wrote even for 20-30 years old 32-bit binaries it's easier to install and run them on an emulator rather than trying to tweak Windows and locate old DLL that will likely fail. This whole "always support everything" is a bad joke.
 
A full op-mode would have been ideal, but that would have required HEAVY buy-in from MS. MS wants to do the absolute LEAST they can do to keep the gravy train running, and doing THAT lifting wasn't going to fly. You might have gotten better buy in from Linux, but, you're going to need to evangelize Torvalds first, and he's not a fan of that sort of change these days.
 
But I do see Apple CPU's not being so fast in Windows, right? I mean really this kind of perfectly illustrates my point.

The only way an Apple Silicon Mac can run Windows is under a VM. There are performance penalties for that, depending on what you're doing.

I'm not sure what numbers you're referencing with "I do see Apple CPU's not being so fast in Windows", but the proper comparison would be a Mac running Windows in a VM measured against a PC running Windows in a VM.
 
The only way an Apple Silicon Mac can run Windows is under a VM. There are performance penalties for that, depending on what you're doing.

I'm not sure what numbers you're referencing with "I do see Apple CPU's not being so fast in Windows", but the proper comparison would be a Mac running Windows in a VM measured against a PC running Windows in a VM.
I'm done with this topic until Apple release a x86 cpu. Meaning... I'm done.
 
Back
Top