Discussion Intel current and future Lakes & Rapids thread

uzzi38 · Jun 3, 2021

IntelUser2000 said:
Well in CPUs with higher performance per clock, there's less chance for SMT to be better. So yea it makes sense.

How big is the difference in SMT gains between the two? That's the big question.

Tested by a friend in a few benchmarks

V4 and V5 are V-Ray 4.1 and 5.0

JoeRambo · Jun 3, 2021

uzzi38 said:
Tested by a friend in a few benchmarks

Great work, but I think CPUs with the same core count should be compared. The 3300X is a quad-core that has all the resources of the full chip to itself, while the 8 cores of the 5800X have twice as many cores sharing resources.

uzzi38 · Jun 3, 2021

JoeRambo said:
Great work, but I think CPUs with the same core count should be compared. The 3300X is a quad-core that has all the resources of the full chip to itself, while the 8 cores of the 5800X have twice as many cores sharing resources.

Testing was done in this manner to keep L3 per core the same. Both are valid ways of testing at the end of the day though tbf.
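
As a quick check of the "same L3 per core" point, here is a minimal sketch. It assumes the published cache sizes (16 MB L3 on the 4-core 3300X, 32 MB on the 8-core 5800X), which are not stated in the post itself.

```python
# L3 cache per core for the two CPUs compared above.
# Core counts and cache sizes are assumed from published specs,
# not taken from the thread.
cpus = {
    "3300X": {"cores": 4, "l3_mb": 16},
    "5800X": {"cores": 8, "l3_mb": 32},
}


def l3_per_core(spec):
    """Return the L3 capacity per core in MB."""
    return spec["l3_mb"] / spec["cores"]


for name, spec in cpus.items():
    print(f"{name}: {l3_per_core(spec):.1f} MB L3 per core")
```

Under those assumptions both parts land on the same per-core figure, which is the property the test setup was trying to hold constant.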

coercitiv · Jun 3, 2021

I wonder how ADL-S will handle MT loads (by default) once the first 8 threads are distributed among the big cores. Back when doing napkin performance estimates my immediate expectation was that it would distribute threads on the small cores before using SMT on the big cores.

This approach would likely maximize throughput, but does require some compensation mechanism to ensure latency-sensitive workloads stay on the big cores even at the cost of prematurely using SMT capability. Granted, I'm also assuming the small core clusters will have significantly higher inter-core latencies, which is still a mere assumption at this point.

On one side I find all this very interesting from a theoretical point of view; on the other side I find Apple's (rumored) approach with fewer small cores to be far easier to manage in terms of performance consistency. Then again it's nice to see so many different approaches clashing in the near future.

blckgrffn · Jun 3, 2021

DrMrLordX said:
Which node does Intel currently use for their off-die PCH? 22nm?

All I know offhand is that B360 was 14nm and the B365 "update" was really a change to 22nm.

I'll find the source later, not sure if that was Anandtech coverage or what. It was sort of a weird deal, because B365 is not a superset of B360: features were both lost and gained.

jpiniero · Jun 3, 2021

Most of the chipsets are 14 nm. The 22 nm ones are basically rebrands from the Skylake/Kabylake era.

eek2121 · Jun 3, 2021

blckgrffn said:
*ahem*

Intel thread.

That is all 😉

I avoided replying for this reason. 🤣

coercitiv said:
I wonder how ADL-S will handle MT loads (by default) once the first 8 threads are distributed among the big cores. Back when doing napkin performance estimates my immediate expectation was that it would distribute threads on the small cores before using SMT on the big cores.

This approach would likely maximize throughput, but does require some compensation mechanism to ensure latency-sensitive workloads stay on the big cores even at the cost of prematurely using SMT capability. Granted, I'm also assuming the small core clusters will have significantly higher inter-core latencies, which is still a mere assumption at this point.

On one side I find all this very interesting from a theoretical point of view; on the other side I find Apple's (rumored) approach with fewer small cores to be far easier to manage in terms of performance consistency. Then again it's nice to see so many different approaches clashing in the near future.

Microsoft needs to change the Windows scheduler to behave more like macOS: https://arstechnica.com/gadgets/202...-cpu-but-m1-macs-feel-even-faster-due-to-qos/

coercitiv · Jun 3, 2021

eek2121 said:
Microsoft needs to change the Windows scheduler to behave more like macOS: https://arstechnica.com/gadgets/202...-cpu-but-m1-macs-feel-even-faster-due-to-qos/

FYI

eek2121 · Jun 3, 2021

coercitiv said:
FYI
View attachment 45269

Will believe it when I see it.

Hulk · Jun 3, 2021

coercitiv said:
FYI
View attachment 45269

Where is that from? Is that comparing Rocket Lake to 8+8 Alder Lake? If so then that would roughly translate to Alder Lake 8+8 being as fast in some situations as the 5950X, or as we've been speculating around here somewhere between 5900X and 5950X performance.

From the Ars article linked. Interesting strategy.

"What makes the Apple M1 feel so fast isn't the fact that four of its cores are slower than the others—it's the operating system's willingness to sacrifice maximum throughput in favor of lower task latency."

jpiniero · Jun 3, 2021

eek2121 said:
Will believe it when I see it.

Discussed this before, I'm guessing it's comparing Rocket Lake T to the equivalent Alder Lake 8+8.

coercitiv · Jun 3, 2021

eek2121 said:
Will believe it when I see it.

Let me help you see it then:

Lakefield uses the Sunny Cove and Atom cores in concert. Foreground, high-priority tasks are given to the Sunny Cove core, while background tasks are handed off to the low-power Atom cores. The combination improves both power and graphics, Khushu said.
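
The foreground/background split described in that quote can be sketched as a toy routing function. This is only an illustration, not Intel's or Microsoft's actual scheduler logic: `route_tasks` and the core labels are hypothetical names, and the 1 big + 4 small layout is assumed from Lakefield's published configuration.

```python
from itertools import cycle


def route_tasks(tasks, big_core="big0",
                small_cores=("small0", "small1", "small2", "small3")):
    """Map (name, is_foreground) pairs to a core label.

    Foreground tasks go to the single big core; background tasks are
    spread round-robin over the small cores. Hypothetical toy model.
    """
    small = cycle(small_cores)
    return {
        name: (big_core if is_foreground else next(small))
        for name, is_foreground in tasks
    }


if __name__ == "__main__":
    demo = [("browser", True), ("indexer", False), ("updater", False)]
    for task, core in route_tasks(demo).items():
        print(f"{task} -> {core}")
```

A real scheduler would also weigh load, priority changes and power state; the sketch only shows the basic routing rule.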

dullard · Jun 3, 2021

eek2121 said:
Microsoft needs to change the Windows scheduler to behave more like macOS: https://arstechnica.com/gadgets/202...-cpu-but-m1-macs-feel-even-faster-due-to-qos/

Exactly. See my top 3 suggestions on how to make big Little good on desktops:

Discussion - Intel current and future Lakes & Rapids thread (page 387)

forums.anandtech.com

IntelUser2000 · Jun 3, 2021

coercitiv said:
FYI
View attachment 45269

That could always mean on mobile parts. I don't know how that's possible on desktop.

moinmoin · Jun 3, 2021

Hulk said:
"What makes the Apple M1 feel so fast isn't the fact that four of its cores are slower than the others—it's the operating system's willingness to sacrifice maximum throughput in favor of lower task latency."

That latter part of the quote reminds me of BeOS (superseded by Haiku).

coercitiv · Jun 4, 2021

IntelUser2000 said:
That could always mean on mobile parts. I don't know how that's possible on desktop.

That's why my original post was about the different requirements of the desktop platform; I was wondering how they would tackle the use of SMT on the big core vs. firing up a small core instead.

The point of my later replies containing slides related to Alder Lake and Lakefield was to show that Intel already has the software (and hardware) in place to prioritize foreground tasks (unless they've been lying about Lakefield all along). The part of the hybrid problem that they share with Apple is already addressed.

Moving away from that though, in high performance systems, Apple doesn't face the same problems as Intel when it comes to hybrid chips. First, they lack SMT, so their priority list is quite simple. Second, if the rumors are true, in performance chips they prioritize big core count over small core with a ratio of 4:1, so ensuring consistent performance won't be much of an issue, if any at all.

Thala · Jun 4, 2021

coercitiv said:
That's why my original post was about the different requirements of the desktop platform; I was wondering how they would tackle the use of SMT on the big core vs. firing up a small core instead.

That's not hard to predict, is it? For thread allocation: big core > small core > SMT.
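
That ordering can be written out as a small sketch. It assumes an 8+8 part with 2-way SMT on the big cores only; `placement_order` and `place` are hypothetical helper names, and this is not a description of how the Windows scheduler is implemented.

```python
def placement_order(big_cores, small_cores, smt_per_big=2):
    """List hardware-thread slots in the order they would be filled
    under a 'big core > small core > SMT sibling' policy.

    Each slot is a tuple (core_type, core_index, smt_thread_index).
    """
    # One thread per big core first.
    order = [("big", i, 0) for i in range(big_cores)]
    # Then one thread per small core.
    order += [("small", i, 0) for i in range(small_cores)]
    # SMT siblings on the big cores are used last.
    for t in range(1, smt_per_big):
        order += [("big", i, t) for i in range(big_cores)]
    return order


def place(n_threads, big_cores=8, small_cores=8):
    """Return the slots used by the first n_threads software threads."""
    return placement_order(big_cores, small_cores)[:n_threads]


if __name__ == "__main__":
    for slot in place(18):
        print(slot)
```

With these assumptions the 9th thread lands on the first small core, and SMT siblings are only touched from the 17th thread onward.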

Hulk · Jun 4, 2021

Intel also has to be concerned about comparisons to AMD, so while snappy response may be important to the end user, I would think tuning for benchmarks is also going to be high on the "to-do" list. I'm sure they're finding the optimum allocation of cores for Cinebench MT as I type this 😉

Zucker2k · Jun 4, 2021

About ADL scheduling:

Additionally, Intel and Microsoft are working closely to optimize Alder Lake CPU performance for an upcoming build of its Windows operating system which will bring massive scheduling upgrades & will also be coming out around the same time as the launch of Alder Lake chips. The first unveiling is expected on the 24th of June.

Intel 10nm Alder Lake Desktop CPUs Launching During Halloween 2021, Will Feature Support On New LGA 1700 Socket Motherboards

Intel's 12th Generation Alder Lake K-series desktop CPUs based on the 10nm Enhanced SuperFin architecture are expected to launch in October.

wccftech.com

https://twitter.com/x/status/1400367814440013826

dullard · Jun 4, 2021

Zucker2k said:
Additionally, Intel and Microsoft are working closely to optimize Alder Lake CPU performance for an upcoming build of its Windows operating system which will bring massive scheduling upgrades & will also be coming out around the same time as the launch of Alder Lake chips. The first unveiling is expected on the 24th of June.

Hmm, seems like this is in conflict with Thala's earlier assertion that the scheduler isn't being changed for Alder Lake. Note: I believe you Zucker2k.

Thala said:
Not sure why you are speculating. The heterogeneous Windows scheduler is already implemented and is used for every device which features heterogeneous core configurations.

Thala said:
Besides, even if this were the case, it would not change the scheduler at all...
You have to understand that this feature (mapping certain threads to a subset of available cores) has been supported since literally forever, and is not a new feature with respect to the heterogeneous scheduler discussed in this context.
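
For readers unfamiliar with the long-standing feature being referred to (restricting threads to a subset of cores, i.e. CPU affinity), here is a minimal sketch. It uses Python's `os.sched_setaffinity`, which is only available on Linux and some other Unix systems; Windows exposes the same idea through calls such as `SetThreadAffinityMask`. The helper name `pin_current_process` is hypothetical.

```python
import os


def pin_current_process(cores):
    """Restrict the calling process to the given set of CPU indices.

    Returns the resulting affinity set, or None on platforms where
    os.sched_setaffinity is not available (for example Windows).
    """
    if not hasattr(os, "sched_setaffinity"):
        return None
    os.sched_setaffinity(0, cores)  # pid 0 means the calling process
    return os.sched_getaffinity(0)


if __name__ == "__main__":
    if hasattr(os, "sched_getaffinity"):
        allowed = os.sched_getaffinity(0)
        # Pin to the lowest-numbered CPU currently allowed.
        print(pin_current_process({min(allowed)}))
    else:
        print("CPU affinity calls not available on this platform")
```

This only constrains where a process may run; it does not make the scheduler aware of which cores are big or small, which is the distinction the posts above are arguing about.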

uzzi38 · Jun 4, 2021

Zucker2k said:
About ADL scheduling:

Intel 10nm Alder Lake Desktop CPUs Launching During Halloween 2021, Will Feature Support On New LGA 1700 Socket Motherboards

Intel's 12th Generation Alder Lake K-series desktop CPUs based on the 10nm Enhanced SuperFin architecture are expected to launch in October.

wccftech.com

https://twitter.com/x/status/1400367814440013826

This is BS.

Hitman928 · Jun 4, 2021

uzzi38 said:
This is BS.

Any reasoning why?

Edit: Never mind, for some reason I thought the source was someone else. MLID holds no weight with me, personally.

dullard · Jun 4, 2021

uzzi38 said:
This is BS.

Is the idea of changing the scheduler BS (changing the scheduler for better performance of new chips is wrong), or is the content of the post BS (the scheduler change won't happen)? I wasn't sure which you were referring to.

Asterox · Jun 4, 2021

uzzi38 said:
This is BS.

Yes and no. Microsoft is still at least roughly doing something about it. In short, we should not expect miracles, though; it is still Windows and Microsoft.

uzzi38 · Jun 4, 2021

dullard said:
Is the idea of changing the scheduler BS, or is the content of the post BS? I wasn't sure which you were referring to.

The bit about changing the scheduler. ADL uses the same scheduling stuff as implemented for WoA, pretty much unmodified.
