Discussion Intel current and future Lakes & Rapids thread

Page 600 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Exist50

Platinum Member
Aug 18, 2016
2,445
3,043
136
. . . uh oh.
Hmm, kind of weird since Intel has been pushing multiple dice on package (EMIB, FOVEROS) for quite a while. What the heck is going on with that company?! Gee, let's NOT use the packing tech that WE developed :rolleyes:
The story I've heard is that everyone was just called into the main conference room, and shown a presentation of basically "This is Meteor Lake now". Supposedly elicited a mixture of silence and chuckling.

For this reason, COVID-instigated hiring freezes, the addition of Arrow Lake, and Microsoft poaching half of Intel's best talent in Oregon, I consider myself a Meteor Lake sceptic even independent of any process issues.

And Ajay, right tool for the job. Advanced packaging is a useful tool, but doesn't beat monolithic in performance or power. And iirc, at the time, MTL was still more low power targeted. Though with process issues and a more flexible lineup, it might not be a bad idea in retrospect. But IDC will probably do things differently for Lunar Lake and whatever else they own.
 

Ajay

Lifer
Jan 8, 2001
15,332
7,792
136
And Ajay, right tool for the job. Advanced packaging is a useful tool, but doesn't beat monolithic in performance or power. And iirc, at the time, MTL was still more low power targeted. Though with process issues and a more flexible lineup, it might not be a bad idea in retrospect.

Forgot everyone has been saying MTL is going to be mobile only. So yeah, a chiplet strategy in that regards is kind of nuts.
 

IntelUser2000

Elite Member
Oct 14, 2003
8,686
3,785
136
What are the advantages of using compute tiles for desktop with lower core counts that would fit on a monolithic chip?

If Raptor desktop is 8+16 and Meteor is 8+16 what's the advantage of the compute tiles? It is simply production cost?

Mind you, if they go tiles, they'll most likely go all-in on tiles, because creating a monolithic die will be additional work and resources required. So they will not create a monolithic small die if they have a tile strategy.

Also interesting.

Raptorlake goes from 8 to 16 E cores for additional performance. So Meteorlake might stay at 8+16, but use new cores. Arrowlake uses same cores, but goes to 8+32. Lunarlake would be 8+32 but new cores? Can't guess about Nova.
 

Hulk

Diamond Member
Oct 9, 1999
4,191
1,975
136
1) Higher yields with tiles. Instead of needing to throw away an entire monolithic chip (total loss of that silicon area), you might have several good tiles and one bad tile in the same area (mostly usable silicon area).

2) One design problem doesn't hold up the whole generation. Suppose there is a problem with a new iGPU but the new CPU is performing great. With a monolithic chip, the thing can't ship. With tiles, you can use a previous iGPU combined with your new CPU. This eliminates the need for long delays and the expenses of backporting (like the 26.5 months between Comet Lake and Alder Lake and the necessary costs of creating Rocket Lake).

3) Flexibility. This is related to #2, but you essentially can ship products incrementally when they are ready rather than waiting for all new concepts to be perfected. The CPU team doesn't need to wait for the GPU team to be ready (and vise versa). They can launch what they have and move on to the next project. Being able to have your people work more independently gives your designs, planning, etc far more flexibility. Heck, Intel can even outsource some of the tiles to other companies for even more flexibility.

The drawbacks are of course higher latency, higher power, and/or more costly packaging.

Excellent explanation. Those reasons make perfect sense. Thanks.
 

Hulk

Diamond Member
Oct 9, 1999
4,191
1,975
136
Mind you, if they go tiles, they'll most likely go all-in on tiles, because creating a monolithic die will be additional work and resources required. So they will not create a monolithic small die if they have a tile strategy.

Also interesting.

Raptorlake goes from 8 to 16 E cores for additional performance. So Meteorlake might stay at 8+16, but use new cores. Arrowlake uses same cores, but goes to 8+32. Lunarlake would be 8+32 but new cores? Can't guess about Nova.

Having a 12700K currently I know for my usage 8+4 is not enough. The "8" is fine, but not the 4. I need more background app compute. I don't think 8+8 would get me where I want to be performance-wise either. 8+16, or even 8+12 would be fantastic. Don't get me wrong I love this 12700K, my 4770K was basically 0+4 in terms of ADL performance!

Right now I'm rendering some video in the background at 3-4fps. When I bring that app to the front the P's make it happen at 13-14fps.
 

coercitiv

Diamond Member
Jan 24, 2014
6,151
11,686
136
Stockholm syndrome at it's finest. Disable the E-cores already, have your system assign threads like it should to all the P-cores and enjoy one of the fastest consumer CPUs.

It's amazing how much performance you're leaving under the table just to maintain this illusion that Intel's approach for the scheduler was correct.
 

LightningZ71

Golden Member
Mar 10, 2017
1,627
1,898
136
Having a 12700K currently I know for my usage 8+4 is not enough. The "8" is fine, but not the 4. I need more background app compute. I don't think 8+8 would get me where I want to be performance-wise either. 8+16, or even 8+12 would be fantastic. Don't get me wrong I love this 12700K, my 4770K was basically 0+4 in terms of ADL performance!

Right now I'm rendering some video in the background at 3-4fps. When I bring that app to the front the P's make it happen at 13-14fps.

You'd be lucky to get 7fps in rendering with a +8 processor. That's not really going to improve your workflow much. You'd do better manually assigning priority or affinity to the rendering/processing task and ketting the e cores handle your foreground tasks.
 

Hulk

Diamond Member
Oct 9, 1999
4,191
1,975
136
Stockholm syndrome at it's finest. Disable the E-cores already, have your system assign threads like it should to all the P-cores and enjoy one of the fastest consumer CPUs.

It's amazing how much performance you're leaving under the table just to maintain this illusion that Intel's approach for the scheduler was correct.

I'll try it. Perhaps I have been brainwashed. For some reason I have this nagging thought that I'd be leaving some additional compute behind by disabling the E's.
 

eek2121

Platinum Member
Aug 2, 2005
2,904
3,906
136
I'll try it. Perhaps I have been brainwashed. For some reason I have this nagging thought that I'd be leaving some additional compute behind by disabling the E's.

try leaving it on the E cores, then do some other intense workload on the P cores.
 

jpiniero

Lifer
Oct 1, 2010
14,510
5,159
136
Been meaning to mention this but Icelake W is pretty much nonexistant, even several months after the "launch". There does seem to be retail places you can special order it and they will ship it indirectly from Intel but that doesn't sound exactly like high volume. From what I looked you can't really buy a OEM workstation with the regular Icelake Server either. Guess the demand is all Threadripper Pro now.

Of note on this, Apple was expected to release a minor update of the Mac Pro with Icelake W as the last x86 release. I wonder if Apple will just not bother if they can't get the chips.
 

IntelUser2000

Elite Member
Oct 14, 2003
8,686
3,785
136
Man, Alderlake mobile feels like more of a disappointment than I initially expected. The silence around battery life was deafening. Seriously not even one mention of it? While the competitor AMD gave some really, really good power numbers.

The E cores this generation is all about performance and performance/watt. Not battery life, don't get them confused. The problem is the -U Rembrandt is 25-30% faster than the predecessor. ADL will probably catch up to Cezanne, but not Rembrandt. So it'll have the MT performance, iGPU, and battery life leadership! Granted MT performance gap is closer than Cezanne vs. Tigerlake. But based on marketshare none really cared about Cezanne being pretty much an 8 core HT less version of Tigerlake in MT.

14K Cinebench R23 and 5K+ R20 in 28W is what Rembrandt is.

Raptorlake is what puts the focus back on battery life and fully closing the CPU performance gap. Not the iGPU part.

I'd really want a proper Lakefield successor though. Not because Lakefield was good, because it was piss poor. But because it had potential. Meteorlake-M is probably the proper Lakefield, and since ADL-M is nonexistent, that's one product that can come in H1 2023. MTL-M done properly is what can also allow them to enter the proper Tablet market, one they exited after Airmont in 2016.
 

JoeRambo

Golden Member
Jun 13, 2013
1,814
2,105
136
The E cores this generation is all about performance and performance/watt. Not battery life, don't get them confused.

Important part of battery life equation is waking cores and supporting uncore up ( L3 cache slices, ring etc) then doing the task required and going back to sleep. Race to the idle if You'd like.
So while hybrid is good at power efficiency @ peak performance, full 100% of cpu load rendering and compiling something, it falls flat in actual real world tasks for laptops like browsing web, office jobs, video, typing in some IDE. If you are not idle, you are keeping your ring/L3 active, your IMC busy and DRAM chips not in power saving mode. Need massive difference between joules spent on big/marketing cores to "win" here.

There is also software side, hybrid scheduling decisions cost extra power and rescheduling tasks between big/small cores is not free either. If task starts life on small cores and is moved to big core it is disaster from HW point of view. L1/L2 caches are obviuos, but modern CPUs have massive BPUs that train over millions of cycles and TLBs that also start empty on context switch to fresh core.
This does not matter when workload will run for seconds 100%, but in bursty workload it is wasting energy.
 

insertcarehere

Senior member
Jan 17, 2013
639
607
136
14K Cinebench R23 and 5K+ R20 in 28W is what Rembrandt is.

Is it? Because that's a whole lot more than '25-30%' faster that AMD themselves claim over the 5800u:
13.png
 

Hulk

Diamond Member
Oct 9, 1999
4,191
1,975
136
Is it? Because that's a whole lot more than '25-30%' faster that AMD themselves claim over the 5800u:
13.png

I think the big question is what frequencies ADL mobile will be able to sustain within various power/thermal envelopes?

If 2+8 can sustain 3GHz for P's and E's that would provide CB23 of ~8,850.
If that configuration can sustain 3.5GHz for P/E then that would attain ~10,300.
Even 2.5GHz P/E would score ~7,400, which substantially higher than TGL.

Another bit of information to keep in mind is that at equal frequencies a Gracemont cluster outperforms a Golden Cove core by nearly a 2:1 factor. Assuming approximate area, the E's can be clocked much lower than the P's and still be more efficient.

I think we are going to have to wait for actual tests. We just don't know how this silicon optimized for mobile is going to do when optimized for power at the extreme U mobile envelope. Intel has it work cut out for it determine optimum frequency and core distribution for various workloads. It's a much more nuanced problem than for the desktop since power and heat dissipation are much more limited.

I think we are going to be surprised by how well 2+8 performs in the ultra mobile space. I have a feeling that 2 P's, with 4 strong logical threads are going to be able to carry the bulk of the work for most applications with the E's handling the lighter thread work loads.

Furthermore Intel knew (knows) that mobile is ultimately the most important category so I believe they designed ADL around that in the first place.

We shall see soon enough.
 

IntelUser2000

Elite Member
Oct 14, 2003
8,686
3,785
136
The Cinebench numbers are bit high but I believe the 30% gen-on-gen is going to be true, because the base clock increases greatly, which suggests power handling has improved greatly.

Their iGPU will not be greatly behind, but still significant. Their presentation compared it to a Dell XPS 13 9310, which has a PL1 of 18W and underperforms in games a fair bit. That 3x gain in Doom Eternal makes no sense, since Iris Xe G7 is getting 40 and 60 fps right now. I believe the 9310 is getting somewhere around 25-30 fps.

The Witcher 3 gains are muted since the 9310 happens to perform exceptionally in that game.

Overall it'll probably end up being 30-50% faster in iGPU over a properly configured 28W, LPDDR4x-4266 Tigerlake.

@Hulk They'll probably do well on the 9W segment. And the 6+8 might be competitive.
 

mikk

Diamond Member
May 15, 2012
4,112
2,108
136
The Cinebench numbers are bit high but I believe the 30% gen-on-gen is going to be true, because the base clock increases greatly, which suggests power handling has improved greatly.


Once again, do you really compare 15W Cezanne against 28W Rembrandt? Really? If you think 30% is tested versus the same power your are wrong once again. AMD even admitted there is no 28W Cezanne U device in the market (no idea if true or a silly excuse) and apparently there is a 15W slide with less gains.
 
  • Like
Reactions: TESKATLIPOKA

uzzi38

Platinum Member
Oct 16, 2019
2,565
5,575
146
Once again, do you really compare 15W Cezanne against 28W Rembrandt? Really? If you think 30% is tested versus the same power your are wrong once again. AMD even admitted there is no 28W Cezanne U device in the market (no idea if true or a silly excuse) and apparently there is a 15W slide with less gains.
It's funny that you say that, when AMD's reference numbers last year on launch are 25W figures, not 15W figures.
f91a9a756cd42ffdc7bdfa01ac1909eb.jpg
 

Abwx

Lifer
Apr 2, 2011
10,847
3,297
136
Last edited:

IntelUser2000

Elite Member
Oct 14, 2003
8,686
3,785
136
Once again, do you really compare 15W Cezanne against 28W Rembrandt? Really? If you think 30% is tested versus the same power your are wrong once again. AMD even admitted there is no 28W Cezanne U device in the market (no idea if true or a silly excuse) and apparently there is a 15W slide with less gains.

Dude the scores are from CPUMonkey. I don't know where they are getting that from and I might have been bit hasty on the claims but Alderlake isn't the magic bullet for mobile.

Even if the CPU is closer than it looks they'll be behind on the battery life and iGPU department, and not by a small bit. AMD is claiming 30% battery life improvement gen-on-gen? That's an insane lead! Right now Tigerlake leads a bit on video playback and uses little more power in browsing. Video playback they were claiming 30-40% reduction or something which will get ahead of TGL.

The web browsing gains are really, really good. TGL is not that efficient. There's a notebookreview thread about optimizing battery life and you can get Cometlake-U to be lower power.

The 20-30% gain in battery life for RMB cuts the fundamental battery life differences between x86 and ARM laptops by about half. Great job on them! I thought I'd see that with Lakefield!
 

Thala

Golden Member
Nov 12, 2014
1,355
653
136
Actually there s designs that get to 15W after some time, CB R15 perfs can be extracted from the graphs below.

1924 pts@30W
1820 pts@25W
1420 pts@14W


Thanks for the link! Reading a bit more carefully it seems that at 14W the score drops to 1244 while for the 1402 score it is looking closer to 19-20W.
 

gdansk

Golden Member
Feb 8, 2011
1,980
2,355
136
I find it very unlikely Rembrandt will be ahead in MT. 28W 6+8 should win in most multi-thread benchmarks. In usability and perceived performance I have no idea. But it's basically designed for benchmarks and given the exact right PL2 that Intel knows it needs.
 
Last edited:
  • Like
Reactions: BorisTheBlade82