Discussion Intel Meteor, Arrow, Lunar & Panther Lakes + WCL Discussion Threads

Page 337 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Tigerick

Senior member
Apr 1, 2022
941
857
106
Wildcat Lake (WCL) Specs

Intel Wildcat Lake (WCL) is upcoming mobile SoC replacing Raptor Lake-U. WCL consists of 2 tiles: compute tile and PCD tile. It is true single die consists of CPU, GPU and NPU that is fabbed by 18-A process. Last time I checked, PCD tile is fabbed by TSMC N6 process. They are connected through UCIe, not D2D; a first from Intel. Expecting launching in Q1 2026.

Intel Raptor Lake UIntel Wildcat Lake 15W?Intel Lunar LakeIntel Panther Lake 4+0+4
Launch DateQ1-2024Q2-2026Q3-2024Q1-2026
ModelIntel 150UIntel Core 7Core Ultra 7 268VCore Ultra 7 365
Dies2223
NodeIntel 7 + ?Intel 18-A + TSMC N6TSMC N3B + N6Intel 18-A + Intel 3 + TSMC N6
CPU2 P-core + 8 E-cores2 P-core + 4 LP E-cores4 P-core + 4 LP E-cores4 P-core + 4 LP E-cores
Threads12688
Max Clock5.4 GHz?5 GHz4.8 GHz
L3 Cache12 MB12 MB12 MB
TDP15 - 55 W15 W ?17 - 37 W25 - 55 W
Memory128-bit LPDDR5-520064-bit LPDDR5128-bit LPDDR5x-8533128-bit LPDDR5x-7467
Size96 GB32 GB128 GB
Bandwidth136 GB/s
GPUIntel GraphicsIntel GraphicsArc 140VIntel Graphics
RTNoNoYESYES
EU / Xe96 EU2 Xe8 Xe4 Xe
Max Clock1.3 GHz?2 GHz2.5 GHz
NPUGNA 3.018 TOPS48 TOPS49 TOPS






PPT1.jpg
PPT2.jpg
PPT3.jpg



As Hot Chips 34 starting this week, Intel will unveil technical information of upcoming Meteor Lake (MTL) and Arrow Lake (ARL), new generation platform after Raptor Lake. Both MTL and ARL represent new direction which Intel will move to multiple chiplets and combine as one SoC platform.

MTL also represents new compute tile that based on Intel 4 process which is based on EUV lithography, a first from Intel. Intel expects to ship MTL mobile SoC in 2023.

ARL will come after MTL so Intel should be shipping it in 2024, that is what Intel roadmap is telling us. ARL compute tile will be manufactured by Intel 20A process, a first from Intel to use GAA transistors called RibbonFET.



LNL-MX.png
 

Attachments

  • PantherLake.png
    PantherLake.png
    283.5 KB · Views: 24,043
  • LNL.png
    LNL.png
    881.8 KB · Views: 25,531
  • INTEL-CORE-100-ULTRA-METEOR-LAKE-OFFCIAL-SLIDE-2.jpg
    INTEL-CORE-100-ULTRA-METEOR-LAKE-OFFCIAL-SLIDE-2.jpg
    181.4 KB · Views: 72,439
  • Clockspeed.png
    Clockspeed.png
    611.8 KB · Views: 72,326
Last edited:

Wolverine2349

Senior member
Oct 9, 2022
532
185
86
The atom core is good. But some Intel users might anyway decide to shut them down to gain better P core OC and compatibility in some situation.

And if you care atom core enough, you should have known E core IPC is already quite good, compare Gracemont to Goldencove is just 16% lower on INT, and 38% lower on FP due to less FPU resources.

Guess why Intel compare Skymont to LP Crestmont in Meteorlake which lacks L3 cache.

View attachment 100431

Yes will be the thing to do if they have a more than 8 P cores on any model in single tile. Otherwise, a waste to do so and just get 9700X or 9800X3D for only 8 cores of similar performance. Unless you just love Intel and are ok with losing SMT with only 8 cores.
 

DavidC1

Platinum Member
Dec 29, 2023
2,160
3,305
106
Gracemont is around skylake IPC. Skymont is around Raptor Cove IPC, which is a jump of around 20-50% (specint int 20%, fp 50%) AND gets a clock boost.
What are you smoking? Sunny Cove is 18% faster than Skylake and Golden Cove is 19% faster than Sunny.

Skymont is 38% faster than Gracemont. This is a blowout increase alone, but there's also a 68% FP increase coming from the straight up doubling of FP performance, which will benefit vast majority of applications from beginning of the FP era, without needing to recompile like AVX512.

Skymont's FP unit is more capable than Golden Cove in
-64-bit FP: Literally everything
-128-bit FP: Everything from Core 2
-256-bit FP: Somewhat worse.

Assuming Int/FP split of 65/35%, that's a 50% gain. In 2022 it was Raptor Lake with Gracemont, and Golden Cove enhanced, and Zen 4.
 
  • Like
Reactions: Tlh97 and Hulk

Henry swagger

Senior member
Feb 9, 2022
513
313
106
What are you smoking? Sunny Cove is 18% faster than Skylake and Golden Cove is 19% faster than Sunny.

Skymont is 38% faster than Gracemont. This is a blowout increase alone, but there's also a 68% FP increase coming from the straight up doubling of FP performance, which will benefit vast majority of applications from beginning of the FP era, without needing to recompile like AVX512.

Skymont's FP unit is more capable than Golden Cove in
-64-bit FP: Literally everything
-128-bit FP: Everything from Core 2
-256-bit FP: Somewhat worse.

Assuming Int/FP split of 65/35%, that's a 50% gain. In 2022 it was Raptor Lake with Gracemont, and Golden Cove enhanced, and Zen 4.

In two years,
-P core Intel got 14% gain
-P core AMD got 16% gain
-E core Intel got 50% gain

Three years of 14%: 48%
Three years of 16%: 56%

Conclusion 1: Three times two year gains of P cores delivered in 2 years. Feast on that. I had a good feeling the E core team will make a breakout someday.
Conclulsion 2: 50% is Bulldozer to Zen level gains. Unlike Bulldozer though Gracemont was pretty good already.
Where is intel's e team based at ?
 

DavidC1

Platinum Member
Dec 29, 2023
2,160
3,305
106
In two years,
-P core Intel got 14% gains
-P core AMD got 16% gains
-E core Intel got 50% gains

Three years of 14%: 48%
Three years of 16%: 56%

Skylake was late 2015. So it took the Intel P core team 9 years to deliver 60% gains. E core team did it in two.

Conclusion 1: Three times two year gains of P cores delivered in 2 years. Feast on that. I had a good feeling the E core team will make a breakout someday.
Conclulsion 2: 50% is Bulldozer to Zen level gains. Unlike Bulldozer though Gracemont was pretty good already.
 

Saylick

Diamond Member
Sep 10, 2012
4,117
9,628
136
You can't read the slides either :)

The vs raptor cove is not Lunar Lake LP Skymont vs Raptor Cove, but an undisclosed (Arrow Lake) implementation where the Skymont cluster is on the ring bus (and shares L3 with P-cores)
Username checks out.

Point is, Intel is using a L3 cache for these Skymont comparisons.
 

DavidC1

Platinum Member
Dec 29, 2023
2,160
3,305
106
Yes.

But it isn't L5. The system level cache is mostly for power saving purposes by reducing distance of data access ala Apple Mx chips. The performance advantage of the system level cache won't be as significant as having L3 cache.
 

gdansk

Diamond Member
Feb 8, 2011
4,749
8,054
136
Yes.

But it isn't L5. The system level cache is mostly for power saving purposes by reducing distance of data access ala Apple Mx chips. The performance advantage of the system level cache won't be as significant as having L3 cache.
It can still access it though? Why is the nomenclature L# is not applicable.
 

DavidC1

Platinum Member
Dec 29, 2023
2,160
3,305
106
It can still access it though? Why is the nomenclature L# is not applicable.
Levels suggest that it has to always go through it before main memory, or the next level. It is called system wide cache for a reason. Such a broad cache will also not be as performant as a dedicated cache.
 

gdansk

Diamond Member
Feb 8, 2011
4,749
8,054
136
Levels suggest that it has to always go through it before main memory, or the next level. It is called system wide cache for a reason. Such a broad cache will also not be as performant as a dedicated cache.
So L4 is filled directly from memory and doesn't look in the last level cache first?
 
Jun 4, 2024
116
146
71
What are you smoking? Sunny Cove is 18% faster than Skylake and Golden Cove is 19% faster than Sunny.

Skymont is 38% faster than Gracemont. This is a blowout increase alone, but there's also a 68% FP increase coming from the straight up doubling of FP performance, which will benefit vast majority of applications from beginning of the FP era, without needing to recompile like AVX512.

Skymont's FP unit is more capable than Golden Cove in
-64-bit FP: Literally everything
-128-bit FP: Everything from Core 2
-256-bit FP: Somewhat worse.

Assuming Int/FP split of 65/35%, that's a 50% gain. In 2022 it was Raptor Lake with Gracemont, and Golden Cove enhanced, and Zen 4.

Taking my figures from this IPC comparison of Gracemont, which is thought to be around Skylake IPC and Golden Cove: https://www.techpowerup.com/298887/...n-4-and-golden-cove-spring-surprising-results. That being said, Intel has previously said Gracemont outperforms Skylake, which I forgot about.

You're right that Intel's stated IPC is 19% average over Cypress Cove, which is ~19% above Skylake, or around 35%. The numbers don't make sense, must be relying on different workloads for comparison.
 

DavidC1

Platinum Member
Dec 29, 2023
2,160
3,305
106
Taking my figures from this IPC comparison of Gracemont, which is known to be around Skylake IPC and Golden Cove: https://www.techpowerup.com/298887/...n-4-and-golden-cove-spring-surprising-results

That being said, Intel has previously said Gracemont outperforms Skylake, which I forgot about.

You're right that Intel's stated IPC is 19% average over Cypress Cove
Back in Gracemont days, Intel said it was approximately like Skylake without giving explicit figures. We found out that the FP is substantially slower(66% advantage for Golden Cove, meaning Skylake is likely 20% or so faster), and even in Integer it wasn't fully up to par.

Now they are saying it'll outright outperform Raptor Cove.
 
Jun 4, 2024
116
146
71
So in Lunar Lake the cache structure for the P core is as follows:
L1 (48K),
L2 (192K),
L3 (2.5MB),
L4 (12MB),
L5 (8MB)

Is that right?

Intel has stated that while L5/SLC is accessible to p-cores, in practice hit p-cores don't need/use L5, and keep in mind that this is shared between NPU and e-cores. More useful to think of Lunar Lake as 2 subsystems, P-cores with L1 - 4, E-cores with l1-3
 
Jun 4, 2024
116
146
71
Back in Gracemont days, Intel said it was approximately like Skylake without giving explicit figures. We found out that the FP is substantially slower(66% advantage for Golden Cove, meaning Skylake is likely 20% or so faster), and even in Integer it wasn't fully up to par.

Now they are saying it'll outright outperform Raptor Cove.
My point is that the the uplift over non-LP gracemont must be on the order of 20% for int and 50% for fp in Specint, adroc's fud notwithstanding.

Forgot to mention that intel state's on average LP gracemont IPC is down only ~5% with non-LP.
 

DavidC1

Platinum Member
Dec 29, 2023
2,160
3,305
106
Intel has stated that while L5/SLC is accessible to p-cores, in practice hit rate is low, and keep in mind that this is shared between NPU and e-cores. More useful to think of Lunar Lake as 2 subsystems, P-cores with L1 - 4, E-cores with l1-3
The idea is to put everything in the cache as much as possible. So without the system level cache, the caches are for the cores only. Now it has a dedicated one for all uncore and I/O. Sharing reduces performance and increases latency so all units sharing will be low performance*, but for saving power it'll do wonders.
 

DavidC1

Platinum Member
Dec 29, 2023
2,160
3,305
106
My point is that the the uplift over non-LP gracemont must be on the order of 20% for int and 50% for fp in Specint, adroc's fud notwithstanding.
Gracemont is actually slightly slower than Skylake in Int and something like 20% slower in FP if we take 66% Golden Cove and divide by 19% and 18%.

If it's 2% faster than Raptor Cove it'll be about 30% in Int.

Thoughts on Darkmont and Arctic Wolf:
-Darkmont is basically a Tick with few % improvements
-Arctic Wolf is the big one again with roughly 30% for Integer.
 
Jun 4, 2024
116
146
71
Gracemont is actually slightly slower than Skylake in Int and something like 20% slower in FP if we take 66% Golden Cove and divide by 19% and 18%.

If it's 2% faster than Raptor Cove it'll be about 30% in Int.
Mental math: order 20%=~30%. Raptor Cove vs Gracemont INT IPC in that techpowerup article is around 6.8 for Raptor Cove vs 5.53 for Gracemont 13th gen, 23%. Anyway, splitting hairs. My only point is that Adroc is throwing down some red herring cap by focusing on the fact that the comparison Intel drew was to LP cores.
 

DavidC1

Platinum Member
Dec 29, 2023
2,160
3,305
106
Lunar Lake does sound excellent. I had a feeling they could pull it off. I had a mixed feeling for both Lakefield and Meteorlake, though I always thought it was possible to do good in theory.

40% lower video playback power is 60% higher battery life. Now we're cooking.
 

ondma

Diamond Member
Mar 18, 2018
3,320
1,709
136
Dnt worry 8+32 model should take care of zen 5 and zen6
Depends on the use case, that is if we ever even see 8+32.. Should be great for embarrassingly parallel workloads, but more E cores wont help for gaming. Personally, I would rather see more P cores than continued spamming of E cores. If they feel the need, AMD has already stated they could increase core counts too.
 
Jun 4, 2024
116
146
71
Lunar Lake does sound excellent. I had a feeling they could pull it off. I had a mixed feeling for both Lakefield and Meteorlake, though I always thought it was possible to do good in theory.

40% lower video playback power is 60% higher battery life. Now we're cooking.
Yeah, this is the first time I've felt like Intel is executing since the early aughts. Fairchild/Intel was an innovation powerhouse. The beancounters almost sank them. This is a return to form, and we're seeing just the beginning of it. This is Intel's 2003 Dothan moment.
 

dullard

Elite Member
May 21, 2001
26,192
4,860
126
Depends on the use case, that is if we ever even see 8+32.. Should be great for embarrassingly parallel workloads, but more E cores wont help for gaming. Personally, I would rather see more P cores than continued spamming of E cores. If they feel the need, AMD has already stated they could increase core counts too.
P cores do well when fed massive amounts of power. If you just keep adding P cores, then within a given power budget, each subsequent P core gets less and less power. Meaning, very soon you get P cores with not enough power to work well. It could actually perform worse in games unless you get a CPU with a pitiful base clock and a massive power hungry turbo clock--but only turbo for the right number of cores to match your game.

If power use and heat dissipation weren't issues, then your desire would be great. But, in a real world situation, no chip can follow your dream path.

An army can't just double the number of soldiers, but give them no more food and no more supplies than an army half its size requires, and expect this starving army to perform well.
 
Last edited: