Discussion Intel Meteor, Arrow, Lunar & Panther Lakes + WCL Discussion Threads

Page 655 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Tigerick

Senior member
Apr 1, 2022
942
857
106
Wildcat Lake (WCL) Specs

Intel Wildcat Lake (WCL) is upcoming mobile SoC replacing Raptor Lake-U. WCL consists of 2 tiles: compute tile and PCD tile. It is true single die consists of CPU, GPU and NPU that is fabbed by 18-A process. Last time I checked, PCD tile is fabbed by TSMC N6 process. They are connected through UCIe, not D2D; a first from Intel. Expecting launching in Q1 2026.

Intel Raptor Lake UIntel Wildcat Lake 15W?Intel Lunar LakeIntel Panther Lake 4+0+4
Launch DateQ1-2024Q2-2026Q3-2024Q1-2026
ModelIntel 150UIntel Core 7Core Ultra 7 268VCore Ultra 7 365
Dies2223
NodeIntel 7 + ?Intel 18-A + TSMC N6TSMC N3B + N6Intel 18-A + Intel 3 + TSMC N6
CPU2 P-core + 8 E-cores2 P-core + 4 LP E-cores4 P-core + 4 LP E-cores4 P-core + 4 LP E-cores
Threads12688
Max Clock5.4 GHz?5 GHz4.8 GHz
L3 Cache12 MB12 MB12 MB
TDP15 - 55 W15 W ?17 - 37 W25 - 55 W
Memory128-bit LPDDR5-520064-bit LPDDR5128-bit LPDDR5x-8533128-bit LPDDR5x-7467
Size96 GB32 GB128 GB
Bandwidth136 GB/s
GPUIntel GraphicsIntel GraphicsArc 140VIntel Graphics
RTNoNoYESYES
EU / Xe96 EU2 Xe8 Xe4 Xe
Max Clock1.3 GHz?2 GHz2.5 GHz
NPUGNA 3.018 TOPS48 TOPS49 TOPS






PPT1.jpg
PPT2.jpg
PPT3.jpg



As Hot Chips 34 starting this week, Intel will unveil technical information of upcoming Meteor Lake (MTL) and Arrow Lake (ARL), new generation platform after Raptor Lake. Both MTL and ARL represent new direction which Intel will move to multiple chiplets and combine as one SoC platform.

MTL also represents new compute tile that based on Intel 4 process which is based on EUV lithography, a first from Intel. Intel expects to ship MTL mobile SoC in 2023.

ARL will come after MTL so Intel should be shipping it in 2024, that is what Intel roadmap is telling us. ARL compute tile will be manufactured by Intel 20A process, a first from Intel to use GAA transistors called RibbonFET.



LNL-MX.png
 

Attachments

  • PantherLake.png
    PantherLake.png
    283.5 KB · Views: 24,044
  • LNL.png
    LNL.png
    881.8 KB · Views: 25,531
  • INTEL-CORE-100-ULTRA-METEOR-LAKE-OFFCIAL-SLIDE-2.jpg
    INTEL-CORE-100-ULTRA-METEOR-LAKE-OFFCIAL-SLIDE-2.jpg
    181.4 KB · Views: 72,440
  • Clockspeed.png
    Clockspeed.png
    611.8 KB · Views: 72,327
Last edited:

jdubs03

Golden Member
Oct 1, 2013
1,498
1,090
136
Its on a better node so thats kind of given to be smaller. The performance improvements are pretty mundane as well.
Fair appraisal. That surely helps. But still if we’re just looking at core size and comparing to the M4, Oryon-L does look pretty good performance wise while being considerably smaller.
 

FlameTail

Diamond Member
Dec 15, 2021
4,384
2,762
106
wait, are these just the cores themselves. No L2?

If so how can 10-wide core with a 960 ROB be smaller than Lion cove and Zen5C.

@Kepler_L2 , whats going on here?

EDIT: Well it is on N3E
The source says private L2 caches are included into area.

M4-P : 3 mm² N3E
Lion Cove : 4.5 mm² N3B
Zen 5 : 4.15 mm² N4P

Lion Cove has a 2.5 MB private L2 cache. But I am pretty sure that even if we subtract the L2 area, Lion Cove will be bigger than M4 P-core. Speaks a lot about how bloated Intel's Cove designs are. (And Lion Cove is even on the slightly denser N3B node!)
 

cannedlake240

Senior member
Jul 4, 2024
247
138
76
The source says private L2 caches are included into area.

M4-P : 3 mm² N3E
Lion Cove : 4.5 mm² N3B
Zen 5 : 4.15 mm² N4P

Lion Cove has a 2.5 MB private L2 cache. But I am pretty sure that even if we subtract the L2 area, Lion Cove will be bigger than M4 P-core. Speaks a lot about how bloated Intel's Cove designs are. (And Lion Cove is even on the slightly denser N3B node!)
LNC large area probably has to do with higher clocks as well. It's 5.7Ghz vs 4.5 on M4
 
  • Like
Reactions: lightmanek

FlameTail

Diamond Member
Dec 15, 2021
4,384
2,762
106
Fun Fact: Lunar Lake has more cache than M4
M4LNL
CPU4P
4E
4P
4LPE
L0dN/A
N/A
48 KB × 4
N/A
L1d128 KB × 4
64 KB × 4
192 KB × 4
32 KB × 4
L1i192 KB × 4
128 KB × 4
64 KB × 4
64 KB × 4
L216 MB shared
4 MB shared
2.5 MB × 4
4 MB shared
L3N/A12 MB shared
SLC8 MB8 MB
Total 30 MB35.6 MB
 

FlameTail

Diamond Member
Dec 15, 2021
4,384
2,762
106
SoC area comparison (Measurements my own).

ScrewThatDog.jpg

Notes
- Lunar Lake and M4 are on 3nm, whereas X Elite and Strix Point are on 4nm. So areas are not directly comparable between them.
- All numbers are in mm²
- Cores area with asterisks (*) include the private L2 cache
- Lunar Lake SoC area is the N3B Compute Tile
- Apple M4 NPU area is suspiciously small, but I have double checked with their iPhone SoCs, and they also have ~5 mm² NPUs

Sources
-
Lunar Lake and Strix Point die shot annotations by Nemez
- M4 die shot annotation by Frederic Orange
- X Elite dieshot annotation by Piglin
 
Last edited:

Thunder 57

Diamond Member
Aug 19, 2007
4,294
7,100
136
Cb 2024 1T score? Also the first x86 core with traditional 8 wide decode

I'd love to see how much that 8 wide decode helps. AMD's 2x4 decode seems to be very much throughput first and I think that is why it disappointed many in gaming. I wonder how a mythical 6 wide decode would've done. Probably more balanced between client/server, but we know where the money is.

Maybe C&C will have an article on Lion Cove about it, if there isn't already.
 

DavidC1

Platinum Member
Dec 29, 2023
2,187
3,340
106
Now if only someone slaughetered a poor poor Turin-D ES to find out Z5c@N3e area.
You don't need to. We know SP6 sizes and AMD has a Turin-D shot.

I measured 1.9mm2 for 5c 3nm without L2 cache. It's only slightly smaller than N4. Likely 512-bit FP is the difference.

@poke01
It would be so hard for ARM even with Apple cores to be good at gaming. There is so much optimisation that goes into x86 in terms of software for games.
I'm talking about a theoretical x86 core that has M4 level performance.
 

DavidC1

Platinum Member
Dec 29, 2023
2,187
3,340
106
Yeah but the CCD has a really weird(tm) aspect ratio, making any non-dieshot measurements poo.
https://www.amd.com/content/dam/amd/en/images/products/data-centers/2909511-amd-zen5c-chip.png

Seems pretty good to measure with to me.
Cb 2024 1T score? Also the first x86 core with traditional 8 wide decode.
So the P core team spent too much resources and time on something that didn't bring them lot of performance. No balance in design. I bet they could have kept it 6-wide and got the same performance. Could have ended up 5-10% smaller in core and improve TTM.
 
  • Like
Reactions: Tlh97 and 511