Discussion Intel Meteor, Arrow, Lunar & Panther Lakes Discussion Threads

Page 852 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Tigerick

Senior member
Apr 1, 2022
806
766
106
PPT1.jpg
PPT2.jpg
PPT3.jpg



As Hot Chips 34 starting this week, Intel will unveil technical information of upcoming Meteor Lake (MTL) and Arrow Lake (ARL), new generation platform after Raptor Lake. Both MTL and ARL represent new direction which Intel will move to multiple chiplets and combine as one SoC platform.

MTL also represents new compute tile that based on Intel 4 process which is based on EUV lithography, a first from Intel. Intel expects to ship MTL mobile SoC in 2023.

ARL will come after MTL so Intel should be shipping it in 2024, that is what Intel roadmap is telling us. ARL compute tile will be manufactured by Intel 20A process, a first from Intel to use GAA transistors called RibbonFET.



LNL-MX.png

Intel Core Ultra 100 - Meteor Lake

INTEL-CORE-100-ULTRA-METEOR-LAKE-OFFCIAL-SLIDE-2.jpg

As mentioned by Tomshardware, TSMC will manufacture the I/O, SoC, and GPU tiles. That means Intel will manufacture only the CPU and Foveros tiles. (Notably, Intel calls the I/O tile an 'I/O Expander,' hence the IOE moniker.)



Clockspeed.png
 

Attachments

  • PantherLake.png
    PantherLake.png
    283.5 KB · Views: 24,025
  • LNL.png
    LNL.png
    881.8 KB · Views: 25,518
Last edited:

coercitiv

Diamond Member
Jan 24, 2014
7,290
17,215
136
I'd take this tweet with a grain of salt. The 17% number on the Intel slide is SpecIntRate, i.e. one specInt instance for every hardware thread. All the numbers in the tweet (except Darkmont) are specInt.
At the same time, it does provide us with a baseline for perf/clock uplift, and that baseline does not look particularly optimistic given the double digit difference we know exists between Skymont and Crestmont. I agree though that going for the RPC comparison is... not a great idea.

Ah, thought it was Skymont. So, Darkmont is < Skymont. Odd.
We would need to know relative SpecIntRate for Skymont/Crestmont in a similar server architecture, otherwise math goes sideways fast.
 
  • Like
Reactions: Covfefe

Covfefe

Member
Jul 23, 2025
26
40
46
At the same time, it does provide us with a baseline for perf/clock uplift, and that baseline does not look particularly optimistic given the double digit difference we know exists between Skymont and Crestmont. I agree though that going for the RPC comparison is... not a great idea.
I don't think it's useful even as a baseline estimate.Here's some SpecIntRate results for the 5 CPUs from the tweet.

SpecInt “ipc”​
SpecRate score​
copies​
score per copy​
clockrate (nominal)​
SpecRate “ipc”​
specintrate link​
Emerald Rapids(RPC)​
14.5/3.9 = 3.71​
1240​
256​
4.84​
1.9​
2.54​
link
Granite Rapids(RWC)​
13.6/3.9Ghz = 3.48​
1220​
256​
4.76​
2​
2.38​
link
Sierra Forest(SRF)​
9.54/3.0ghz = 3.18​
1410​
288​
4.89​
2.2​
2.22​
link
AMD EPYC 9654​
14.3/3.7 = 3.86​
1790​
384​
4.66​
2.4​
1.94​
link
AMD EPYC 9755​
18/4.1 = 4.39​
2720​
512​
5.31​
2.7​
1.96​
link

The SpecRate ipc I calculated here is obviously next to useless because the real world clockspeed could be miles apart from the nominal clockspeed. Despite that, I think the SpecRate score and score per copy is enough to completely discredit the tweet's comparison. SpecRateInt and SpecInt should not be directly compared.
 
Last edited:

desrever

Senior member
Nov 6, 2021
304
774
106
I don't think it's useful even as a baseline estimate.Here's some SpecIntRate results for the 5 CPUs from the tweet.

SpecInt “ipc”​
SpecRate score​
copies​
score per copy​
clockrate (nominal)​
SpecRate “ipc”​
specintrate link​
Emerald Rapids(RPC)​
14.5/3.9 = 3.71​
1240​
256​
4.84​
1.9​
2.54​
Granite Rapids(RWC)​
13.6/3.9Ghz = 3.48​
1220​
256​
4.76​
2​
2.38​
Sierra Forest(SRF)​
9.54/3.0ghz = 3.18​
1410​
288​
4.89​
2.2​
2.22​
AMD EPYC 9654​
14.3/3.7 = 3.86​
1790​
384​
4.66​
2.4​
1.94​
AMD EPYC 9755​
18/4.1 = 4.39​
2720​
512​
5.31​
2.7​
1.96​
You are comparing SMT scores and dividing on it???
 

coercitiv

Diamond Member
Jan 24, 2014
7,290
17,215
136
Yes, I know it's a bad comparison. That's kind of my whole point.
Let's look at a very crude "efficiency" metric, SpecRate "IPC" / SpecInt "IPC":

SpecInt "IPC"SpecRate "IPC""Efficiency"
Emerald Rapids(RPC)
3.71​
2.54​
0.68​
Granite Rapids(RWC)
3.48​
2.38​
0.68​
Sierra Forest(SRF)
3.18​
2.22​
0.70​
AMD EPYC 9654
3.86​
1.94​
0.50​
AMD EPYC 9755
4.39​
1.96​
0.45​

Seems to me like arches from the same family that use similar interconnect tend to have a degree of semblance when it comes to scaling from SpecRate to SpecInt. It's not enough data to draw a conclusion, but not a chaotic comparison either.
 

Doug S

Diamond Member
Feb 8, 2020
3,430
6,076
136
Am I reading those slides right or did Intel marketing write them stupidly?

Are there actually 12 separate CPU chiplets, mounted four at a time on three base tiles?? If so, holy chiplets Batman!
 

Doug S

Diamond Member
Feb 8, 2020
3,430
6,076
136
Wow my understanding of packaging economics must be out of date because I can't see how that can possibly be cost effective? Ditto dicing wafers into tiny little E core sized chiplets. I haven't followed what Intel is doing that closely (obviously) but this seems crazy to me.

I guess it is good if you have crappy 18A yields though...
 
  • Like
Reactions: Joe NYC

Josh128

Golden Member
Oct 14, 2022
1,152
1,734
106
18A CWF Darkmont is about 25% more cores in given area than N3E Turin D Z5C when excluding LLC.
Good enough. Seems it all comes down to clocks.
How does that jive with 18A claimed density vs N3E? I know its apples to oranges, but Im guessing Darkmont has significantly more transistors than Crestmont, so maybe ballparkish to Zen 5C?
 

gdansk

Diamond Member
Feb 8, 2011
4,420
7,427
136
How does that jive with 18A claimed density vs N3E? I know its apples to oranges, but Im guessing Darkmont has significantly more transistors than Crestmont, so maybe ballparkish to Zen 5C?
It's beyond apples and oranges. I can't say anything useful about the processes from measuring die shots.
 

Saylick

Diamond Member
Sep 10, 2012
3,967
9,270
136
Source link is broken.

Also, I wonder what the core-to-core latency plot will look like, specifically when one core needs to talk to another core on a separate die but on the same base tile and on a separate die but also on a separate base die.

Then again, the use case for a product like this doesn't really necessitate low core-to-core latency so... eh.
 
  • Like
Reactions: coercitiv

adroc_thurston

Diamond Member
Jul 2, 2023
6,346
8,947
106
Also, I wonder what the core-to-core latency plot will look like, specifically when one core needs to talk to another core on a separate die but on the same base tile and on a separate die but also on a separate base die.
Characteristically the same as GNR, where local L3 is slow but manageable and far L3 is pretty bad.
 
  • Like
Reactions: Kepler_L2

DavidC1

Golden Member
Dec 29, 2023
1,776
2,862
96
The SpecRate ipc I calculated here is obviously next to useless because the real world clockspeed could be miles apart from the nominal clockspeed. Despite that, I think the SpecRate score and score per copy is enough to completely discredit the tweet's comparison. SpecRateInt and SpecInt should not be directly compared.
The 17% is useful on the basis that we could take the SpecIntRate score from Sierra Forest, multiply by 1.9x for typical clock scaling with double the cores and do 1.17x on it. Which would be 3140 for Clearwater Forest.

I think they are doing this for two reasons:
-It hides how the final part performs
-Multi-thread performance is the most important metric for Clearwater Forest

The other issue in using Rate is that "IPC" for MT in comparison to SMT enabled cores is affected significantly by SMT. So if AMD benefits more from SMT, then the SpecRate "IPC" would look better for AMD versus if you used 1T SpecInt by a few %.

Actually, Darkmont is only few single digit % faster than Skymont, which will be easily eclipsed either way by SoC implementation. So the real important part for Clearwater is how the part performs against predecessor and competition(including Intel) in actual server workloads, and in SpecIntRate.

Sierra Forest needed Intel comparing a 144 core SRF against 128-thread AMD part. If it gets over 3000+ for 2P, then it can be directly compared against top 192 core AMD part. That wasn't good for Intel as they have to sell it as a perf/W and perf/$ and you need a high end part for more revenue.
 
Last edited:
  • Like
Reactions: Joe NYC