Discussion Intel Meteor, Arrow, Lunar & Panther Lakes + WCL Discussion Threads

Tigerick · Aug 22, 2022

Wildcat Lake (WCL) Specs

Intel Wildcat Lake (WCL) is upcoming mobile SoC replacing Raptor Lake-U. WCL consists of 2 tiles: compute tile and PCD tile. It is true single die consists of CPU, GPU and NPU that is fabbed by 18-A process. Last time I checked, PCD tile is fabbed by TSMC N6 process. They are connected through UCIe, not D2D; a first from Intel. Expecting launching in Q1 2026.

	Intel Raptor Lake U	Intel Wildcat Lake 15W?	Intel Lunar Lake	Intel Panther Lake 4+4+4
Launch Date	Q1-2024	Q2-2026	Q3-2024	Q1-2026
Model	Intel 150U	Intel Core 7	Core Ultra 7 268V	Core Ultra 7 365
Dies	2	2	2	3
Node	Intel 7 + ?	Intel 18-A + TSMC N6	TSMC N3B + N6	Intel 18-A + Intel 3 + TSMC N6

CPU	2 P-core + 8 E-cores	2 P-core + 4 LP E-cores	4 P-core + 4 LP E-cores	4 P-core + 4 LP E-cores
Threads	12	6	8	8
Max Clock	5.4 GHz	?	5 GHz	4.8 GHz
L3 Cache	12 MB		12 MB	12 MB
TDP	15 - 55 W	15 W ?	17 - 37 W	25 - 55 W

Memory	128-bit LPDDR5-5200	64-bit LPDDR5	128-bit LPDDR5x-8533	128-bit LPDDR5x-7467
Size	96 GB		32 GB	128 GB
Bandwidth			136 GB/s

GPU	Intel Graphics	Intel Graphics	Arc 140V	Intel Graphics
RT	No	No	YES	YES
EU / Xe	96 EU	2 Xe	8 Xe	4 Xe
Max Clock	1.3 GHz	?	2 GHz	2.5 GHz

NPU	GNA 3.0	18 TOPS	48 TOPS	49 TOPS

As Hot Chips 34 starting this week, Intel will unveil technical information of upcoming Meteor Lake (MTL) and Arrow Lake (ARL), new generation platform after Raptor Lake. Both MTL and ARL represent new direction which Intel will move to multiple chiplets and combine as one SoC platform.

MTL also represents new compute tile that based on Intel 4 process which is based on EUV lithography, a first from Intel. Intel expects to ship MTL mobile SoC in 2023.

ARL will come after MTL so Intel should be shipping it in 2024, that is what Intel roadmap is telling us. ARL compute tile will be manufactured by Intel 20A process, a first from Intel to use GAA transistors called RibbonFET.

coercitiv · Nov 7, 2022

Hulk said:
First, Intel is probably noticing that many apps that only really rely on 8 cores, can do as well with 6 and if those 6 cores have better IPC than Raptor then it's all the better.

Hulk said:
Second, as we move into the future applications are getting better at MT so moving some more compute to the E's makes sense.

So, to recap:

Intel is seeing consumer apps don't scale well past 6 cores (hence lowering focus on P cores count)
Intel is seeing consumer apps are starting to scale well with more than 6 cores (hence doubling down on E core performance instead of keeping 8 P cores)

I find it very interesting that no matter the trend of the two conflicting above, the prescribed solution is always the same: lower P core count. Entropy sucks.

Geddagod · Nov 7, 2022

coercitiv said:
So, to recap:

Intel is seeing consumer apps don't scale well past 6 cores (hence lowering focus on P cores count)

Intel is seeing consumer apps are starting to scale well with more than 6 cores (hence doubling down on E core performance instead of keeping 8 P cores)

I find it very interesting that no matter the trend of the two conflicting above, the prescribed solution is always the same: lower P core count. Entropy sucks.

Intel is seeing low-latency high ST applications that don't scale well past 6 cores (hence lowering focus on P core count)
Intel is seeing highly parallel applications that do scale well past 6 cores (hence doubling down on e-cores)
And besides, we don't even know if MTL has only 6 cores, there's a reason it's a rumor...
AND meteor lake is primarily a mobile centered architecture, and since main stream high end mobile skus start end with 6 big cores + more little cores, Intel might have just not wanted to spend more money designing an extra die with more big cores, instead of just spamming little cores, or maybe they only had space for two "cores" on the die, and went for little cores instead of big cores.
AAAND it might not even matter as much since there were also leaks of MTL only being mid range, with a quick follow up of ARL high end.

poke01 · Nov 7, 2022

Geddagod said:
Intel is seeing low-latency high ST applications that don't scale well past 6 cores (hence lowering focus on P core count)
Intel is seeing highly parallel applications that do scale well past 6 cores (hence doubling down on e-cores)
And besides, we don't even know if MTL has only 6 cores, there's a reason it's a rumor...
AND meteor lake is primarily a mobile centered architecture, and since main stream high end mobile skus start end with 6 big cores + more little cores, Intel might have just not wanted to spend more money designing an extra die with more big cores, instead of just spamming little cores, or maybe they only had space for two "cores" on the die, and went for little cores instead of big cores.
AAAND it might not even matter as much since there were also leaks of MTL only being mid range, with a quick follow up of ARL high end.

coercitiv said:
So, to recap:

Intel is seeing consumer apps don't scale well past 6 cores (hence lowering focus on P cores count)

Intel is seeing consumer apps are starting to scale well with more than 6 cores (hence doubling down on E core performance instead of keeping 8 P cores)

I find it very interesting that no matter the trend of the two conflicting above, the prescribed solution is always the same: lower P core count. Entropy sucks.

It don't matter. Arrow Lake takes it back up to 8P cores. MTL is a test bench on desktop.

IntelUser2000 · Nov 7, 2022

@Exist50 The extra clocks covering 33% lack of cores are next to impossible unless you are expecting Meteorlake to clock 6GHz+, and all core Turbo at that! They aren't using magic to get those clocks, it's still mostly getting it from using bricks as heatsinks and water cooling being common.

nicalandia said:
I suspect that the MTL-S will have 6MiB per cluster, faster internal ring bus and we could see double digit IPC boosts on those e cores.

No it doesn't. It has the same 3MB per cluster as Raptorlake mont cores do.

Hulk said:
This feels similar to the situation with 10900K to 11900K if 6P core rumors are true for ML-S.

Well, no because MTL-S doesn't cover the high end and Arrowlake will.

Adding Hyperthreading further complicates the scheduling situation when they just got hybrid working properly. In addition to having lower performance in corner case scenarios when Es might be active, you'll have scenarios where it'll perform even worse because E's will be split into two?!

This is Intel's version of the AMD SMT4 rumor that never dies.

Remember, Atoms are pretty much Intel's version of what ARM does. None of them use SMT. Also when you start having many cores, SMT starts becoming redundant in the client space.

BorisTheBlade82 said:
When I first heard of SPR my initial impression was that Intel might turn around DC with it. And in 2020 that might have worked. But Milan and Genoa show the superiority of AMD's approach and made Intel rethink theirs as well.

They might have been able to pull it off, if it wasn't for the fact that the entire server validation team was sacked. Basically they go through the process to deal with issues that the early prototypes have and run testing. These people can't be pulled off fresh out of universities - you need decades of experience for that. Also teams need to be able to work together and build cadence and have the right people on top of that and so on. So basically the whole divison had to be rebuilt I guess?

Some are calling for firing of Riviera(head of DC) that came in just few months ago, etc, etc. It's crazy! That kind of thinking is what made things fail in the first place. It's smash your head into the wall stupid. Gelsinger said the server roadmap is "what it is" until at least Emerald Rapids, and even Granite Rapids has aspects that they can't use the new improved methodology on.

moinmoin · Nov 7, 2022

IntelUser2000 said:
Remember, Atoms are pretty much Intel's version of what ARM does. None of them use SMT.

There is Cortex-A65 which actually does. Seems to be a one-off so far though.

Cortex-A65 | Multithreaded CPU with High Throughput and 8 Cores

Arm Cortex-A65 is the first multithreaded Cortex-A DynamIQ CPU, offering high throughput with dual-thread processing and up to eight cores, for sensor fusion and vision-based systems.

www.arm.com

Hulk · Nov 7, 2022

coercitiv said:
So, to recap:

Intel is seeing consumer apps don't scale well past 6 cores (hence lowering focus on P cores count)

Intel is seeing consumer apps are starting to scale well with more than 6 cores (hence doubling down on E core performance instead of keeping 8 P cores)

I find it very interesting that no matter the trend of the two conflicting above, the prescribed solution is always the same: lower P core count. Entropy sucks.

No you missed the point. No conflict. Two separate cases.
There is some nuance. Perhaps I should have spelled it out better.

1. Apps that don't scale well past 6 cores, 6 stronger P's might be enough. This is becoming a smaller and smaller category as Intel sees it. These apps may not do any better with 8P's so those resources are better spent on more E's for apps that do scale well.

2. For the ever growing highly MT optimized apps the E's generate more computer per square mm.

The trend is Intel is predicting is that we are moving in to a MT software dominated landscape and as that occurs they can reduce the P count and increase the E count because as noted above the E's provide more compute per square mm.

That's just my guess on Intel's strategy.

If yours is different I'd love to hear it?

igor_kavinski · Nov 7, 2022

One cool thing possible with a "sea of low power cores" is executing instructions ahead of time before they are actually needed by the user and storing their results in a cache shared by both P and E cores. This way all the cores can be doing something all the time and instant results available for complex and time consuming instruction sequences for the cost of cache lookup. The P-cores would then just execute those instructions that would be completed in less time than it would take to look them up in the cache. The power consumed should be less than if it was just the P-cores burning power.

For this approach to work successfully, there would need to be tighter coupling between the P and E cores. Not like currently how the E cores seem to be tacked onto the die. A future design may involve each P core surrounded by a cluster of E cores to minimize communication latency and all of them having a common cache. It would be exciting if AMD comes up with this approach in Zen 5.

poke01 · Nov 7, 2022

Hulk said:
If yours is different I'd love to hear it?

Yes, mine is all you said does not make sense as Intel is going back to 8 P cores in Arrow Lake.

Your response is just idealistic to support whatever Intel does

nicalandia · Nov 7, 2022

IntelUser2000 said:
No it doesn't. It has the same 3MB per cluster as Raptorlake mont cores do.

I believe you are mistaken.

A single OG Gracemont Cluster has 2MiB of Shared L2$.
An enhanced Gracemont Cluster(A.K.A Raptormont) has 4MiB shared L2$

MTL-H(The only die we have seen so far) has the next evolution of the e cores, called Crestmont(design wise they look identical TBH), the ones we have seen show to have only 3MiB per cluster, I suspect that is because it's a good compromise between energy efficiency and performance. I fully expect that MTL-S the desktop counter part will have a huge 6MiB of L2$ Per cluster and fully expect double digit IPC gains from OG Gracemont, bringing those e cores to Sunny Cove IPC performance level and leaving Skylake behind.

LightningZ71 · Nov 7, 2022

I would like to point out that, in the most recent round of processor releases, the benchmarks clearly show that, when normalized for achievable clock speeds, the 6 core processors are within 1-2% of the performance of the 8 core processors in the majority of gaming benchmarks. It is often the case that the intel 8 core processors show a bigger difference between their 6 core and 8 core processor performance because the L3 cache increases with the extra cores for most of their parts. This is, overall, for both Intel AND AMD processors. There are a few, like the latest Spiderman game, that have a solid use case for extra threads (pre-compiled shaders on the e-cores) and the benchmarks show it, but, they are definite outliers. When you look at productivity benchmarks, it's a very similar outcome. There are, absolutely, cases where lots of available threads makes a BIG difference, especially in certain professional content creation and editing applications. The people that use those applications know who they are, and Puget does a good job of showing those cases.

A 6-P-core meteor lake that has an improved memory system and improved process for the CPU die will likely outperform 8-P-core Raptor lake processors at similar power limits in most gaming benchmarks that aren't heavily dependent on L3 cache amounts. Those extra 4 threads are rarely used, and if they are, they aren't running any of the time critical threads that heavily affect frame times. There WILL be outliers. This doesn't disprove the premise TODAY.

I am not saying this to support Intel going back to 6 P cores for meteor lake. I am saying this because it's just the facts of the matter. If 6 cores is all they can pull off right now, that's on them, but it's not going to affect games much.

nicalandia · Nov 7, 2022

LightningZ71 said:
I am not saying this to support Intel going back to 6 P cores for meteor lake. I am saying this because it's just the facts of the matter. If 6 cores is all they can pull off right now, that's on them, but it's not going to affect games much.

I can guarantee you that 6 Redwood Cove cores will outperform 8 Raptor Cove cores in gaming.

S'renne · Nov 7, 2022

Btw- errr is there any news on Meteor Lake to Arrow Lake for the iGPU?

coercitiv · Nov 7, 2022

Hulk said:
That's just my guess on Intel's strategy.

If yours is different I'd love to hear it?

My take is Intel used the Atom cores as a crutch on the desktop after allowing AMD to dictate the narrative in the consumer space. It's a reactionary move, probably borrowing from a coherent plan on their mobile side, and therefore lacks the strategic depth people tend to assign it. Once tiles are fully embedded in Intel's consumer line, alleviating the die area problems they've been having, we'll finally get a sense of where Intel is heading with P core / E core balance for desktops.

Geddagod · Nov 7, 2022

coercitiv said:
My take is Intel used the Atom cores as a crutch on the desktop after allowing AMD to dictate the narrative in the consumer space. It's a reactionary move, probably borrowing from a coherent plan on their mobile side, and therefore lacks the strategic depth people tend to assign it. Once tiles are fully embedded in Intel's consumer line, alleviating the die area problems they've been having, we'll finally get a sense of where Intel is heading with P core / E core balance for desktops.

Reactionary or not, it does have a "strategic" depth of increasing MT perf/area.
Even if Intel has tiles fully working, why not have increased MT perf/area as a "cherry on top"?
Afterall, Intel would already be developing little cores regardless, at least as a competitor for future bergamo based AMD products

moinmoin · Nov 7, 2022

Geddagod said:
Afterall, Intel would already be developing little cores regardless, at least as a competitor for future bergamo based AMD products

If Intel wants to counter Bergamo with E cores there is a lot to catch up to though. I'd think Intel should better have spent the time to make P cores more area efficient. As is E cores are barely competitive in INT workloads, and worlds behind in FP ones.

Geddagod · Nov 7, 2022

moinmoin said:
If Intel wants to counter Bergamo with E cores there is a lot to catch up to though. I'd think Intel should better have spent the time to make P cores more area efficient. As is E cores are barely competitive in INT workloads, and worlds behind in FP ones.

I'm actually curious about this. As soon as I'm done with my college apps I might just do a bunch of napkin math to see how bad it really is haha, unless you know someone who already has done something similar to this?

DrMrLordX · Nov 7, 2022

moinmoin said:
I'd think Intel should better have spent the time to make P cores more area efficient.

It looks like we'll see some of that with Meteor Lake. Maybe.

poke01 · Nov 7, 2022

DrMrLordX said:
It looks like we'll see some of that with Meteor Lake. Maybe.

More like Arrow Lake

DrMrLordX · Nov 7, 2022

poke01 said:
More like Arrow Lake

Locuza's pics of Redwood Cove make it look like they're at least enjoying some improvements to area efficiency just from the process shrink. It remains to be seen how many layout improvements there are as well that contribute to the smaller silicon area for a 6+8 config.

nicalandia · Nov 7, 2022

DrMrLordX said:
Locuza's pics of Redwood Cove make it look like they're at least enjoying some improvements to area efficiency just from the process shrink.

SemiAnalysis shows a 25.17% Area reduction as a whole when compared to Golden Cove. If Redwood Cove brings 25% IPC there will be no gains in Performance/Area

Just for comparison Zen4 is 3.84 mm^2

Hulk · Nov 7, 2022

nicalandia said:
SemiAnalysis shows a 25.17% Area reduction as a whole when compared to Golden Cove. If Redwood Cove brings 25% IPC there will be no gains in Performance/Area

View attachment 70580

Just for comparison Zen4 is 3.84 mm^2

I'm not following? I'm missing something because based on the data you presented it seems like even if Golden/Raptor have the same IPC as Redwood then there is a 25% reduction in area for equal performance. Performance/Area gains will be even larger if Redwood has a better IPC than Golden/Raptor.

poke01 · Nov 7, 2022

Some members here predicted that Crestmont will have a configurble cache system.

Well is that what this is?

https://twitter.com/x/status/1589808451068518403

Saylick · Nov 7, 2022

poke01 said:
Some members here predicted that Crestmont will have a configurble cache system.

Well is that what this is?

https://twitter.com/x/status/1589808451068518403

Dang, we haven't seen that much L2 cache since Core 2.

poke01 · Nov 7, 2022

Looks like Crestmont is going be actually different than Gracemont.

lightisgood · Nov 7, 2022

poke01 said:
Some members here predicted that Crestmont will have a configurble cache system.

Well is that what this is?

https://twitter.com/x/status/1589808451068518403

Sierra Forest?

Discussion Intel Meteor, Arrow, Lunar & Panther Lakes + WCL Discussion Threads

Senior member

Attachments

Diamond Member

Golden Member

Diamond Member

Elite Member

Diamond Member

Diamond Member

Lifer

Diamond Member

Diamond Member

Platinum Member

Diamond Member

Member

Diamond Member

Golden Member

Diamond Member

Golden Member

Lifer

Diamond Member

Lifer

Diamond Member

Diamond Member

Diamond Member

Diamond Member

Diamond Member

Senior member