Discussion Intel Meteor, Arrow, Lunar & Panther Lakes + WCL Discussion Threads


Tigerick

Senior member
Apr 1, 2022
942
858
106
Wildcat Lake (WCL) Specs

Intel Wildcat Lake (WCL) is an upcoming mobile SoC replacing Raptor Lake-U. WCL consists of 2 tiles: a compute tile and a PCD tile. The compute tile is a true single die containing CPU, GPU, and NPU, fabbed on the Intel 18A process. Last time I checked, the PCD tile is fabbed on TSMC's N6 process. The tiles are connected through UCIe rather than D2D, a first for Intel. Expect a launch in Q1 2026.

| | Intel Raptor Lake-U | Intel Wildcat Lake 15W? | Intel Lunar Lake | Intel Panther Lake 4+0+4 |
|---|---|---|---|---|
| Launch date | Q1 2024 | Q2 2026 | Q3 2024 | Q1 2026 |
| Model | Intel 150U | Intel Core 7 | Core Ultra 7 268V | Core Ultra 7 365 |
| Dies | 2 | 2 | 2 | 3 |
| Node | Intel 7 + ? | Intel 18A + TSMC N6 | TSMC N3B + N6 | Intel 18A + Intel 3 + TSMC N6 |
| CPU | 2 P-cores + 8 E-cores | 2 P-cores + 4 LP E-cores | 4 P-cores + 4 LP E-cores | 4 P-cores + 4 LP E-cores |
| Threads | 12 | 6 | 8 | 8 |
| CPU max clock | 5.4 GHz | ? | 5 GHz | 4.8 GHz |
| L3 cache | 12 MB | ? | 12 MB | 12 MB |
| TDP | 15-55 W | 15 W? | 17-37 W | 25-55 W |
| Memory | 128-bit LPDDR5-5200 | 64-bit LPDDR5 | 128-bit LPDDR5X-8533 | 128-bit LPDDR5X-7467 |
| Max memory | 96 GB | ? | 32 GB | 128 GB |
| Bandwidth | ? | ? | 136 GB/s | ? |
| GPU | Intel Graphics | Intel Graphics | Arc 140V | Intel Graphics |
| Ray tracing | No | No | Yes | Yes |
| EU / Xe | 96 EU | 2 Xe | 8 Xe | 4 Xe |
| GPU max clock | 1.3 GHz | ? | 2 GHz | 2.5 GHz |
| NPU | GNA 3.0 | 18 TOPS | 48 TOPS | 49 TOPS |









With Hot Chips 34 starting this week, Intel will unveil technical information on the upcoming Meteor Lake (MTL) and Arrow Lake (ARL), the next-generation platforms after Raptor Lake. Both MTL and ARL represent a new direction in which Intel moves to multiple chiplets combined into one SoC platform.

MTL also introduces a new compute tile based on the Intel 4 process, which uses EUV lithography, a first for Intel. Intel expects to ship the MTL mobile SoC in 2023.

ARL will come after MTL, so Intel should be shipping it in 2024; that is what Intel's roadmap is telling us. The ARL compute tile will be manufactured on the Intel 20A process, Intel's first to use GAA transistors, which it calls RibbonFET.



 

Attachments: PantherLake.png · LNL.png · INTEL-CORE-100-ULTRA-METEOR-LAKE-OFFCIAL-SLIDE-2.jpg · Clockspeed.png

Fjodor2001

Diamond Member
Feb 6, 2010
4,644
748
126
What are we really expecting from Titan Lake (successor to Razor Lake) w.r.t. unified core?

Reading this article:

it sounds like we'll still have P and E cores. Says it'll switch to a common ISA, but don't we have that already with NVL for P and E cores?
 

Khato

Golden Member
Jul 15, 2001
1,385
492
136
it sounds like we'll still have P and E cores. Says it'll switch to a common ISA, but don't we have that already with NVL for P and E cores?
As the article implies, Intel's unified core approach is going to mirror AMD's: don't design an entirely separate core to differentiate between P and E, just change parameters and synthesis targets on the same design.
 

Fjodor2001

Diamond Member
Feb 6, 2010
4,644
748
126
As the article implies, Intel's unified core approach is going to mirror AMD's: don't design an entirely separate core to differentiate between P and E, just change parameters and synthesis targets on the same design.
Assuming Intel uses the E core as the base for the unified core, can we expect them to push its P core variant to P core levels of ST perf?
 

Fjodor2001

Diamond Member
Feb 6, 2010
4,644
748
126
Obviously, they're not going to regress ST
So how do you modify the current E core design to arrive at a unified core whose P core variant reaches P core levels of ST perf (at the level expected in 2028, not now), without sacrificing E core targets (area / perf/watt / price / …)?
 

Khato

Golden Member
Jul 15, 2001
1,385
492
136
I'm not sure what all the plans are for the unified core. One interesting notion that might be carried over from the Royal Core development is a more scalable design. The simplest example based on the current E core implementation would be for the P core variant to have a 5x3 clustered decode front end while the E core variant remains 3x3. It'll be interesting to see if they come up with a similar 'clustered' approach for the back end. Similarly, they could keep a larger L2 cache shared between 2 cores for the P core variant and a smaller L2 cache shared between 4 cores for the E core variant.

Another important point to keep in mind is that there won't be the current size disparity between the P and E core variants: probably more like 2:1 instead of the current 4:1, mostly because the P core variant won't have as poor PPA as the current P core.
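That "same design, different parameters" flow can be caricatured in a few lines. Everything here (structure names, sizes) is hypothetical and only echoes the 5x3-vs-3x3 decode and shared-L2 examples above:

```python
from dataclasses import dataclass, replace

# Hypothetical parameter set for one base core design; the numbers are
# made up and only illustrate "one architecture, two knob settings".
@dataclass(frozen=True)
class CoreConfig:
    decode_clusters: int  # number of 3-wide decode clusters
    l2_kb: int            # L2 size shared within a cluster of cores
    cores_per_l2: int     # how many cores share that L2

e_variant = CoreConfig(decode_clusters=3, l2_kb=4096, cores_per_l2=4)
# Same base design, different knob settings for the "P" variant:
p_variant = replace(e_variant, decode_clusters=5, l2_kb=5120, cores_per_l2=2)

print(p_variant.decode_clusters * 3, "decode lanes on the P variant")  # 5x3 = 15
```

The point of the sketch: only the parameter values differ, not the type describing the design, which is roughly what "parameterize and re-synthesize" means.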
 

DavidC1

Platinum Member
Dec 29, 2023
2,187
3,340
106
So how do you modify the current E core design to arrive at a unified core whose P core variant reaches P core levels of ST perf (at the level expected in 2028, not now), without sacrificing E core targets (area / perf/watt / price / …)?
Of course it'll be bigger than the current ~1 mm² target for the E cores; that's to be expected. The difference, and the hope, is that a grown-up version will be power efficient and take up less area than whatever lackluster trajectory the P core was on. Or you get similar parameters but better performance. This mirrors the Netburst-to-Core transition. It's not exact, but as the saying goes, "history doesn't repeat, but it rhymes".
I'm not sure what all the plans are for the unified core. One interesting notion that might be carried over from the Royal Core development is a more scalable design. The simplest example based on the current E core implementation would be for the P core variant to have a 5x3 clustered decode front end while the E core variant remains 3x3. It'll be interesting to see if they come up with a similar 'clustered' approach for the back end. Similarly, they could keep a larger L2 cache shared between 2 cores for the P core variant and a smaller L2 cache shared between 4 cores for the E core variant.
Arctic Wolf is very likely 4x3. Unified Core means the base architecture stays the same. If one is 5x3 and the other is say 6x3, then it's not unified. Likely they are both on 7x3, but one is frequency optimized and the other is density. Clusters are not needed for the back end. Remember, clustered decode came out of addressing the x86 decode issue. In some respects clustered is even better than monolithic, but I think that's a happy side effect. The back end doesn't have that "issue".

They've been aiming at ~30% gain every generation going back to Silvermont in 2013. Silvermont was the outlier with 50% but it also had the very anemic Bonnell predecessor.
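For a rough sense of how that ~30% cadence compounds, here is a back-of-the-envelope sketch. The flat 30% per generation is a simplification of the claim above, not measured data:

```python
# Compound effect of ~30% per-generation gains (assumed flat, for illustration).
gens = ["Goldmont", "Goldmont Plus", "Tremont", "Gracemont", "Skymont"]
perf = 1.0  # Silvermont baseline
for name in gens:
    perf *= 1.30
    print(f"{name}: {perf:.2f}x Silvermont")
# Five ~30% steps multiply out to ~3.7x, not the 2.5x you'd get by adding.
```

The takeaway is that per-generation percentages multiply rather than add, which is why a steady 30% cadence is a much bigger deal over a decade than it sounds.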
 

Khato

Golden Member
Jul 15, 2001
1,385
492
136
Unified Core means the base architecture stays the same. If one is 5x3 and the other is say 6x3, then it's not unified. Likely they are both on 7x3, but one is frequency optimized and the other is density.
It depends on how it's designed, no? Why should parameterizing most of the design to work with an arbitrary number of decode clusters be difficult? I'd guess that only a few blocks would require separate logic for each configuration. Would that really qualify as a different architecture? I know it would with old-school design methodology, but with current design and synthesis flows it could be almost entirely reuse.

Note that I was mentioning the backend side of the design adopting a similar approach simply to keep resource ratios similar. Basically take any structures that are amenable to scaling/duplication and use them to differentiate between the P and E variants, while other areas are exactly the same and designed for the P core performance level.
 

OneEng2

Golden Member
Sep 19, 2022
1,012
1,212
106
What are we really expecting from Titan Lake (successor to Razor Lake) w.r.t. unified core?

Reading this article:

it sounds like we'll still have P and E cores. Says it'll switch to a common ISA, but don't we have that already with NVL for P and E cores?
I think the real question is how much more the "unified core" will look like a P core than an E core. Or will it look more like an E core?

I have been thinking that those who imagine Zen 6-level performance out of a Skymont-level die area and power envelope will be disappointed. In engineering you never get something for nothing.
As the article implies, Intel's unified core approach is going to mirror AMD's: don't design an entirely separate core to differentiate between P and E, just change parameters and synthesis targets on the same design.
AMD has a good approach IMO. It is very cost effective and design time friendly (in comparison to making 2 totally different core architectures and then trying to get everything scheduled correctly).
This is mirroring Netburst to Core transition. It's not exact, but as the saying goes "history doesn't repeat, but rhymes".
LOL. Indeed.

I still have trouble giving "Cove" the same black eye as "Netburst". I still have hope that, should Intel free up the latency in NVL, "Cove" might breathe much better than people give it credit for.

Still, your point is well taken.
 

Thunder 57

Diamond Member
Aug 19, 2007
4,294
7,101
136
I think the real question is how much more the "unified core" will look like a P core than an E core. Or will it look more like an E core?

I have been thinking that those who imagine Zen 6-level performance out of a Skymont-level die area and power envelope will be disappointed. In engineering you never get something for nothing.

AMD has a good approach IMO. It is very cost effective and design time friendly (in comparison to making 2 totally different core architectures and then trying to get everything scheduled correctly).

LOL. Indeed.

I still have trouble giving "Cove" the same black eye as "Netburst". I still have hope that, should Intel free up the latency in NVL, "Cove" might breathe much better than people give it credit for.

Still, your point is well taken.

You keep saying that. I think it would be more appropriate to say there's "no free lunch" regarding physics. To use Netburst as you mentioned, compare it to Core 2: Conroe was a lot faster, used less die area, drew less power, and produced less heat than Netburst at the time. Time and cost are more difficult to consider, but it still seemed like a giant win.

Just recently you posted this:

I guess I believe that in this day and age, there aren't any mysterious CPU architectures that magically work better than everything that came before it.

I believe that Apple, Intel, and AMD all have equivalent engineering teams and tools. The difference is what you target your architecture to do and what things you decide to prioritize and what things you decide to give up.


I believe you can't say I want it all and actually get it all. If you say I want a core that is very power efficient, you can't also say I want a core that clocks higher than the competition.

You can't say I want the core to be very small AND I want 4 way SMT, AVX512, etc, etc.

I do agree that Lion Cove appears to have lower PPA than Zen 5, although I think ARL in general gets a pretty bad rap on the basis of its poor showing in latency sensitive applications (which is mostly a ring bus issue IMO vs a core problem).

I think it is a pretty tall order to take ANY derivative of Skymont and make it compete with Zen 5 across the board. I think you can make it do some things better, but at the expense of doing other things worse.

I just don't see getting something for nothing in engineering.

Well, that's a rather defeatist attitude. Rather than pushing to innovate, you think teams just throw in the towel and say, "Well, that's it lads, we've thought of everything; best to figure out how to put out the best CPU with all that will ever exist." That might be simplifying it a bit. If I am wrong, though, I would like to hear your thoughts.
 

Josh128

Banned
Oct 14, 2022
1,542
2,295
106
You keep saying that. I think it would be more appropriate to say there's "no free lunch" regarding physics. To use Netburst as you mentioned, compare it to Core 2: Conroe was a lot faster, used less die area, drew less power, and produced less heat than Netburst at the time. Time and cost are more difficult to consider, but it still seemed like a giant win.

Just recently you posted this:



Well, that's a rather defeatist attitude. Rather than pushing to innovate, you think teams just throw in the towel and say, "Well, that's it lads, we've thought of everything; best to figure out how to put out the best CPU with all that will ever exist." That might be simplifying it a bit. If I am wrong, though, I would like to hear your thoughts.
It's probably better to say there's very little low-hanging fruit left to pick in x86 CPU design. Gains are small and often localized to certain functions. It's all dependent on the current state of the art in Boolean algebra, logic design, and materials science. Small advancements in any of them can lead to significant performance gains, but it's a slog, and breakthroughs are slow. Maybe AI will help speed things up, lol.
 

511

Diamond Member
Jul 12, 2024
5,495
4,897
106
Thanks for sharing. A few percent slower than the equivalent 255H. Data encryption and compression scores were dragging the 358H down quite a bit. But this still isn't a launch BIOS, and we don't know the power consumption or frequencies used.
It's a 400 MHz lower clock (4.7 vs 5.1 GHz) with roughly 6% IPC improvement, so the best case is PTL matching ARL's ST performance at lower power.
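As a quick sanity check on that claim (taking the rumored 4.7 GHz vs 5.1 GHz clocks and the ~6% IPC figure at face value; neither is confirmed), the arithmetic does work out roughly flat:

```python
# Rough ST-performance estimate: perf ~ clock x IPC (relative units).
# The 4.7/5.1 GHz clocks and the 6% IPC uplift are rumors, not confirmed specs.
arl_clock, ptl_clock = 5.1, 4.7   # GHz
ipc_uplift = 1.06                 # ~6% IPC gain for PTL over ARL

relative_st = (ptl_clock / arl_clock) * ipc_uplift
print(f"PTL ST vs ARL: {relative_st:.3f}x")  # ~0.977x, i.e. within a few percent
```

So the ~8% clock deficit and the ~6% IPC gain nearly cancel, leaving PTL within about 2% of ARL in single thread on these numbers.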
 

dullard

Elite Member
May 21, 2001
26,196
4,869
126
It's a 400 MHz lower clock (4.7 vs 5.1 GHz) with roughly 6% IPC improvement, so the best case is PTL matching ARL's ST performance at lower power.
I think you misunderstood my post. 4.8 GHz for the P cores is what rumors state the final 358H configuration will be (I haven't seen E core speed rumors yet; do you know?). But we don't know at what speed or power that specific test was performed. We can assume it is what the rumors state; I just like to be conservative and point out that the actual values are not specified.
 

OneEng2

Golden Member
Sep 19, 2022
1,012
1,212
106
Well, that's a rather defeatist attitude. Rather than pushing to innovate, you think teams just throw in the towel and say, "Well, that's it lads, we've thought of everything; best to figure out how to put out the best CPU with all that will ever exist." That might be simplifying it a bit. If I am wrong, though, I would like to hear your thoughts.
I would argue that, for the most part, generation-on-generation performance gains have come on the back of generation-on-generation lithography improvements.

The notable difference was that Core 2 was simply a MUCH better architecture than Netburst, which was wrongheaded in so many ways.

I do not believe that either AMD or Intel's current architectures are fundamentally "wrong headed" like Netburst.

Because I don't expect much to change in lithography in the next generation (15%?), I also don't expect much to change in processor design. This comes from my belief that the big improvements come at the expense of a higher transistor budget (deeper buffers, wider execution, more execution units, more L1, L2, L3, etc.).

When there isn't more transistor budget to be had, I believe the best you can do is make good tradeoffs. Yes, special instructions help, but only in limited circumstances.
Plenty left, but none of that fits in area or timing budgets.
Agree. I think we might see a little surprise here or there, but without the huge density improvements of the past, it's hard to see big generation-on-generation performance improvements.

In fact, I think it may be even less than the transistor budget improvement percentage, since many of the current methods have reached an inflection point: adding x% more transistors no longer gets you x% more performance, it gets you y%, and y keeps getting smaller.
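One old heuristic for that inflection is Pollack's rule, which says single-thread performance scales roughly with the square root of the transistor budget. A quick sketch (the +15% case matches the density guess above; the rule itself is only a rule of thumb, not a law):

```python
import math

# Pollack's rule sketch: perf ~ sqrt(transistor budget).
# The +15% figure is the lithography guess from the post above; the other
# budgets are just for comparison.
for budget_gain in (0.15, 0.50, 1.00):   # +15%, +50%, +100% transistors
    perf_gain = math.sqrt(1 + budget_gain) - 1
    print(f"+{budget_gain:.0%} transistors -> ~+{perf_gain:.0%} perf")
```

On this heuristic, a 15% transistor budget increase buys only about 7% more single-thread performance, which is exactly the x%-in, y%-out (y < x) shape described above.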