- Mar 8, 2022
- 4,309
- 5,633
- 106
Why does the P core exist, because of its 1t perf. So if the e-core exceeds P core in absolute 1t, there is no need for P core. Thats probably what UC will do.
Would be surprised it it doesn't have SMT, though that seems to be the case.
Darkmont with trivial changes gets 3-4% improvements, which eats up majority of the ~10% advantage it has over Skymont. That's less than 7% advantage, which is basically same generation in terms of "IPC".While the E-core team is supposed to be the team in charge of unified core, it would be surprising for the E-cores to esentially already be caught up in IPC before that even.
I wouldn't be suprised if Qualcomm over took Apple this year now that they got NUVIA and been working on V3 cores for a while now.Mind you as impressive as Skymont is, the ultimate test is being able to go head to head against ARM, which is 30% better for X925, while fitting in a phone. Apple of course is even better. And this is in 2025, not 2027. Being par with by-current ARM cores is what I expect they can do. Don't know whether they can be peak, it'll be very hard.
Could be. We will see how it pans out.In server environment, not having SMT would be a serious handicap vs. AMD.
I don't remember the exact words, but yes something similar to that effect, which is why I was surprised too.In Intel's Lion Cove presentation, Intel said that SMT can be turned on and off, added when needed. Implying Intel is not killing off SMT in P-Core only products.
HopefullyDiamond Rapids is not that far away, so we should get a confirmation on this soon...
If Darkmont is a 3-4% improvement over Skymont, wouldn't that by definition not be eating up the majority of the 10% advantage LNC has over Skymont? LolDarkmont with trivial changes gets 3-4% improvements, which eats up majority of the ~10% advantage it has over Skymont. That's less than 7% advantage, which is basically same generation in terms of "IPC".
Unless they never were planning for this core to be a large int IPC improvement in the first place...To go from that to 10% is literally a fail.
Funnily enough I have always thought that, at least this generation, Qualcomm might have been doing just as fine going with ARM's "vanilla" cores rather than their own custom Oryon cores.I wouldn't be suprised if Qualcomm over took Apple this year now that they got NUVIA and been working on V3 cores for a while now.
Yes it doesn't have HT as PNC doesn't have HT.I am a little confused here: Isn't Xeon 7 P Core called Diamond Rapids (DMR)? Are you saying Diamond Rapids will not have HT?
If only you knew the truth but I am done arguing over it we will see when David Huang compars Cougar Cove and Lion Cove.If this is the case, the IO die, iGPU die, and Wildcat Lake can all be shifted over to different nodes or have their plans changed to accommodate 8+16 dies on 18A-P.
Intel is wasting a bunch of money, and is also getting a bunch of bad investor press, by going external. So they have the money to do large payments to TSMC to use their N2 node, but not enough to expand capacity for 18A? You aren't even building whole new fabs, all you are doing is expanding capacity.
Capacity reason makes 0 sense. Even Intel isn't claiming this is the case- it's performance and timing apparently.
And the timing reason is BS too...
SoC is shared in NVL across all Mobile+Desktop SKUBut if the SOC die is different like it is rn, they lose a bunch of battery life, so Intel would have to figure something out for that.
But my main point there was that the 8+16 N2 die looks like it is going to get a lot of use lol.
Why would anyone think 5-6% advantage is significant in any terms? It's already quite small at only 10%. It's so close that in Arrowlake just overclocking the E cores makes 1+16 faster than 8P or 8P+16E in games.If Darkmont is a 3-4% improvement over Skymont, wouldn't that by definition not be eating up the majority of the 10% advantage LNC has over Skymont? Lol
And why would that be the case, if they want to replace the predecessor? Advancing general purpose scalar integer performance has been dreams and aspirations for CPU engineers for 40 years.Unless they never were planning for this core to be a large int IPC improvement in the first place...
Well AVX-512 took quite a lot out of theirAnd why would that be the case, if they want to replace the predecessor? Advancing general purpose scalar integer performance has been dreams and aspirations for CPU engineers for 40 years.
I disagree. They made bigger changes for many many years. And like I said AVX-512 can be made quite small, especially if it's using 256-bit width.Well AVX-512 took quite a lot out of their
development time so the Int performance would be smaller than FP improvement
The wouldn't change the fact that Int improvement will be less than FP considering they are going from 4*128b units to 4*256b unitsI disagree. They made bigger changes for many many years. And like I said AVX-512 can be made quite small, especially if it's using 256-bit width.
If there's a team that can do it, it's them.
That N2 is likely a full node better than 18A and 18A-P?If only you knew the truth
The current LNC curve is scuffed. Huang even comments on such and tried to figure out why when he was looking at LNC in LNL vs LNC in ARL and workloads that primarily fit in the core private caches.we will see when David Huang compars Cougar Cove and Lion Cove.
It's interesting to see the gap be significantly high in other common client workloads though. GB6 specifically.Why would anyone think 5-6% advantage is significant in any terms? It's already quite small at only 10%.
I have never said they can't do this, I've always said I find it hard to believe.While you are arguing they can't do this, in some cases they already have.
Because a good bit of the engineering and area is going to be going to increasing the vector width.And why would that be the case, if they want to replace the predecessor? Advancing general purpose scalar integer performance has been dreams and aspirations for CPU engineers for 40 years.
And yet that's apparently what's going to be happening.Also getting 256-bit and AVX support just itself will result in minimal gains in majority of applications
What?The fact that there are still doubts about E core being vastly better suggests that they can pull more rabbits out of their hat.
And yet AMD even just changing how AVX-512 is implemented in Zen 5 doubled their FPU area.And like I said AVX-512 can be made quite small, especially if it's using 256-bit width.
Holy glaze lolIf there's a team that can do it, it's them.
The literally doubled FP in Skymont and got extra 25% out of the already impressive 30% gain. The great thing about aiming for scalar integer is that you gain that in FP as well, because you improve everything the instructions are passing through.The wouldn't change the fact that Int improvement will be less than FP considering they are going from 4*128b units to 4*256b units
Yea, that's what exactly happened when Pentium M and Pentium 4M and Pentium 4 Mobile coexisted. Confusingly small gains by going to power hungry P4 parts. Of course the P cores will clock higher enough to negate the difference and more.But anyway, you think that Arctic Wolf will have higher ~10% higher IPC than PTC, keeping with the 30% improvement trend? Or even a 20% improvement trend, you think that Arctic Wolf will have outright higher IPC than a P-core?
I mean even if this is the case, would undeniably be funny. Unless @Doug S has another objection about my sense of humor lol.
They won't. It's already 1:3. It's going to 1:2 in NVL for a reason. E core does need to grow. I'm expecting 30% growth due to overall uarch and 25% on top of that for total of 60-70%.Because a good bit of the engineering and area is going to be going to increasing the vector width.
And they presumably still need to maintain that ~1:4 P-core to E-core cluster ratio.
Not only.And yet that's apparently what's going to be happening.
You think Arctic Wolf is also going to have small changes.What?
Obviously they are not, considering they are now apparently being the ones in charge of unified core, despite the large political hurdles against them.
AMD is in the big picture following Intel, just with better execution.And yet AMD even just changing how AVX-512 is implemented in Zen 5 doubled their FPU area.
Enough past historical evidence and this is all you could say?Holy glaze lol
Not for Lunar that was perfectly fineThe current LNC curve is scuffed. Huang even comments on such and tried to figure out why when he was looking at LNC in LNL vs LNC in ARL and workloads that primarily fit in the core private caches.
18A and 18AP has 8% difference in PPW also I am done with this get the simulation details yourself for 18A and N3B lol.That N2 is likely a full node better than 18A and 18A-P?
We will see. You must understand why at the very least, if that is the case, it would be surprising.Yea, that's what exactly happened when Pentium M and Pentium 4M and Pentium 4 Mobile coexisted.
The ratio has been getting worse, but you can still roughly equate a P-core to an E-core cluster, which is where the 1:4 ratio is coming from.They won't. It's already 1:3.
Where has been this rumored? I've not heard of it tbh.It's going to 1:2 in NVL for a reason
If the area increase is that drastic, sure, than Arctic Wolf can have such a large IPC improvement.I'm expecting 30% growth due to overall uarch and 25% on top of that for total of 60-70%.
For a lack of trying.You think Arctic Wolf is also going to have small changes.
That's not the point, the point is that getting to 256 bit width to support AVX-512 will be a substantial area cost.AMD is in the big picture following Intel, just with better execution.
I think people overhype the E-cores too much.Enough past historical evidence and this is all you could say?
Sure, that's fine.Not for Lunar that was perfectly fine
A massive difference....18A and 18AP has 8% difference in PPW
Intel is also done with this. Hence why they are using N2 over 18A-P for NVL-S lol. Power on announced in the earnings call this week hopefully.also I am done with this get the simulation details yourself for 18A and N3B lol.
The best comparison should be NVL 4+8 tiles vs NVL 8+16 tiles.We should just wait for Panther lake to judge 18A.
We can simply do core comparison as both the cores are same and so is SoC in PPA.The best comparison should be NVL 4+8 tiles vs NVL 8+16 tiles.
Confusing decisions happen because these are mega corporations while we treat them as an essentially a hive mind.We will see. You must understand why at the very least, if that is the case, it would be surprising.
It's logic. 8+16 means 2:1 area.The ratio has been getting worse, but you can still roughly equate a P-core to an E-core cluster, which is where the 1:4 ratio is coming from.
Where has been this rumored? I've not heard of it tbh.
That's not large for potential 1.3x in scalar integer(meaning everything) plus gains in vector. P cores were 1.15-1.2x in scalar gains with 1.5x core size.If the area increase is that drastic, sure, than Arctic Wolf can have such a large IPC improvement.
Never heard of anyone saying it would be that large though.
Of course it is. It's still going to end up substantially small(2:1).That's not the point, the point is that getting to 256 bit width to support AVX-512 will be a substantial area cost.
It's 8+16 rn, but one P-core is still being swapped out by a 4 core E-core cluster?It's logic. 8+16 means 2:1 area.
For the perf improvement, sure, it won't be large...That's not large for potential 1.3x in scalar integer(meaning everything) plus gains in vector. P cores were 1.15-1.2x in scalar gains with 1.5x core size.
No, the E core team unexpectedly caught up, just as Pentium M was originally designed to address Transmeta's lineup, but eventually came to be their bread and butter.For the perf improvement, sure, it won't be large...
But it would screw up the area ratio that Intel has been cultivating since, IIRC, tremont?
Only when you count the fact that LNC has large core private caches.such as Apple's M4(and yes it's smaller than Lion Cove)
But the E-cores aren't on NVL for ST performance though.No, the E core team unexpectedly caught up, just as Pentium M was originally designed to address Transmeta's lineup, but eventually came to be their bread and butter.
LNC is the worst P-core by any of the major vendors, I agree, but LNC is only that large because of them being stuffed with massive core private caches.Lion Cove is similar in perf to Zen 5 while being larger even though Zen 5 is on a less dense process.
I wouldn't be suprised if Qualcomm over took Apple this year now that they got NUVIA and been working on V3 cores for a while now.
Just need someone to "ENHANCE!" this image and we can figure it out https://pbs.twimg.com/media/GvOsGkYWoAATRMq?format=jpg&name=smallWhere has been this rumored? I've not heard of it tbh.
Image is too blurred out for even AI to enhance without generating garbageJust need someone to "ENHANCE!" this image and we can figure it out https://pbs.twimg.com/media/GvOsGkYWoAATRMq?format=jpg&name=small