Question Discussion over P-core and E-core vs AMDs regular vs C-core

Page 2 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.
Jul 27, 2020
24,796
17,247
146
By creating the c cores AMD has validated Intel's hybrid approach.
I'm still not sure if Alder Lake was always supposed to launch with E-cores. They just seem hurriedly tacked on and the post-launch disabling of AVX-512 points to that possibility.

Also, Intel having to create a special Thread Director whereas AMD hasn't revealed anything of the sort in their design seems really odd to me. Like what was wrong with Intel's design that they needed a special Thread Director?
 

Hulk

Diamond Member
Oct 9, 1999
5,109
3,635
136
I'm still not sure if Alder Lake was always supposed to launch with E-cores. They just seem hurriedly tacked on and the post-launch disabling of AVX-512 points to that possibility.

Also, Intel having to create a special Thread Director whereas AMD hasn't revealed anything of the sort in their design seems really odd to me. Like what was wrong with Intel's design that they needed a special Thread Director?

I know there is quite a bit of R&D that goes into CPU manufacturing and generally the lead time is measured in years so I'm not exactly sure what you mean by "tacked on" as it implies a last minute design change, which would have probably been months if not years ahead of release?

As for the Thread Director, if it works for Intel hybrid CPU's wouldn't it also work for AMD?
 

Khato

Golden Member
Jul 15, 2001
1,251
321
136
You got half of it correct. AMD and Intel are using c and E cores for area efficiency. But they are not using them for power efficiency. P cores and normal size Zen 4 cores are more power efficient at iso-performance than their "full size" counterparts. Well, to be totally honest this is definitely the case for Intel, for AMD efficiency is probably about the same since the architecture is the same, but the c cores are smaller so the only advantage of using them for AMD is area efficiency.

Due to the nonlinear nature of the v/f curve a big (in terms of die area), wide (in terms of architecture) CPU running at a low frequency will generally be more power efficient than a smaller physical core running at higher frequency to achieve the same performance. But of course die area is expensive so we have c and E cores.
The p-core/normal zen4 only has higher performance per watt above a certain point. While the v/f curve and hence dynamic power favors the 'large' cores, that comes at the cost of higher static power. The Chips and Cheese article on Alder Lake has some nice graphs showing such - https://chipsandcheese.com/2022/01/28/alder-lakes-power-efficiency-a-complicated-picture/
 
  • Like
Reactions: igor_kavinski

jpiniero

Lifer
Oct 1, 2010
16,326
6,806
136
I'm still not sure if Alder Lake was always supposed to launch with E-cores. They just seem hurriedly tacked on and the post-launch disabling of AVX-512 points to that possibility.

No it was always intended to be the case. Speculation was that Intel had originally planned for a software based solution to deal with AVX-512 but was abandoned... far enough along that completely removing AVX-512 wasn't worth the effort, esp when they had probably originally intended Alder Lake Xeon E's to support it still.
 
Jul 27, 2020
24,796
17,247
146
As for the Thread Director, if it works for Intel hybrid CPU's wouldn't it also work for AMD?

Built directly into the hardware, Intel Thread Director uses machine learning to schedule tasks on the right core at the right time (as opposed to relying on static rules).
So my question stands, why does Intel need to depend on this feature taking up die space whereas AMD manages to keep their P and c cores working without it?
 
Jul 27, 2020
24,796
17,247
146
Speculation was that Intel had originally planned for a software based solution to deal with AVX-512 but was abandoned... far enough along that completely removing AVX-512 wasn't worth the effort
So at Alder Lake launch, they suddenly decided to abandon that software effort? Seems weird. Markfw had his Alder Lake's AVX-512 active and managed to evade its disabling by staying on Linux where Intel or mobo vendors can't push microcode updates as easily as they do on Windows. This AVX-512 drama is why I think the E-cores were added to Alder Lake as a last ditch effort when Intel realized that they would lag significantly behind AMD if they stuck to just P cores. Alder Lake's 12900K was probably originally intended as a 10C/20T part, just like 10900K. At the last minute, they changed the floor plan and replaced two of the P-cores with E-core clusters.
 

jpiniero

Lifer
Oct 1, 2010
16,326
6,806
136
So at Alder Lake launch, they suddenly decided to abandon that software effort? Seems weird.

Not suddenly. You have to remember that a normal chip design is like a 4-5 year effort... and with the 10 nm problems, Alder Lake was likely more than that.
 
Jul 27, 2020
24,796
17,247
146
Juicy tidbits there:

Between 3 and 4 GHz, these P-Cores can give the E-Cores a run for their money. In an integer workload, Golden Cove consumes about the same amount of total energy while completing the task faster. With a vectorized load, Golden Cove finishes the task so much faster that it ends up using less total energy than Gracemont, even though Gracemont draws less power. That means running Gracemont above 3.2 GHz is pointless if energy efficiency is your primary concern. Running the E-Cores at 3.8 GHz basically makes them worse P-Cores. But that’s exactly what Alder Lake does by default.

Below 3 GHz, Gracemont shines.
It’s able to maintain better performance at very low power targets (as we saw in the previous section) while consuming less energy. With an integer workload, it’s especially efficient.

I hope no one tells me that Intel didn't know this. They deliberately ran Alder Lake outside its power efficiency sweet spot to combat the threat of AMD eating their lunch and making them look like amateurs. So while Intel's architecture may not have been completely crap, their process node disadvantage really hurt them like hell. Like "fire ant bit me!" kind of hurt.
 

Hulk

Diamond Member
Oct 9, 1999
5,109
3,635
136


So my question stands, why does Intel need to depend on this feature taking up die space whereas AMD manages to keep their P and c cores working without it?
I don't know. Maybe since AMD c cores are the same cores just clocked lower than their "P" cores it's easier for the OS to direct threads to the cores that can clock higher? It's more complicated with Intel because the P's and E's have significantly different architecture as well as clocks. Intel's E's are more like better HT threads.

Windows has been directing threads to preferred (higher max frequency) cores for quite a few years now so I think that is basically what AMD is utilizing By that I mean the non-c cores are the "preferred cores."
 

moinmoin

Diamond Member
Jun 1, 2017
5,213
8,378
136
Always wonder why they didn't continue using HDL for mobile
Do you mean high density libraries with HDL? If so AMD actually does just that with all Zen and many Radeon designs. But that doesn't mean the resulting design has a focus on density, just that the grid is smaller than with high performance libraries.

By creating the c cores AMD has validated Intel's hybrid approach.
If you want to think so.

In my opinion AMD's focus was clearly to compete with the massive ARM cores server chips. With those high frequency is not the focus but perf/area is, and that's what Bergamo is for.

With PHX2 for a long time it wasn't known whether it actually contains Zen 4c cores, some people here and elsewhere claimed that that still isn't a hybrid design, and Ian Cutress said we won't see a hybrid design from AMD before next gen.
 
  • Like
Reactions: Tlh97 and KompuKare
Jul 27, 2020
24,796
17,247
146
Windows has been directing threads to preferred (higher max frequency) cores for quite a few years now so I think that is basically what AMD is utilizing By that I mean the non-c cores are the "preferred cores."
Plausible explanation
 

jpiniero

Lifer
Oct 1, 2010
16,326
6,806
136
With PHX2 for a long time it wasn't known whether it actually contains Zen 4c cores, some people here and elsewhere claimed that that still isn't a hybrid design, and Ian Cutress said we won't see a hybrid design from AMD before next gen.

There are rumors that AMD is working on an new Cat core but it's some ways off.
 

Doug S

Diamond Member
Feb 8, 2020
3,176
5,442
136
Intel is going from 10nm to 5nm with Arrow Lake. That means Intel has much more silicon real estate on their chips. They can be more flexible. Dumping hyperthreading indicates they have some fancy new technology moving forward.

I don't think it indicates any "fancy new technology", they likely believe it is easier to increase MT throughput with their little cores than the additional complexity hyperthreading brings.
 

Hulk

Diamond Member
Oct 9, 1999
5,109
3,635
136
I don't think it indicates any "fancy new technology", they likely believe it is easier to increase MT throughput with their little cores than the additional complexity hyperthreading brings.

I don't know about "fancy new technology" either but I do think that Intel is excising HT structures from the P cores so that they can design for maximum ST performance now that they can "go to" the E's for as much MT compute as needed. Need more MT compute? Add another E cluster. Or as I read somewhere the E's are "Cinebench accelerators."
 

Abwx

Lifer
Apr 2, 2011
11,818
4,740
136
, for AMD efficiency is probably about the same since the architecture is the same, but the c cores are smaller so the only advantage of using them for AMD is area efficiency.

No, the C cores are quite more efficient at same frequency assuming it s low enough, AMD s comparison slide is at same frequency for both regular and C cores, beside they have published a slide comparing 2P + 4C and a 6P, and below a given frequency the former is more efficient.
 

Hulk

Diamond Member
Oct 9, 1999
5,109
3,635
136
No, the C cores are quite more efficient at same frequency assuming it s low enough, AMD s comparison slide is at same frequency for both regular and C cores, beside they have published a slide comparing 2P + 4C and a 6P, and below a given frequency the former is more efficient.
Interesting. Two nearly identical architectures on the same nodes but different power for the same frequency and workload. I'd like to read more about this. Can you post that link?

The 2P+4C to 6P comparison isn't apples-to-apples because the 2P's could be running faster, 4C's lower, same compute output but more power for the 2+4.

Let's dig into this!
 

Abwx

Lifer
Apr 2, 2011
11,818
4,740
136
Interesting. Two nearly identical architectures on the same nodes but different power for the same frequency and workload. I'd like to read more about this. Can you post that link?

The 2P+4C to 6P comparison isn't apples-to-apples because the 2P's could be running faster, 4C's lower, same compute output but more power for the 2+4.

Let's dig into this!

The comparison slide i posted is at isofrequency.


Now for the 2P + 4C vs 6P.

The 2P + 4C efficency is somewhat lower than it should be at low frequency because of the 2 regular Zen cores, these increase the efficiency at the higher powers but at the lower powers (below 15W) 6 C cores would do better than the 2P + 4C assymmetrical arrangement.

Zen4c-Technology-for-Laptops-Press-Deck-8.png


 
Last edited:

KompuKare

Golden Member
Jul 28, 2009
1,223
1,578
136
AMD added the c cores for the exact same reason Intel did. Most software doesn't utilize more than 8 or so threads so there are decreasing gains to adding more full size cores. Instead of designing an entirely new architecture AMD reduced the physical size of Zen 4, which required a decrease in clocks. This way they got quite a bit of additional multicore performance for less die area than would have been required with "full speed" Zen 4 (larger) cores.
i think that is a very consumer centric way to think about it.

Where Zen 4C excels is server chips and density.
Yes, they can repurpose it for desktop and more importantly laptops, but I do not think that was that got them to develop the C cores.

Intel is almost the polar opposite. The did not abandon their smaller cores (but Atom has long no longer been a part purposefully crippled whose main purpose was to keep old fabs busy). But the E cores were not designed for servers. Yes, they will start using E cores in Server with their 'forest' Xeon but they are way late compared to Bergamo or the various ARM many-small-cores server chips.
 

BorisTheBlade82

Senior member
May 1, 2020
688
1,085
136
Slightly :

9-1280.jpg



Sadly, that slide is misleading. It already takes into account, that for the same MT performance more Zen4c cores are used and that they clock lower than a comparable Zen4 SKU. Looking at the V/f curve, Zen4c is quite disappointing IMHO.

I fully expect AMD to specialize their two cores more in the coming generations, where the c-cores show real efficiency improvements in terms of Work per Joule. This might only be the case for Integer.
All in all, in the client space the heterogenous approach makes sense to me due to Amdahl's law. Server space is different, because there it does not apply in the same way. There, many smaller cores might become the standard while big cores might only be used for special workloads and applications that get licensed by core count usage.