Can someone calculate the [1] Performance per Area and [2] Performance per Watt of the following cores:
1) Apple Firestorm
2) AMD Zen 3
3) Intel Golden Cove
Thanks in advance : )
Edit : I Wouldn't mind values of Sunny Cove if Golden Cove isn't available.
Are the x86 core values corrected for fab node geometry differences to the N5 that A14 is produced on?Okay, I was able to finish a Quick Research on each core area(mm2 on the die) and performance using Geekbench I will try to find info for power usage.
![]()
CPU Benchmarks and Hierarchy 2025: CPU Rankings
We've run thousands of CPU benchmarks on all new and older Intel and AMD CPUs and ranked them.www.anandtech.com
So these scores are for Performance per Area(square millimeter)
A single Apple Firestorm core is about 9 mm2 in size, they get about 1740 points on GB5, that puts them at 193 points per area.
A single Amd Zen3 core is about 6.5 mm2 in size, they get about 1500 points on GB5, that puts them at 230 points per area.
A single Intel Golden Cove core is 10.5 mm2 and they get abut 190 points on GB5, that gets them about 100 points per area.
Edit.
Also as I pointed out earlier, this is 1C/1T which puts the X86 CPUs on a disadvantage when compared to a 1C/1T Apple Firestorm. so add about 30% performance per core since the HT/SMT is already built on the Core
Okay, I was able to finish a Quick Research on each core area(mm2 on the die) and performance using Geekbench I will try to find info for power usage.
![]()
CPU Benchmarks and Hierarchy 2025: CPU Rankings
We've run thousands of CPU benchmarks on all new and older Intel and AMD CPUs and ranked them.www.anandtech.com
So these scores are for Performance per Area(square millimeter)
A single Apple Firestorm core is about 9 mm2 in size, they get about 1740 points on GB5, that puts them at 193 points per area.
A single Amd Zen3 core is about 6.5 mm2 in size, they get about 1500 points on GB5, that puts them at 230 points per area.
A single Intel Golden Cove core is 10.5 mm2 and they get abut 190 points on GB5, that gets them about 100 points per area.
Edit.
Also as I pointed out earlier, this is 1C/1T which puts the X86 CPUs on a disadvantage when compared to a 1C/1T Apple Firestorm. so add about 30% performance per core since the HT/SMT is already built on the Core
Performance in what ? Cinebench r23 or Cpumark99 ? Packman ?
What architecture of Zen3 ? APU or 5xx0 series or maybe Epyc ? What Alder lake SKU ? At what binning, cherry picked silicon or over how big average sample size ?
Area including what ? Cores alone ? Uncore? with or without L1, L2 and L3 ? Memory controller ? IO die for Zen3 ? Maybe you mean mm2 socket ?
How do you measure power ? In idle or in what workload ? avx512 or SSE ? Including gpu (apu) ? Powerdraw shown in hwinfo or measured from the wall ? What bios settings, MCE enabled ?
Your question is like me asking what the color purple taste like.. it have no meaning.
NoAre the x86 core values corrected for fab node geometry differences to the N5 that A14 is produced on?
I omitted Area without L3 because the M1 lacks L3$...But its a big Fat L2, without any Cache or with full Cache would have been better. Amd Full 8 Mb $ per core is as big as the Core/L2 sizeThank you very much for this. Really appreciate the effort !
This was the type of calculation that i was exactly looking for !
It would be good if you or someone else can also do the same calculation for other benchmarks like Cinebench and SPEC.
That would help us to arrive at meaningful conclusion.
Interesting, the Zen 3 has high performance per area than the other two, as per your calculation.
Shall we do a performancex MTr/mm²?Are the x86 core values corrected for fab node geometry differences to the N5 that A14 is produced on?
I omitted Area without L3 because the M1 lacks L3$...But its a big Fat L2, without any Cache or with full Cache would have been better. Amd Full 8 Mb $ per core is as big as the Core/L2 size
I am currently gathering all of the data to present it with and without L2 cache$ on Firestorm(A14), Zen3(Cezanne) and Golden Cove/Willow CoveAh the huge L2 cache must be the reason why the Firestorm core has such a large silicon footprint, despite being on the smaller 5nm node.
I am currently gathering all of the data to present it with and without L2 cache$ on Firestorm(A14), Zen3(Cezanne) and Golden Cove/Willow Cove
It's a hard comparison to make because having an SLC instead of a traditional L2 is obviously a design choice that isn't made in a vacuum. Apple had much larger L1 caches (the I-cache for each Firestorm core is 192 KB) compared to the x86 CPUs from Intel and AMD.
If you have a traditional L2 cache you're probably comfortable making a trade for a faster, but smaller L1 cache. If Apple's next stop is SLC then they obviously need a larger L1 and the lower clock speeds they operate at mean that the size doesn't create as much of a penalty in terms of delay cycles.
Really you should consider the chip and memory system as a whole. No one benchmark or workload will give a perfect answer as to efficiency because some tests will favor that larger L1 cache and others won't care about it at all.
That would be great just for reference sake, but I've no idea how you would dial that back into your current comparison values in the post I replied to.Shall we do a performancex MTr/mm²?



Okay basing this chart I built off this article.
https://wccftech.com/why-apple-m1-single-core-comparisons-are-fundamentally-flawed-with-benchmarks/
True Performance of x86 cores with SMT
View attachment 54788
Intel Golden Cove core 2330 points / 7.04 mm2 : 330 per mm2
AMD Zen3 core 1997 points / 4.2 mm2 : 475 points per mm2
Apple Firestorm core 1521 points / 3.76 mm2 : 404 points per mm2
I agree...but in this case its valid since the OP requested info on performance/die area per core and thats exactly this because SMT is built on the coreWhat an incredibly stupid take.
The whole point of a single core test is to test what happens when you run a single thread of code. Tells me the author of that wccftech article is butthurt about how well M1 compares to x86 and wants to find a way to put some artificial distance between them.
Rename it single thread test then if you want, but running two threads and calling it a "single core" test is down there with correcting spelling like replying "*you're" to someone when you can't come up with any actual defense against their position.
I agree...but in this case its valid since the OP requested info on performance/die area per core and thats exactly this because SMT is built on the core