Intel Unleashes Haswell-EX Xeon E7 V3 Processors

Page 2 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

zir_blazer

Golden Member
Jun 6, 2013
1,260
573
136
Haswell chips for 2011v3 have a dual imc. But only ddr4 is used.
False. Check these: 1, 2, 3

The point is that only very few models seems to use it...

This motherboard only supports the DDR3 compatible CPU (E5-2669 v3, E5-2649 v3, E5-2629 v3). You cannot power on the system if you use a DDR4 compatible CPU.

...which is rather sad. Heck, these parts seems to be OEM only and not even released to the public, they even recently deleted them from Intel Ark as I recall seeing them. Its like they are actively hidding that DDR3 support is there, since they want to push DDR4 instead. This is rather similar to what happened at some point with the Pentium 3 i820 Chipset and RAMBus, just that they didn't had a true backup plan ready when they figured out than RAMBus was going nowhere and screwed it when they had to fall back, while they're currently prepared to do so anytime now.
 
Last edited:

Idontcare

Elite Member
Oct 10, 1999
21,110
64
91
Lets not turn this into a dinosaur thread, but I got to be a master at taking the the RLL controllers off bad 30mb Seagates and swapping them to the 20mb mfm Seagate drives & low leveling it for 30megs. Must have done 50 or 60 of those. Yeah, things have come quite a ways since -debug days. 5.6 billion transistors on chip is hella impressive to an old timer like me :eek:
https://books.google.com/books?id=x...mands for 30mb seagate rll controller&f=false

Don't kid yourself, it is hella impressive even to the folks who designed the damned thing that consumes 5.6 billion transistors.

The only sentient species not impressed by such pinnacles of humankind's CMOS achievements are the aliens that gifted us said alien technology in the first place, and the random forum goer who truly seems incapable of drawing even a single NOR logic circuit let alone enough of them to consume a 5.6 billion xtor project budget ;) :p
 
Mar 10, 2006
11,715
2,012
126
Don't kid yourself, it is hella impressive even to the folks who designed the damned thing that consumes 5.6 billion transistors.

The only sentient species not impressed by such pinnacles of humankind's CMOS achievements are the aliens that gifted us said alien technology in the first place, and the random forum goer who truly seems incapable of drawing even a single NOR logic circuit let alone enough of them to consume a 5.6 billion xtor project budget ;) :p

I wonder what kind of yields Intel is getting on these bad boys. Intel is selling SKUs with all sorts of different core counts...from 4 all the way to 18...based on the same die.
 

ShintaiDK

Lifer
Apr 22, 2012
20,378
146
106
I wonder what kind of yields Intel is getting on these suckers. Intel is selling SKUs with all sorts of different core counts...from 4 all the way to 18...based on the same die.

The 4 is the exception, due to high ST performance requirement. Else the next one starts at 10 cores that also looks binned. Its first at 14 it really says defects.

But it looks more like binning than yield in most cases.

HaswellEP_DieConfig.png
 
Last edited:

mrmt

Diamond Member
Aug 18, 2012
3,974
0
76
I wonder what kind of yields Intel is getting on these bad boys. Intel is selling SKUs with all sorts of different core counts...from 4 all the way to 18...based on the same die.

4 cores with 140W at only 3.2GHz base clock is way bad. I think it's a very niche SKU, maybe to build some validation boxes for testing RAS features?
 

Idontcare

Elite Member
Oct 10, 1999
21,110
64
91
The 4 is the exception, due to high ST performance requirement. Else the next one starts at 10 cores.

But it looks more like binning than yield in most cases.

It's all yield, parametric versus functional, but I know what you meant. Just wanted to clarify for the rest of the audience.

I doubt Intel has much of a D0 (functional yield) issue, even with their 600+ mm^2 dies, but the parametric yield is probably the dominating issue for them. Particularly in terms of clockspeed versus power-consumption (itself dominated by the clockspeed vs. Vcc shmoo distribution).
 

imported_ats

Senior member
Mar 21, 2008
422
64
86
4 cores with 140W at only 3.2GHz base clock is way bad. I think it's a very niche SKU, maybe to build some validation boxes for testing RAS features?

4 cores at 140W but 6-12TB memory capable, which matters for lots of DB workloads. Low cost licensing for large scale in memory DBs.
 

Burpo

Diamond Member
Sep 10, 2013
4,223
473
126
It's not your ignorance. Amazing how people can speculate when NOBODY has had access to this new chip. :colbert:
 

TuxDave

Lifer
Oct 8, 2002
10,571
3
71
Pardon my ignorance, but I can't get either one of those URLs to work (tried Waterfox and Spartan/Edge). How does one access such content?

Not sure why it opens for me but:

IBM Power Enterprise System E870, 8 processors / 80 cores / 640 threads,
POWER8, 4.19 GHz, 32 KB (I) and 64 KB (D) L1 cache and 512 KB L2
cache per core, 8 MB L3 cache per core, 2048 GB main memory

vs

Fujitsu PRIMEQUEST 2800E2, 8 processors / 144 cores / 288 threads,
Intel Xeon Processor E7-8890 v3, 2.50 GHz, 64 KB L1 cache and
256 KB L2 cache per core, 45 MB L3 cache per processor,
2048 GB main memory

And then a bunch of numbers but I think IBM is winning.
 

DrMrLordX

Lifer
Apr 27, 2000
22,937
13,024
136
It's not your ignorance. Amazing how people can speculate when NOBODY has had access to this new chip. :colbert:

Apparently, Fujitsu has . . .?

Not sure why it opens for me but:

IBM Power Enterprise System E870, 8 processors / 80 cores / 640 threads,
POWER8, 4.19 GHz, 32 KB (I) and 64 KB (D) L1 cache and 512 KB L2
cache per core, 8 MB L3 cache per core, 2048 GB main memory

vs

Fujitsu PRIMEQUEST 2800E2, 8 processors / 144 cores / 288 threads,
Intel Xeon Processor E7-8890 v3, 2.50 GHz, 64 KB L1 cache and
256 KB L2 cache per core, 45 MB L3 cache per processor,
2048 GB main memory

And then a bunch of numbers but I think IBM is winning.

Hmm okay. I would like to know what the net power draw is for both of those machines during testing.
 

thunng8

Member
Jan 8, 2013
167
72
101
Not sure why it opens for me but:

IBM Power Enterprise System E870, 8 processors / 80 cores / 640 threads,
POWER8, 4.19 GHz, 32 KB (I) and 64 KB (D) L1 cache and 512 KB L2
cache per core, 8 MB L3 cache per core, 2048 GB main memory

vs

Fujitsu PRIMEQUEST 2800E2, 8 processors / 144 cores / 288 threads,
Intel Xeon Processor E7-8890 v3, 2.50 GHz, 64 KB L1 cache and
256 KB L2 cache per core, 45 MB L3 cache per processor,
2048 GB main memory

And then a bunch of numbers but I think IBM is winning.

You can browse all results here:

http://global.sap.com/solutions/benchmark/sd2tier.epx
The power8 results are from last year so you have to go to 2nd or 3rd page.

It is basically 79750 vs 58626 users

POWER8 36% higher on per chip basis and an amazing 2.45x faster on per core basis. Those Power8 cores make the Haswell core look very slow on this workload.
 

Burpo

Diamond Member
Sep 10, 2013
4,223
473
126
I don't see much difference especially since one is using Windows Server 2012 & the other Red Hat Enterprise Linux.
 

imported_ats

Senior member
Mar 21, 2008
422
64
86
You can browse all results here:

http://global.sap.com/solutions/benchmark/sd2tier.epx
The power8 results are from last year so you have to go to 2nd or 3rd page.

It is basically 79750 vs 58626 users

POWER8 36% higher on per chip basis and an amazing 2.45x faster on per core basis. Those Power8 cores make the Haswell core look very slow on this workload.

Or 40% slower per thread. All depends on how you want to look at it. SAP 2D tends to have a lot of idle time on a per context basis due to latency of storage and cache misses. Xeon would likely show near linear gains from SMT2 -> SMT4.
 

thunng8

Member
Jan 8, 2013
167
72
101
That's statement is laughable. Xeon was not designed for 4 threads and trying to shoehorn something that it wasn't designed for would likely decrease performance.

Ibm has given estimated scaling from 1 thread to 8 threads per core in their performance reports. I am taking the 192 core e880 as an example

1: 1936
2: 2808
4: 3650
8: 3905

http://public.dhe.ibm.com/common/ssi/ecm/po/en/poo03017usen/POO03017USEN.PDF

Going from 4 threads to 8 threads give only a small increase

Also going by those numbers, power8 running in single thread would still be faster than Xeon
 
Last edited:

imported_ats

Senior member
Mar 21, 2008
422
64
86
That's statement is laughable. Xeon was not designed for 4 threads and trying to shoehorn something that it wasn't designed for would likely decrease performance.

Ibm has given estimated scaling from 1 thread to 8 threads per core in their performance reports. I am taking the 192 core e880 as an example

1: 1936
2: 2808
4: 3650
8: 3905

http://public.dhe.ibm.com/common/ssi/ecm/po/en/poo03017usen/POO03017USEN.PDF

Going from 4 threads to 8 threads give only a small increase

Also going by those numbers, power8 running in single thread would still be faster than Xeon

all depends on the workload and what the limiters are. Some workloads scale amazingly well with SMT contexts, others not so well.
 

DrMrLordX

Lifer
Apr 27, 2000
22,937
13,024
136
Okay, it is interesting to see how Xeon is performing here, at least in this one benchmark. The systems, as configured, show that POWER8 wins with an even socket count. I still would like to know, what are the individual power draw numbers for both of these machines?