Nehalem is 10 years old now and was their first core i5/i7 architecture. Is intel limited by the fact that their groundwork is old now? Do they need a clean slate for their future generations?
Actually it's based off Sandy Bridge. Nehalem core is a minor change(even miniscule) from original Core uarch while Sandy Bridge does the overhaul to finally get it away from having the remnants of Pentium Pro.
Didn't Intel's first FinFET lower the performance? I was very surprised by Ivy Bridge's power/thermal characteristics after hearing all the fanfare of FinFET's awesomeness. Compared to 32nm Sandy Bridge, its power consumption did not decrease and it clocked lower. That made me somewhat skeptical of FinFET, then I was surprised again when Samsung's 14nm FinFET actually did improve upon its 20nm by a sizable margin. TSMC followed shortly thereafter with 16nm FF/FF+ which appeared even better than Samsung's 14nm FF. That was when I was convinced that Intel's process lead is nowhere near where it was thought to be.I'd be surprised if the already disappointing 5 to 10% gain today is maintained in the future. Even with transistors so-called revolutionary changes like FinFET were required just to keep the performance increases going, nevermind bring a true breakthrough in performance. In the 90s, we got 50% improvements with new transistors in every area, while FinFET with all its complexities brought 40% only in very limited scenarios.
Ivy Bridge had fine power/thermals, but bad TIM on k chips sort of wrecked it.Didn't Intel's first FinFET lower the performance? I was very surprised by Ivy Bridge's power/thermal characteristics after hearing all the fanfare of FinFET's awesomeness.
Was not 20nm considered a rolling disaster for both Samsung and TSMC? There's a reason they cancelled GPUs on it.Samsung's 14nm FinFET actually did improve upon its 20nm by a sizable margin.
Actually it's based off Sandy Bridge. Nehalem core is a minor change(even miniscule) from original Core uarch while Sandy Bridge does the overhaul to finally get it away from having the remnants of Pentium Pro.
Doing an overhaul does not indicate it'll turn out to have more headroom. CPU architectures have largely hit limits on IPC and clock. Look at CPUs from various companies, market positioning, and ISAs. The level is converging. And yes, Zen is another example.
I'd be surprised if the already disappointing 5 to 10% gain today is maintained in the future. Even with transistors so-called revolutionary changes like FinFET were required just to keep the performance increases going, nevermind bring a true breakthrough in performance. In the 90s, we got 50% improvements with new transistors in every area, while FinFET with all its complexities brought 40% only in very limited scenarios.
Entire cache hierarchy and northbridge integration, though.Nehalem core was very modestly changed compared to the core in the C2D/C2Q line.
That is one explanation. I wonder why Intel would do such a thing, though? That made the whole FinFET achievement look underwhelming.Ivy Bridge had fine power/thermals, but bad TIM on k chips sort of wrecked it.
You are talking about 28 nm to 20 nm transition which was a mediocre improvement, and more importantly 20 nm was not FinFET. Also I am not talking about financial/business point of view, which I am sure has a lot of drama in it depending on who you speak to. Technology-wise, the Exynos 7420, which was the 2nd FinFET after Ivy Bridge, was a significant improvement over the Exynos 5433.Was not 20nm considered a rolling disaster for both Samsung and TSMC? There's a reason they cancelled GPUs on it.
HSW->SKLY is the least impressive "jump" in the whole core generation history.Basically for Haswell desktop users, it is is ~6% higher IPC according to AT. For BDW users it was ~3% "jump". The only saving grace is KL and its crazy high OC potential.Nehalem core was very modestly changed compared to the core in the C2D/C2Q line. Sandy Bridge made some significant enhancements, but it was still based on the same Nehalem core with a lot of bits reused.
I would say NHM -> SNB is on the same order as SNB->HSW or HSW->SKL.
AT results were deflated, for one early Z170 bug reasons, so pretty please, stop bringing them up.Basically for Haswell desktop users, it is is ~6% higher IPC according to AT.
We will have new results soon as per Ian. He is preparing new CPU charts and there you can check how much "faster" is Skylake.AT results were deflated, for one early Z170 bug reasons, so pretty please, stop bringing them up.
iirc the anandtech nehalem article talked about how similar it was to phenom. just better.Nehalem was a very big overhaul of the CPU as a whole- whole new cache hierarchy with introduction of the L3, L2 shrinking dramatically, memory controller unified into the CPU... it was a very Phenom-like architecture. The core itself didn't change that much apart from the (pretty major) addition of Hyperthreading, but everything around it did.
Nehalem was a very big overhaul of the CPU as a whole- whole new cache hierarchy with introduction of the L3, L2 shrinking dramatically, memory controller unified into the CPU... it was a very Phenom-like architecture. The core itself didn't change that much apart from the (pretty major) addition of Hyperthreading, but everything around it did.
I think that the best solution is POWER like. Separate schedulers for int, fp, memory and branch. Intel is at unified scheduler. AMD split in int+mem+branch and FP. Still better than unified, because you can have more ports at even lower FO4, but with fully split schedulers, you can lower further the FO4...
Many CPU experts were saying Sandy Bridge marks the true departure from Pentium Pro, and I am inclined to say the same. Also I'd think modifying the core is a far widespread change than adding some uncore and deleting some cache blocks. Nehalem was a good server chip not entirely due to the IMC/QPI but because the good core was hobbled by the lack of it. I'd bet Prescott being on an IMC/QPI wouldn't have been so impressive.
The lack of core changes were also why the initial impact on consumer applications were not so good. Single thread without Turbo were often only about 5% faster.
Because you might as well say every current chip today is based on Intel's 8086 chip. At what point do we talk about not being based off anymore?
Funny thing about being perceived a "loser" and a "winner". When Intel chips were dominating, unified schedulers were "it". They take up less space, they were power efficient, and just a more efficient way of doing things. When AMD was doing well with Athlon, people said the same thing about separate schedulers. That different way of doing things were so consistent between the two companies over such a long period it may just be company culture.
Just a different way of doing things folks.
iirc the anandtech nehalem article talked about how similar it was to phenom. just better.
Many CPU experts were saying Sandy Bridge marks the true departure from Pentium Pro, and I am inclined to say the same. Also I'd think modifying the core is a far widespread change than adding some uncore and deleting some cache blocks. Nehalem was a good server chip not entirely due to the IMC/QPI but because the good core was hobbled by the lack of it. I'd bet Prescott being on an IMC/QPI wouldn't have been so impressive.
The lack of core changes were also why the initial impact on consumer applications were not so good. Single thread without Turbo were often only about 5% faster.
Because you might as well say every current chip today is based on Intel's 8086 chip. At what point do we talk about not being based off anymore?
You do understand that you compare 4c+HT against 4c only and get ~33% gain from HT + IPC improvement, entirely in-line with SMT scaling in Cinebench and ~10% ST improvement per clock.Which is why you see ridiculous gains in performance (at the same clock speed!) over C2Q in multithreaded benches:
Yeah, 4 more threads certainly change a lot of performanceIf a change can make that much difference to the performance of the CPU, I'd say that it was pretty significant Sure, the core didn't change much, but the uncore is just as important.