Originally posted by: IntelUser2000
- Memory technology (because Rambus left the desktop memory market) was stagnant and could not provide the necessary banwidth to feed the Pentium-4
- The shared FSB that could also not feed the Pentium-4.
- Too much leakage and bad design on Prescott that prevented high scaling
Oh right, then what was up with the 3.46GHz EE with 1066MHz FSB not more than 2% faster than its 800MHz brethren?? Shared FSB is only apparent for SMP systems!!!
The Itanium-2 Monecito has a mere 10 stage long pipeline, but also carries its own SMT.
That's not really a valid argument as Itanium 2 Montecito core has SoEMT, not SMT, which is primarily for hiding memory latency, by switching threads when necessary. Itanium 2 by the way, has 8 stage pipeline not 10.
With longer pipelines (and higher clockspeeds), the latency introduced by HT is a relatively trivial amount and HT gains much more than it loses.
Contrarily, with a shorter pipeline latency and efficiency are everything! The gains from SMT are far outweighed by the latency penalty.
Latency as in.... You are being very general here.
HT is not the god send you are making out to be. It does nothing, and even hinders performance on efficient processors (ie: Short Instruction Pipelines).
Oh really?? You would know it would hinder on short pipelined processors because...? Only other real SMT implementation is IBM's Power, which takes considerably more die size, and is more advanced than Intel's HT. Add to the fact that IBM's Power 5 is VERY wide, and they are not comparable.
HT IS the god send in terms of efficiency. Contrary to what people are claiming, single thread performance was basically equal in most apps. Since it only adds less than 5% to die size, giving potential 30%, and average 5%, is VERY efficient. Which other technology does that??