Gamingphreek
Lifer
- Mar 31, 2003
- 11,679
- 0
- 81
although HyperThreading was just a substitute and short lived until Dual Cores were released.
Hyper Threading was no substitute. It was a "We have to cover our asses move".
-Kevin
although HyperThreading was just a substitute and short lived until Dual Cores were released.
Originally posted by: Gamingphreek
although HyperThreading was just a substitute and short lived until Dual Cores were released.
Hyper Threading was no substitute. It was a "We have to cover our asses move".
-Kevin
Originally posted by: FelixDeKat
AMD is taking the right stance - if you cant beat 'em, copy 'em! :laugh:
Originally posted by: munky
Originally posted by: FelixDeKat
AMD is taking the right stance - if you cant beat 'em, copy 'em! :laugh:
Ah, so we can expect from AMD a 30-stage pipeline Netburst abomination that heats up your whole house any day now, right?
Originally posted by: munky
Originally posted by: FelixDeKat
AMD is taking the right stance - if you cant beat 'em, copy 'em! :laugh:
Ah, so we can expect from AMD a 30-stage pipeline Netburst abomination that heats up your whole house any day now, right?
Originally posted by: Gamingphreek
although HyperThreading was just a substitute and short lived until Dual Cores were released.
Hyper Threading was no substitute. It was a "We have to cover our asses move".
-Kevin
Originally posted by: RichUK
[
I was referring to AMD?s ?actual? implementation of new technology, aka an on die memory controller and the use of HT links, and therefore not carrying on using inferior data transport models, such as a single I/O bus for all communication to the CPU, namely the FSB! Yes Intel may patent quite a few technologies but that doesn?t mean they go ahead and implement them. AMD are innovative as they implemented these technologies and have been leading by example, not following behind in Intel?s foot steps.
Originally posted by: Madwand1
Why do these threads have to always degenerate into Intel vs. AMD wars? More intesting is how this is actually supposed to work, the role of the compilers, and the role of the developers, and what performance gains may be seen, what performance costs.
Originally posted by: FelixDeKat
Originally posted by: Gamingphreek
although HyperThreading was just a substitute and short lived until Dual Cores were released.
Hyper Threading was no substitute. It was a "We have to cover our asses move".
-Kevin
No Kevy,
It was revolutionary. Hyperthreading was extraordinary. It allowed Intel to "Leap Ahead" at the time allowing multiple threads on a single core. Amd couldnt touch it.
No, HT and SMT in general is revolutionary. Conroe based derivatives WILL have some kind of HT implementation in the future, as will most CPU's.
Like it or not, Pentium-4 was well ahead of its time, in terms of CPU technology. The main backdraws where:
- Memory technology (because Rambus left the desktop memory market) was stagnant and could not provide the necessary banwidth to feed the Pentium-4
- The shared FSB that could also not feed the Pentium-4.
- Too much leakage and bad design on Prescott that prevented high scaling.
Researchers at IBM concluded the "optimal" pipeline length is 50 (yes 50). Researchers at Intel concluded the "optimal" pipeline length to be 30.
Originally posted by: Gamingphreek
Revolutionary in the sense that it was a cover up. HT is merely a cover up for an extremely long instruction pipeline. It is revolutionary in that it effectively increases performance on a long pipeline.
HT is not the god send you are making out to be. It does nothing, and even hinders performance on efficient processors (ie: Short Instruction Pipelines).
-Kevin
It makes up performance for any CPU that that has execution resources not fully utilized from a single thread, which happens in nearly all applications.Originally posted by: Gamingphreek
Revolutionary in the sense that it was a cover up. HT is merely a cover up for an extremely long instruction pipeline. It is revolutionary in that it effectively increases performance on a long pipeline.
You don't know that but if the K8 was so efficient, it wouldn't need an IMC. But it does, in order to reduce the amount of time the core is stalled waiting for data. HT does something similar, reduces the amount of time the core is stalled by allowing it work on another thread while its waiting for data. If HT and SMT only worked on long pipeline architecture, why do all three major server CPU manufacturer, IBM, Intel and Sun already have or are introducing multi-threading processors?HT is not the god send you are making out to be. It does nothing, and even hinders performance on efficient processors (ie: Short Instruction Pipelines).
I'll give a very easy counter-arguement to debunk that myth. The IBM Power4's have shorter pipelines, and yet carry their own SMT. The sun Niagara is also a very short pipelined CPU, and it also carries its own SMT. The Itanium-2 Monecito has a mere 10 stage long pipeline, but also carries its own SMT. SMT is not a "patch" for long pipelined CPU's.
You don't know that but if the K8 was so efficient, it wouldn't need an IMC. But it does, in order to reduce the amount of time the core is stalled waiting for data. HT does something similar, reduces the amount of time the core is stalled by allowing it work on another thread while its waiting for data. If HT and SMT only worked on long pipeline architecture, why do all three major server CPU manufacturer, IBM, Intel and Sun already have or are introducing multi-threading processors?
Originally posted by: dexvx
Originally posted by: Gamingphreek
Revolutionary in the sense that it was a cover up. HT is merely a cover up for an extremely long instruction pipeline. It is revolutionary in that it effectively increases performance on a long pipeline.
HT is not the god send you are making out to be. It does nothing, and even hinders performance on efficient processors (ie: Short Instruction Pipelines).
-Kevin
HT/SMT covering up for a long pipelined Pentium-4 is a myth.
I'll give a very easy counter-arguement to debunk that myth. The IBM Power4's have shorter pipelines, and yet carry their own SMT. The sun Niagara is also a very short pipelined CPU, and it also carries its own SMT. The Itanium-2 Monecito has a mere 10 stage long pipeline, but also carries its own SMT. SMT is not a "patch" for long pipelined CPU's.
HT/SMT will also not hinder performance (beyond the spread of error) on single threaded applications when properly implemented. The Pentium-4 in its entirity is just a first-generation implementation of SMT by Intel. The main causes of single threaded degraded performance came from cache-thrashing in its very small L1, along with a small L2 in the Northwood/Foster core. Since Conroe is a NGMA, it will take sometime before it receives its own version of HT.
SMT and thread-level parallelism is the future.
I advise you to do some reading. Each pipeline stage is operating on different instructions at once. There's no difference in the theoretical throughput of a 1 stage pipeline and a 100 stage pipeline. The relative latency to execute an instruction will be higher for the 100 stage pipeline, but then it will also clock higher.Originally posted by: Gamingphreek
Do you even know what you are talking about or are you just feeding off of devx.
First off i never said that you COULDN'T use HT on a short core it just doesn't make sense. Apprarently you dont know how it works.
In a long pipeline one packet is sent through. It take so long to go through that much of the pipeline is not working and is idle. Additionally, if there were a cache miss then it would have to do the same thing over again. HT staggers packets and sends another behind the first packet. Therefore the pipeline is working as close to theoretical as possible all the time.
Depends, the high pipeline design can usually clock faster and while it may take more cycles to recover, each cycle will take less absolute time.Additionally, a cache miss or a branch misprediction is MUCH MUCH less costly on a short pipeline.
It's quite clear you don't understand HT.If you understand HT this is common knowledge...
Conre is "efficient", but it's also very powerful with a large number of integer and FP execution resources. This also works well with SMT.Originally posted by: RichUK
Yes it is a useful technique to implement for the right sort of processor (netburst derivative), but not a technique that is needed to help excel an already efficient design, ?Conroe? for example.
Originally posted by: Accord99
Conre is "efficient", but it's also very powerful with a large number of integer and FP execution resources. This also works well with SMT.Originally posted by: RichUK
Yes it is a useful technique to implement for the right sort of processor (netburst derivative), but not a technique that is needed to help excel an already efficient design, ?Conroe? for example.
Originally posted by: Gamingphreek
None of which are x86.
You cannnot compare the two.
Originally posted by: Gamingphreek
Do you even know what you are talking about or are you just feeding off of devx.
First off i never said that you COULDN'T use HT on a short core it just doesn't make sense. Apprarently you dont know how it works.
Originally posted by: Gamingphreek
In a long pipeline one packet is sent through. It take so long to go through that much of the pipeline is not working and is idle. Additionally, if there were a cache miss then it would have to do the same thing over again. HT staggers packets and sends another behind the first packet. Therefore the pipeline is working as close to theoretical as possible all the time.
In a short pipeline the pipeline is in use most of the time. There would be no point in trying to jam another packet in there; it would only delay the other packets. Additionally, a cache miss or a branch misprediction is MUCH MUCH less costly on a short pipeline.
If you understand HT this is common knowledge...
Originally posted by: dexvx
Apparently, you have no idea how SMT works. Whether its long or short pipelined makes no difference in SMT ideology.
