Originally posted by: Nemesis 1
Originally posted by: Viditor
Originally posted by: Nemesis 1
Penryn yorkfield will be up to 150% faster than C2Q/ Links are posted in the penryn up and running thread.
Could you repost them?
I couldn't find anything that said that...
If you read this.
http://www.intel.com/technology/magazine/research/speculative-threading-1205.htm
The Mitosis system has been designed to optimize the trade-off between software and hardware to exploit speculative thread-level parallelism.
The Results
To illustrate the performance potential of the Mitosis compiler, let's look at a subset of the Olden benchmark suite. Olden benchmarks are pointer-intensive programs that make it difficult for automatic parallel compilers to extract any thread-level parallelism.
Figure 3. In this graph, the blue bars show the performance improvement going from an in-order to an out-of-order
core with about twice the amount of resources. The red bars indicate the performance of a processor with perfect
memory, illustrating the potential performance improvement for any technique that targets simply reducing memory latency. The yellow bars show the performance gains that result when using Mitosis with a four-core processor.
The results obtained by the Mitosis compiler/architecture for this subset of the Olden benchmarks are very encouraging, outperforming single-threaded execution by 2.2x. When compared with a big out-of-order core, the speed increase is close to 2x. One can also see that the benefits of Mitosis do not come only from reducing memory latency?using Mitosis enables the system to outperform an ideal system with perfect memory by about 60 percent. Overall, this work demonstrates that significant amounts of thread-level parallelism can be exploited in irregular codes, with a rather low overhead in terms of extra (wasted) activity.
Wow, you pulled out a Mitosis article as evidence? Penryn is pretty much a Conroe with more cache and SSE4 and maybe hyperthreading (yay, 8 thread through a single socket... this is kind of getting ridiculous). I'd expect most of Penryn's performance benefits will come not from core improvements but from simple clockspeed hikes.
If Intel's metal gates and high-K dielectrics work as well as Intel claims at very high clocks (it's easier to lower leakage at lower clocks, after all) then a 50% performance increase (through a clock hike and the few improvements Intel will actually throw into it) could be doable at similar TDPs.