Originally posted by: Idontcare
I see no reason why not...but technically accomplishing this is not dependent on i7 nor QPI. It could be done with any existing hardware configuration.
The rate-limiting process for deploying the technology is software, not hardware.
Generally speaking there are two "things" that limit scaling (performance) of multi-threaded applications.
One is the percentage of overall computations that can be performed in parallel. (referred to as "percentage of serial code" as in "this is a highly serial code" or "this code is trivially highly parallelized")
The second has to do with the inter-relatedness of the threads, i.e. how frequently does one thread need to check on the results of another parallel thread to determine the impact on its next processing step. (referred to as "grained" as in "this code is fine-grained" or "this code is coarse-grained")
So the way it works is the relative percentage of serial code determines the limits to which you can ever expect your performance to increase as you add more and more threads (cores, processing power, etc) to your system. See Ahmdahl's law.
Even with infinitely fast inter-processor communications the speedup won't be 1:1 because the code itself can never be 100% parallelized, there will always be some non-zero percentage of serial code present if for no other reason than the process of thread spawning and re-assembling the results from the threads is intrinsically a serial computation.
In addition to the limits imposed by the percentage of serial code present in the application you still have the question of how fast can information be communicated between the threads? (between the cores) This is where QPI and HT come in, they are superior to their predecessors but this extra speed is really only needed for those so-called fine-grained applications.
A course-grained application (pov-ray, cinebench, etc) won't be rate limited by interprocessor communications so an improvement in this area won't be reflected in a performance improvement at the application level.
With that in mind - would Larrabee help improve the performance of applications? Yes...but you really are going to be dependent on the specific application having been coded to be ridiculously parallel (>95% parallel code).
Will QPI make a difference? depends on how fine-grained the specific computations turn out to be and that is very application specific. The extra bandwidth won't hurt, that's for sure.