Originally posted by: Idontcare
Originally posted by: JackyP
I just offered another take on this problem, I don't even know if you disagree with my methodology. Maybe you really don't care.
I don't disagree with your methodology, but it isn't necessary to answer the question I am personally interested in. Sure if we could answer the questions you pose then I'd like to have that analyses as well, but I view it secondary to my current questions about Nehalem's SMT implementation.
Originally posted by: JackyP
I thought efficacy must always be seen in relation to die size or power draw.
And you would be correct if the objective were to understand the efficacy of SMT relative to die size or power draw.
While such info would be nice to know, I am primarily interested in understanding how efficient SMT is in truly emulating a full core in the absence of SMT.
Whether it does a good job at this task, emulating a full-blown core, or not is of relevance to me and anyone who pays per thread (vs per socket) for their apps or is currently creating multi-threaded code and needs to prioritize development resources to balance threading versus timeline to release.
The efficacy I am referring to is in ability to emulate another core in thread processing capability and not so much to answer a question of whether it does a shitty job but a justifiably shitty job because it only adds a paltry amount of xtors to the core or watts to the power consumption.
Those numbers would be great to have as well for the enthusiast/geeky side of me, but the business side needs to know is 5 threads on an i7 are going to scale as expected based on the performance scaling of 1 thru 4 threads. The Euler benchmark data discussed earlier does in fact suggest SMT is nearly hardware equivalent to emulating full-blown cores. More thread scaling data from more applications would be whats needed here. (easily generated by setting application affinity in order to exclude cores in sequential fashion)
I'm sure knowing SMT's power footprint or xtor footprint so we can create performance/footprint numbers would be nice too.