AtenRa, I picked the performance option myself, but let's face the truth. If Intel could cram 50% more performance into a given TDP, they would do so, if only for the desktop market.
Instead, core count is limited by Amdahl's law, core size is limited by the law of diminishing returns, and clock speeds are limited by both power usage and heat density barriers. Intel has dedicated a lot of funding and time to maximizing IPC and introducing new instructions where possible.
What more can they really do? All the other options have significant drawbacks. How would they deliver 50% more performance without either an increase in cores, which only benefits multithreaded code (not ubiquitous), or an increase in clock speeds, which requires expensive cooling and power delivery?
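To put rough numbers on the core count point, here is a quick sketch of Amdahl's law. The 4 to 8 core jump and the parallel fractions are just assumptions I picked for illustration, not measurements of any real chip or workload, and the function names are my own.

```python
def amdahl_speedup(parallel_fraction: float, cores: int) -> float:
    """Speedup over a single core when only parallel_fraction of the work scales."""
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / cores)


def gain_from_more_cores(parallel_fraction: float, old_cores: int, new_cores: int) -> float:
    """Relative speedup from going from old_cores to new_cores."""
    return amdahl_speedup(parallel_fraction, new_cores) / amdahl_speedup(
        parallel_fraction, old_cores
    )


if __name__ == "__main__":
    # Hypothetical workloads: 50%, 75% and 90% of the run time is parallelizable.
    for p in (0.5, 0.75, 0.9):
        gain = gain_from_more_cores(p, old_cores=4, new_cores=8)
        print(f"parallel fraction {p:.2f}: 4 -> 8 cores gives {gain:.2f}x")
```

Under these assumptions, doubling the cores only gets close to a 50% gain when roughly 90% of the work is parallel. At 50% parallel the gain is closer to 11%.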
I want what you want. I just don't see how it is possible with current CPU tech.