Depends how the new GCN turns out, but they have actually got Hyper-threading for SPs. For REAL!
http://forums.anandtech.com/showpost.php?p=38154409&postcount=19
^ There's a patent paper there for next-gen GCN. Take some time to read it, it's mind blowing stuff.
On paper, there's potential for 4x the throughput for each SP. Though I suspect that's under perfect scenario, but still, x1 to x2 (game load dependent) per SP performance vs older GCN SP is there on the table.
Polaris GCN has gone wide with each SP being able to run multiple threads in parallel, a feat that's pretty crazy when you realize the amount of synchronization it requires to keep the hardware scheduler aware of each ALU uptime, to keep the warp scheduler keeping it busy.
There's also SP independent power gating and clock boost, so if an SP is only running one thread, it will auto boost to finish the task quicker.
Insane changes TBH, more than I expected.