Power efficiency is achieved by the manufacturing process itself. 2.5x more performance per watt translates directly to Samsung and GloFo's claim of 60% power reduction.
Source:
http://www.samsung.com/semiconductor/foundry/process-technology/14nm/
So three things stand out:
1. A 50% die area reduction means that a Fiji die would be 298mm2 instead of the 596mm2 it is on the 28nm node.
2. Clock speeds improve by around 50% under 14LPP, so a 1,050 MHz Fiji die could be clocked to 1,575 MHz.
3. So if you take a 50% die reduction at 60% power reduction with 50% improved clocks, you get about 2.5x performance per watt without any architectural changes to Fiji (the percentages don't simply add up: 60% less power for the same work is 1/0.4 = 2.5x).
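A quick sketch to check the arithmetic in the three points above. It takes the quoted 14LPP figures at face value and assumes the 60% power saving is measured at equal performance; that baseline is an assumption, and the function names are just illustrative.

```python
# Fiji baseline figures as quoted in the post.
FIJI_DIE_MM2 = 596.0
FIJI_CLOCK_MHZ = 1050.0

# 14LPP vs 28nm scaling factors as quoted (vendor claims, not measurements).
AREA_REDUCTION = 0.50
POWER_REDUCTION = 0.60   # assumed to be at equal performance
CLOCK_GAIN = 0.50


def shrunk_die_area(area_mm2, reduction=AREA_REDUCTION):
    """Die area after a straight shrink with no architectural changes."""
    return area_mm2 * (1.0 - reduction)


def boosted_clock(clock_mhz, gain=CLOCK_GAIN):
    """Clock speed after applying the quoted frequency improvement."""
    return clock_mhz * (1.0 + gain)


def perf_per_watt_gain(power_reduction=POWER_REDUCTION):
    """Perf/W multiplier when the same work is done at reduced power."""
    return 1.0 / (1.0 - power_reduction)


if __name__ == "__main__":
    print("Die area (mm2):", shrunk_die_area(FIJI_DIE_MM2))
    print("Clock (MHz):", boosted_clock(FIJI_CLOCK_MHZ))
    print("Perf/W gain:", round(perf_per_watt_gain(), 2))
```

The perf/W figure comes from the power reduction alone; whether the clock gain can be had at the same time as the full power saving is not something the quoted numbers settle.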
So the "new" components are performance-oriented tweaks rather than power reduction tweaks.
Basically, Baffin XT could have these specs:
3,200 stream processors clocked at 1.35GHz in 50 CUs
64 ROPs clocked at 1.35GHz
200 Texture Units clocked at 1.35GHz
4 New Polaris Geometry "Processors" (GCN had Units)
4,096-bit Memory Interface on an improved controller
4GB HBM
232mm2 die
And easily outperform both a Fury-X and a GTX 980 Ti. How?
- 3,200 stream processors at 1.35GHz = 8.64 TFLOPS (theoretically the same as Fiji but with improved shader efficiency)
- 64 ROPs at 1.35GHz = 86.4 GPixels/s (vs 67 for Fiji)
- 200 TMUs at 1.35GHz = 270 GTexels/s (about the same as Fiji)
- Improved geometry culling, a.k.a. "Primitive Discard Acceleration" (Conservative Rasterization requires this)
- Better memory throughput from the new controller and memory compression
=
Better than Fury-X performance at less than half the power consumption.
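The theoretical numbers above can be reproduced with a few lines. This is only a sketch: the Baffin XT specs are the speculative ones from this post, the Fiji figures are the Fury-X specs (4,096 shaders, 64 ROPs, 256 TMUs at 1.05GHz), and peak throughput assumes 2 FLOPs per shader per clock and one pixel/texel per ROP/TMU per clock.

```python
def tflops(shaders, clock_ghz):
    """Peak FP32 TFLOPS, assuming 2 FLOPs (one FMA) per shader per clock."""
    return shaders * 2 * clock_ghz / 1000.0


def gpixels(rops, clock_ghz):
    """Peak pixel fill rate in GPixels/s (one pixel per ROP per clock)."""
    return rops * clock_ghz


def gtexels(tmus, clock_ghz):
    """Peak texel fill rate in GTexels/s (one texel per TMU per clock)."""
    return tmus * clock_ghz


# Fury-X (Fiji) specs, and the speculative Baffin XT specs from this post.
fiji = {"shaders": 4096, "rops": 64, "tmus": 256, "clock_ghz": 1.05}
baffin_xt = {"shaders": 3200, "rops": 64, "tmus": 200, "clock_ghz": 1.35}

if __name__ == "__main__":
    for name, gpu in (("Fiji", fiji), ("Baffin XT (speculative)", baffin_xt)):
        clk = gpu["clock_ghz"]
        print(name,
              round(tflops(gpu["shaders"], clk), 2), "TFLOPS,",
              round(gpixels(gpu["rops"], clk), 1), "GPixels/s,",
              round(gtexels(gpu["tmus"], clk), 1), "GTexels/s")
```

These are peak rates only; they say nothing about the shader efficiency, culling or memory compression gains listed above.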
But Polaris won't have these features?
Yes it will.
What is instruction pre-fetch? Fetching instructions from memory and placing them into cache. Basically as stated by the AMD engineer in the video linked above...
Since DX12 isn't single-threaded, he's evidently referring to DX11 performance. Where would you place a Command Buffer? In the Command Processor of course. Hence "new".
And if you look here:
http://wccftech.com/amd-radeon-r9-400-gpus/
You see that an AMD engineer worked on a 232mm2 die, that shipping manifests are showing a Baffin XT with HBM, and that the pricing puts Baffin XT in the ballpark of an R9 390X replacement.
This isn't Greenland, this is Baffin XT. Greenland XT could therefore be a Titan-X class ($999) GPU.
Baffin/Baffin XT are likely both based on Polaris 11.
Greenland and Greenland XT are likely both based on Vega 11.
This leaves Polaris 10 and Vega 10 open to other SKUs.
I think that AMD will be targeting Pascal like this:
Baffin vs GTX 970 successor $349
Baffin XT vs GTX 980 successor $499
Greenland vs GTX 980 Ti successor $650
Greenland XT vs Titan-X successor $999