32 ROPs is the new rumour?
ROPs and bandwidth are very important. You cannot downplay this unless AMD has made very, very large strides over Tonga.
Let me demonstrate how a shortage of ROPs and bandwidth can cripple a card.
AMD claims Tonga has improved ROPs over earlier GCN, as well as memory compression. Despite these GCN 1.2 improvements, the older GCN 1.1 Hawaii beats the 380X by more than its compute advantage alone would suggest, precisely because of ROPs and bandwidth.
ASUS 380X: 1030MHz, 32 CUs
290 reference: 947MHz, 40 CUs
This is only a 15% TFLOP advantage for the 290 over the factory OC 380X.
Yet the 290 is 27% faster:
https://tpucdn.com/reviews/ASUS/R9_380X_Strix/images/perfrel_2560_1440.png
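For anyone who wants to check the 15% figure, here is a rough sketch of the arithmetic. It assumes the usual GCN layout of 64 stream processors per CU and 2 FLOPs per processor per clock (one fused multiply-add); those two constants and the function name are my own additions, not something taken from the review.

```python
# Rough peak-FP32 estimate for a GCN card.
# Assumptions (not from the review): 64 stream processors per CU,
# 2 FLOPs per stream processor per clock (one FMA).
SHADERS_PER_CU = 64
FLOPS_PER_CLOCK = 2


def tflops(cus: int, clock_mhz: float) -> float:
    """Peak single-precision TFLOPS from CU count and core clock in MHz."""
    return cus * SHADERS_PER_CU * FLOPS_PER_CLOCK * clock_mhz * 1e6 / 1e12


r9_380x = tflops(32, 1030)  # ASUS 380X factory OC
r9_290 = tflops(40, 947)    # reference 290
advantage = r9_290 / r9_380x - 1

print(f"380X: {r9_380x:.2f} TFLOPS")
print(f"290:  {r9_290:.2f} TFLOPS")
print(f"290 advantage: {advantage:.0%}")
```

That works out to roughly 4.22 vs 4.85 TFLOPS, i.e. about 15% in the 290's favour.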
Twice the ROPs and 75% more bandwidth matter. A lot. This is a clear demonstration that they give those Compute Units room to work without bottlenecks. Remember, Tonga already has improvements over Hawaii.
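To put rough numbers on that, here is a small sketch of peak pixel fill rate and of performance per TFLOP. It assumes 32 ROPs on the 380X and 64 on the 290 (the "twice the ROPs" above), and one pixel per ROP per clock, which is only the usual theoretical peak and not a measured figure. The 27% and 15% inputs are the numbers already quoted in this post.

```python
# Theoretical peak pixel fill rate, assuming one pixel per ROP per clock.
# ROP counts (32 and 64) are assumed from "twice the ROPs" in the post.
def pixel_fillrate_gpix(rops: int, clock_mhz: float) -> float:
    """Peak fill rate in gigapixels per second."""
    return rops * clock_mhz / 1000


fill_380x = pixel_fillrate_gpix(32, 1030)
fill_290 = pixel_fillrate_gpix(64, 947)
fill_ratio = fill_290 / fill_380x

# Performance per TFLOP, using the figures quoted in the post:
# the 290 is 27% faster with a 15% TFLOP advantage.
perf_ratio = 1.27
tflop_ratio = 1.15
perf_per_tflop = perf_ratio / tflop_ratio

print(f"380X fill rate: {fill_380x:.1f} GPix/s")
print(f"290 fill rate:  {fill_290:.1f} GPix/s")
print(f"290 fill rate advantage: {fill_ratio - 1:.0%}")
print(f"290 performance per TFLOP vs 380X: {perf_per_tflop - 1:+.0%}")
```

Under those assumptions the 290 has roughly 84% more peak fill rate once the lower clock is accounted for, and it extracts about 10% more performance out of each TFLOP than the newer Tonga chip.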
So, I post this to ponder whether the improvements with Polaris 10 are enough. It has more bandwidth now, and the ROPs are clocked higher, but could these limitations still be holding back the potential of AMD's new arch?