Guess the people who said Pascal was Maxwell on 16FF+ were dead wrong.
Lol, is that your takeaway from the info?
Pascal still is Maxwell+ on 16FF. What's the plus? FP64, mixed-precision FP16, NVLink, HBM2.
This is what NV says about GP100 in that blog post: "Tesla P100: Built for HPC and Deep Learning"
300W TDP is actually going backwards. AS EXPECTED, because they tore out all the power-hungry features in Maxwell to make it a gaming-focused chip, and now Pascal needs to put those back in to compete in the HPC market.
This is actually the biggest change in terms of gaming performance:
GP100’s SM incorporates 64 single-precision (FP32) CUDA Cores. In contrast, the Maxwell and Kepler SMs had 128 and 192 FP32 CUDA Cores, respectively. The GP100 SM is partitioned into two processing blocks, each having 32 single-precision CUDA Cores, an instruction buffer, a warp scheduler, and two dispatch units. While a GP100 SM has half the total number of CUDA Cores of a Maxwell SM, it maintains the same register file size and supports similar occupancy of warps and thread blocks.
For those who have paid attention to the talk of wavefronts and warp sizes in game engines: each SM, under the control of its instruction buffers, schedulers and dispatch units, now only has 64 CC to feed, instead of 128 (Maxwell) and 192 (Kepler).
This is GCN-like (a GCN CU also has 64 lanes) and means that a wavefront-sized group of 64 threads, as used by console-optimized engines, fills a GP100 SM exactly and will instantly hit peak CC utilization.
In practical terms this means much less room for wasted CCs if the game engine is poorly tuned for the warp/wavefront size. I'm going to call it now: Pascal will perform great in GCN-optimized game engines, better than Maxwell and vastly better than Kepler.
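To put rough numbers on that argument, here is a toy model in Python. It is only a sketch of the reasoning above, not of how the hardware really schedules: the function name `lane_utilization` is made up for illustration, and it assumes a single group of 64 threads is the only work on the SM, with nothing else filling the idle lanes. Real SMs keep many warps in flight, so this overstates the effect.

```python
import math

# FP32 CUDA cores per SM, as quoted from the NV blog post above.
SM_CORES = {"Kepler": 192, "Maxwell": 128, "GP100": 64}


def lane_utilization(group_size: int, cores_per_sm: int) -> float:
    """Fraction of an SM's FP32 cores kept busy by ONE group of threads.

    Toy assumption: the group is issued in rounds of `cores_per_sm` lanes
    and no other work is available to fill lanes left idle.
    """
    rounds = math.ceil(group_size / cores_per_sm)
    return group_size / (rounds * cores_per_sm)


if __name__ == "__main__":
    for name, cores in SM_CORES.items():
        # A group of 64 threads, i.e. one GCN-sized wavefront.
        print(f"{name}: {lane_utilization(64, cores):.0%}")
```

Under those assumptions a lone group of 64 keeps a third of a Kepler SM busy, half of a Maxwell SM, and all of a GP100 SM. That is the worst case, but it shows the direction of the change.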
On paper the FP32 increase is modest, GM200 at ~7 TFLOPS vs GP100 at ~10.6 TFLOPS (about 1.5x), but the change above means each paper-spec flop is worth more for gaming due to improved CC utilization (for those under the impression Maxwell was already at 100% utilization, lololol).
While some were expecting a doubling of performance, it's not going to happen given how HPC-focused GP100 has to be, compared to GM200, which was made for gaming.
IMO, expect a ~60% improvement in gaming, higher if games are GCN-optimized, which is actually good news because most AAA titles are already console-optimized. We could well see it performing ~80% faster in most AAA games with the console effect in full swing. Not too bad for an HPC-focused chip!
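For anyone who wants to check the arithmetic behind those guesses, a quick Python sketch follows. The TFLOPS figures are the approximate ones from this post, the 60% and 80% targets are my estimates rather than measurements, and `implied_efficiency_gain` is just a helper name invented here.

```python
# Approximate peak FP32 throughput used in this post (TFLOPS).
GM200_TFLOPS = 7.0
GP100_TFLOPS = 10.6


def implied_efficiency_gain(target_speedup: float,
                            old_tflops: float = GM200_TFLOPS,
                            new_tflops: float = GP100_TFLOPS) -> float:
    """How much more each paper flop must deliver to reach `target_speedup`.

    Assumes gaming performance = paper FLOPS x per-flop efficiency,
    which is a simplification (bandwidth, ROPs, drivers are ignored).
    """
    paper_ratio = new_tflops / old_tflops
    return target_speedup / paper_ratio


if __name__ == "__main__":
    print(f"paper FP32 ratio: {GP100_TFLOPS / GM200_TFLOPS:.2f}x")
    for target in (1.6, 1.8):  # the ~60% and ~80% guesses
        gain = implied_efficiency_gain(target)
        print(f"{target:.1f}x overall needs {gain:.2f}x per-flop efficiency")
```

So the ~60% guess only needs each flop to be worth roughly 6% more than on GM200, and the ~80% case needs roughly 19% more, on top of the ~1.5x paper increase.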