Wrong.Register file size is same per SM as pascal GP100 or maxwell.
Yes it is. But the amount of cores that have access to this "pool" of data is lower in GP100 chip than any Maxwell/Consumer Pascal GPU. Similar situation is with Kepler vs Maxwell.
Kepler 192 cores/256 KB RF Size.
Maxwell - 128 Cores/256 KB RF Size.
GP10X - 128 cores/256 KB RF Size.
GP100 chip - 64 cores/256 KB RF Size.
GV100 - 64 cores/256 KB RF Size.
That is why you get increase in performance in Nvidia GPUs. The cores are "less starved" for resources with each generation.
I have to say. Right now I am a bit staggered. I have looked in the wrong part of the diagram, after all.
There may be no difference in FP32 performance in GV100 compared to GP100 chip, clock for clock, core for core. It has the same 256 KB available to the same 64 cores as are in GP100.
It will be actually interesting to observe the performance of GV100 chip. It appears that there was a point why Nvidia demoed today only DL and nothing else. No word on gaming, FP32 improvement, nothing else. It appears that GV100 only architectural improvements may be in DL and in scheduling, but not overall throughput of the GPUs.