Any volta rumors in this thread or just business talk?![]()
Agree -- thread cleaned; get this discussion back on topic.
-- stahlhart
Any volta rumors in this thread or just business talk?![]()
With SK Hynix confirmed they're entering volume production of GDDR6 ram for a high end graphics card early 2018. 12 nm FinFET is scheduled to hit mass production before the year is over.
Both those together suggests an Q1 2018 launch for Volta unless nvidia sits on it. It could be that TSMC's "new 12nm node" is just a rehash of their 16nm node not necessarily a die shrink. Then again, it could very well be 12nm is taped out early Q1 2018, with decent supplies starting at Q2 2018
They got ~50% from V100 vs P100, so something of that magnitude is likely for the 'normal' cards.
Always good to have multiple sources of information to correlate the data.So does Anand, and it's more detailed:
http://www.anandtech.com/show/11367...v100-gpu-and-tesla-v100-accelerator-announced
Consumer Volta could be taped out on 12 nm FFN by now, and they are working toward an early 2018 release.
They got 50% more TFLOPs. Im wondering if it is possible 1024 CUDA cores on GV107, with this improvement: 1024 CUDA cores, 1.5 GHz = 3 TFLOPs. 50% more than GTX 1050 Ti.They got ~50% from V100 vs P100, so something of that magnitude is likely for the 'normal' cards.
GTX 980 Ti is on average 20% faster than GTX 1060, in new games. Sometimes the gap between both of them is smaller, around 7-10%. It depends on game and settings.GTX 1060 is close to GTX 980. Step down Volta probably is close to GTX 980 as well but not the Ti. GTX 2070 should be really interesting... If it can do the same what GTX 1070 did.
Different situation. That was architectural changes and massive node shrink. At best they can do similar improvement than what they did with Maxwell but I somewhat doubt that they can pull improvement like that again.GTX 980 Ti is on average 20% faster than GTX 1060, in new games. Sometimes the gap between both of them is smaller, around 7-10%. It depends on game and settings.
For example in BF1 GTX 980 Ti is 15% faster than GTX 1060. Let GTX 2050 Ti be 10-15% faster than GTX 1060, maintaining the same performance gap between GTX 960 and GTX 1050 Ti, and you are looking at that performance level.
Maxwell increased the throughput of cores in CUDA architecture, thanks to shift from 192 cores/256 KB Register File size, to 128 core/256 KB Register File Size. Consumer Pascal maintained this layout, hence the no difference in core for core, clock for clock performance.Different situation. That was architectural changes and massive node shrink. At best they can do similar improvement than what they did with Maxwell but I somewhat doubt that they can pull improvement like that again.
If they do manage to get 50% thats amazing. And ill be one of the first in line for one!
Yeah. I don't think anyone should expect that. That is how you end up with Vega level disappointment. People read about new Vega features and start assuming IPC increases of 30%, and then it gets delivered and no IPC gains materialize...
New features these days are more likely to show improvements in some games, but not big across the board increases.
The real across the board increases will likely depend a lot on Core count increases (or clock speed increases), and if using 12nm FFN, there isn't much density increase, so core count increases would depend on making chips bigger. Making them 50% bigger doesn't seem to be in the cards.
So don't expect 50% increases across the board. If that turns up it would be a pleasant surprise, but I certainly wouldn't expect it.
Nvidia have consistently delivered 50% or greater from Kepler-Maxwell-Pascal. I see no reason they cannot do so again. Maxwell chips were also larger than Kepler in addition to higher IPC in order to get that 50%, so they can do the same with Volta.
This is nVidia we are talking about, not RTG. Past performance may not necessarily predict the future... but given their R&D budget and the capability to make a gaming-focused design in addition to their HPC/datacenter designs, Volta is not going to be another Vega.
Here's the thing... I'm not sure if the current Volta whitepaper applies to consumer version. For example NVIDIA will remove tensor cores for sure. Who knows what other things they will cut. For that reason I am a bit conservative with my expectations.Maxwell increased the throughput of cores in CUDA architecture, thanks to shift from 192 cores/256 KB Register File size, to 128 core/256 KB Register File Size. Consumer Pascal maintained this layout, hence the no difference in core for core, clock for clock performance.
Pascal GP100 and Volta GV100 have 64 cores/256 KB Register File Size. So if Nvidia will use this layout, we will see again the same performance increase as we have seen with Maxwell versus Kepler.
However. Volta is much more advanced layout, and will have much better utilization, and IPC than even Pascal using the same core/RFS layout as Volta, because it is not as advanced.
From what Sweeper has touted, GV104 will score somewhere between 16000 and 17000 pts in 3dMark Fire Strike Extreme. This means GV104 will be 65% faster than GP104 has been.
I don't think its possible to pull off this type of improvement without changing the layout of the architecture, on similarly sized process node.
Even reusing GP100 chip architecture will bring 30-40% performance increase at the same clock speeds and core counts. Using Volta - the possibilities are bigger, because of improved cache, separate FP and INT cores, improved scheduling, massively, and the partitioning of each SM into 4 portions which improves load balancing, and scheduling even further.Here's the thing... I'm not sure if the current Volta whitepaper applies to consumer version. For example NVIDIA will remove tensor cores for sure. Who knows what other things they will cut. For that reason I am a bit conservative with my expectations.
Even reusing GP100 chip architecture will bring 30-40% performance increase at the same clock speeds and core counts. Using Volta - the possibilities are bigger, because of improved cache, separate FP and INT cores, improved scheduling, massively, and the partitioning of each SM into 4 portions which improves load balancing, and scheduling even further.
Considering there are rumors about bigger die size of GV104 compared to GP104, there is a chance we will see actual Volta architecture.
And one thing that could confirm this also is the rumor that consumer GPUs have to have Tensor cores for compatibility reasons(writing and testing software would that way had to be done on GV100 chips, if Nvidia would remove tensor cores completely).
I think what we can see from Nvidia is this, as it goes for GDDR VRAM in GPUs:
GV104 - GDDR6, 256 Bit, 16 GB
GV106 - GDDR5X, 256 Bit, 8 GB
GV107 - GDDR5X 192 Bit, 6 GB.
I assumed that increase in bus width is what is required to properly fed GV architecture, and each generation Nvidia increases the amounts of VRAM available in specific price tiers, and also increased the memory bus width.I am quite sure that Volta architecture will power the next gen Geforce stack. I think this is how it could turn out
GV102 - 384 bit GDDR6 at 14-16 Gbps - 672 - 768 GB/s
GV104 - 256 bit GDDR6 at 14-16 Gbps - 448 - 512 GB/s
GV106 - 192 bit GDDR5X at 11 Gbps - 264 GB/s
GV107 - 128 bit GDDR5X at 11 Gbps - 176 GB/s
Nvidia would want to keep the memory bus at the same sizes as Pascal as that would allow them to keep memory I/O power and board costs under control. Thats the reason I do not see Nvidia increase memory bus width for GV106/GV107. Volta is shaping up to be a true powerhouse and could go down as one of the most successful and forward looking GPU architectures ever after the legendary G80.