Let's not get carried away here. There is no way HBM1 benefits that much. They are still on 28nm node and the architecture is basically Tonga. 15% faster than Titan X is basically 70%+ faster than R9 290X. You are setting this card up for failure imo.
Just because you say there are no architectural improvements does not mean thats the truth. Frankly the amounts of obscene bandwidth combined with the improvements to memory bandwidth efficiency using color compression mean R9 390X would be a hugely imbalanced chip if there were no improvements to the core shader perf (per/sp). Think about it. Even assuming R9 390X has only 512 GB/s bandwidth combined with Tonga color compression which brings a 40% improved memory bandwidth efficiency it would have a >50% increase in bandwidth per sp or compute unit.
http://www.anandtech.com/show/8460/amd-radeon-r9-285-review/3
512 /320 x 1.4 = 2.24 times the effective bandwith of R9 290X. So R9 390X has more than twice the effective bandwidth of R9 290X for just 1.45x (4096/2816 = 1.45) the shaders. So effective bandwidth increase per sp is 2.24/1.45 = 1.54x more bandwidth per sp and per compute unit. Thats 50% more bandwidth per sp. What the heck is AMD gonna do with such an increase in bandwidth when we see that Hawaii is not bandwidth bottlenecked.
So you see its not HBM1 alone which brings the perf improvement. Its the actual micro architectural improvements. If Nvidia can improve perf/cc by 35% do you think its not possible for AMD to improve the perf/sp by 15-20%. Do you think the improvements to the tesselation, ROP and memory bandwidth efficiency (color compression) in Tonga were done without AMD having a design to scale shader perf and efficiency.
btw sweclockers mentioned that there are further microarchitectural improvements. They talked of tiled GCN. So why are you so confident that there are no architectural improvements to increase perf/sp, perf/sq mm and perf/watt.
https://translate.google.com/transl...ande-chip-monolitico-anche-per-amd&edit-text=
"
The third new feature is the micro-architecture. With Fiji design GCN should fully embrace the "tiled architecture" going to review the organization for ALU thread within Compute Units in order to improve workload management."
Let me give you a history of AT/AMD GPU architectures. These GPU architectures have a long life and are extremely versatile and extensible. The fundamental R600 Xenos GPU architecture found in Xbox 360 served ATI/AMD from late 2005 to late 2011. Minor tweaks were done but the underlying architecture was a phenomenally scalable architecture.
http://en.wikipedia.org/wiki/Xenos_(graphics_chip)
"The Xenos is a custom graphics processing unit (GPU) designed by ATI (now taken over by AMD), used in the Xbox 360 video game console developed and produced for Microsoft. Developed under the codename "C1",[1] it is in many ways related to the R520 architecture and therefore very similar to an ATI Radeon X1800 series of PC graphics cards as far as features and performance are concerned.
However, the Xenos introduced new design ideas that were later adopted in the TeraScale microarchitecture, such as the unified shader architecture. The package contains two separate dies, the GPU and an eDRAM, featuring a total of 337 million transistors."
http://en.wikipedia.org/wiki/TeraScale_(microarchitecture)
"
TeraScale is the codename for a family of graphics processing unit microarchitectures developed by ATI Technologies/AMD and their second microarchitecture implementing the unified shader model following Xenos. TeraScale replaced the old fixed-pipeline microarchitectures and competed directly with Nvidia's first unified shader microarchitecture named Tesla.
TeraScale was used in HD 2000 manufactured in 80 nm and 65 nm, HD 3000 manufactured in 65 nm and 55 nm, HD 4000 manufactured in 55 nm and 40 nm, HD 5000 and HD 6000 manufactured in 40 nm. TeraScale was also used in the AMD Accelerated Processing Units code-named "Brazos", "Llano", "Trinity" and "Richland". TeraScale is even found in some of the succeeding graphics cards brands."
That R600 Xenos architecture which launched with Xbox 360 in late 2005 had a life of 6 years in GPUs (8 yrs including APUs like Brazos, Llano,Trinity and Richland) and was replaced by GCN in late 2011. GCN is expected to have an even longer life than R600 .
I remember Raja Koduri saying GCN is the world's most scaleable GPU architecture at the R9 290X launch and I believe he was not exaggerating. GCN was designed to have a very long life and we now know from AMD FAD 2015 presentations that GCN will be around for 2015, 2016 and 2017. The 2016 GCN products might launch in Q3 2016 and they will have a life of atleast 24 months.
Frankly I don't like it when somebody talks about Maxwell as the next thing after sliced bread. Maxwell is impressive in every aspect - perf , perf/watt and perf /sq mm. But to say AMD cannot design a GPU which beats Titan-X comprehensively is just argumentative. Atleast wait for the products to prove what they are capable of before passing off such high handed dismissive comments.
If this card beats R9 290X by 65% at 4K on average at R9 290X's power usage, I'll be floored. 45% faster at $550 is already epic enough considering AMD isn't using a new architecture like NV is with Maxwell. Also, if it retains the double precision performance, even 1/4th or 1/5th of SP, that would be ridiculous. That would essentially mean the extra power usage on top of Titan X alone would be justifiable for the compute performance for their FirePro series. If AMD managed a gaming card faster than the Titan X and it has full DP compute in a smaller die size than 601mm2 (Titan X's), that would the most mind-blowing come back in AMD's history.
I personally think to get higher perf/watt, AMD needs to drop DP to like 1/32. Leave that functionality for FirePro.
This is exactly what I expect is going to happen. A 550 sq mm flagship GPU with 8 GB HBM for Radeon using 4Hi HBM and dual link interposer. 55-65% faster than R9 290X. 1/8 fp64 for Radeon (just like R9 290x) and 1/2 fp64 rate for Firepro (like Firepro W9100) . The Firepro versions will launch in Q4 2016. Those cards are waiting for the availability of 8 Hi HBM stacks which will double the effective memory capacity. Thus a 16 GB HBM Firepro is definitely on the cards. Apple could be a launch customer with the Mac Pro getting a Haswell-E and next gen Firepro with 16 GB HBM.
