Your argument falls apart when you take into account that the 290 is already usually cheaper than a 970 while offering similar performance. A 960 would have to be priced under 175 and have at least 3GBs of VRAM to be a viable card.
Not sure why RussianSensation is so doom & gloom about AMD. It flies in the face of every benchmark out there.
exactly. RS is in fact paranoid when it comes to AMD's impending doom. He has always underestimated AMD. He was one of the most vocal persons before R9 290X launched saying AMD will not be able to even catch up GTX 780. Guess what AMD beat Titan and has aged more gracefully than GTX 780 Ti. You only need to see performance in the major games released in the last 3 - 6 months to know that.
In fact the way Nvidia has treated its Kepler owners I wouldn't be surprised to see a lot more Nvidia enthusiasts go with R9 390X if the rumours are true about it taking the GPU crown.
look at this post here from RS focussing on die sizes and saying R9 390X at 550 sq mm can only be 30% faster than R9 290X . He forgets to see that there are lot of changes happening here. The architectural changes and other chip efficiency improvement techniques combined with the impact of a state of the art High bandwidth memory system (HBM) on a 2.5D silicon interposer which provides much higher bandwidth (> 60% vs R9 290X) at vastly lower power.
http://forums.anandtech.com/showpost.php?p=37013050&postcount=236
With a slightly modified GCN 1.1 architecture R9 290X was 35% faster than R9 280X with 37.5% more shaders. R9 390X has 45% more shaders and will inherit the massive improvements to tesselation performance, ROP performance and memory bandwidth efficiency from Tonga (GCN 1.2).
GCN 2.0 aka R9 390X will in fact improve the core shader architecture too to improve perf/sp, perf/watt and perf/sq mm. He also fails to see that HBM is a game changer as it moves the vast complexity of the on die memory controller logic to the memory stack saving area, transistors and power. HBM cuts power by 65% for a given bandwidth when compared to GDDR5 (memory controller, PHY and GDDR5 chips). see slide 45 of pdf below. The HBM chip is a 5mKGSD (molded known good stacked die) where the base die is logic and then you have 4 stacked RAM chips.
http://www.microarch.org/micro46/files/keynote1.pdf
In a high end GPU like R9 290X with a 512 bit memory bus the memory subsystem can consume anywhere between 35 - 50% of board power. Here is a study of older GPUs Quadro FX 9800 (based on GTX 280 with a 512 bit GDDR3 memory bus) and HD 6990 (with 2 HD 6970 GPUs each with a 256 bit GDDR5 memory bus) where the memory controller & DRAM chips consume 60+ % and 35+% respectively.
http://www.cse.psu.edu/~juz138/files/islped209-zhao.pdf
AMD has other tricks up its sleeve like Integrated voltage regulator (IVR), integrated passive devices (IPD) and adaptive voltage operation which further cut power. These are mentioned in the GF presentation below and also in AMD's power efficiency feature roadmap.
https://www.youtube.com/watch?v=po29B53bpic
IVR and IPD are the logical companions for HBM to cut memory and board power. Adaptive voltage operation is a a feature in the upcoming Carrizo APU. Since AMD's GPU and APU share a lot of the work on efficiency improvements its logical to see this tech make its way to next gen GPUs. Carrizo is also most likely to feature an IVR to improve power efficiency.
http://images.anandtech.com/doci/8742/Voltage Adaptive.png
http://images.anandtech.com/doci/8742/Carrizo Efficiency.png
Lastly the GF 28SHP process is a significantly better process than TSMC 28HP at which R9 290X was built. The vastly lower leakage power figures for Beema (GF 28SHP) compared to Kabini (TSMC 28HP) confirm the same.
http://www.anandtech.com/show/7974/...hitecture-a10-micro-6700t-performance-preview
"
AMD claims a 19% reduction in core leakage/static current for Puma+ compared to Jaguar at 1.2V, and a 38% reduction for the GPU. The drop in leakage directly contributes to a substantially lower power profile for Beema and Mullins."
Leakage power amounts to
30 - 35% of a chip's TDP.
http://www.realworldtech.com/intel-45nm-hkmg/4/
"To give an example of the benefits that the 45nm process could bring to a microprocessor, we will examine a hypothetical scenario. Tulsa, a 65nm implementation of the Pentium 4 microarchitecture, has plenty of available data from ISSCC and Hot Chips.
Tulsa dissipates 150W at 3.4GHz, with 16MB of shared L3 cache. Of that 150W TDP, roughly 45W is from leakage power, while the remainder is active power dissipation [2]. "
For a 250W GPU a 38% reduction in leakage power (which when assumed as 30% of chip TDP = 250 * 0.3 * 0.38 = 28.5w) means a 28W power reduction. Anyway I will be glad to see him be proven wrong once again as AMD achieves the huge 60 - 65% perf jump of R9 390X using high performance 28nm process (GF 28SHP) with a die size <= 550 sq mm. R9 390X is going to be an awesome performer.