VirtualLarry
No Lifer
- Aug 25, 2001
- 56,587
- 10,225
- 126
This makes sense, thanks. Good post.970/980 are clocked at about 1200Mhz on average, do the maths.
I stand by the perf/watt number, if perf/watt was intrinsicaly better, that is due to architectural superiority, then it would have the same max throughput for half the power comsumption but there s only 17% difference at peak throughput, what you fail to understand is that its efficency is not at a fixed value, it varies hugely with the GPU loading, it s not like an efficency gained from a node shrink where the power comsumption curve would be simply translated 30% lower, in this case the GPU more or less manage to gate off its non functional parts at very high speed within the flow of datas but if the flow is sustained and is close to max throughput then the GPU has to let all the parts supplied to process the bottlenecked datas in the waiting, comsumption will then reach what it is supposed to be at this node level.
Thg used the word compression as an analogy, what is compressed actualy is the size of the GPU in function of the computation needs, in a game this will translate in gated off unities when the scene is not demanding, and games are rarely, if ever, getting close to max throughput.
This chip adressed a few of the remaining thing to improve in Hawai s pipelines but it cant be considered next gen.
So a full distributed-computing load, would basically revert Maxwell to being quite a bit more inefficient, due to all computational units being utilized, and not being allowed to power-gate them.