phaxmohdem
Golden Member
I've always been curious how nVidia gets away with using far fewer "Stream Processors" on their cards while matching the performance of an ATI card touting 2-3 times as many SPs (or conversely, why ATI needs so many SPs to reach parity). I've scoured various forums, but most places I haven't seen much more than semi-informed fanboi drivel.
The nearest I can figure is that an nVidia SP must somehow do roughly twice the work of an ATI SP, and that it operates at a higher clock than the rest of the GPU:
For example purposes, let's take two roughly equivalent cards, a GTX 275 and a Radeon HD 4890.
GTX 275
----------------------
240 SP
GPU/SP Clocks = 633 / 1404 MHz respectively
Radeon 4890
----------------------
800 SP
GPU Clock = 850 MHz
Simply multiplying the shader count by the shader clock (in MHz) yields the following results:
GTX 275: (240x1404) = 336,960
HD 4890: (800x850) = 680,000
This admittedly oversimplified example suggests that, ceteris paribus, the Radeon core should be able to do almost exactly twice the work of the GTX 275 (680,000 / 336,960 ≈ 2.02). Since the two cards actually perform about the same, I take that to mean nVidia's SPs can somehow do twice the work per clock of ATI's.
Does anyone have a low-level explanation for why this is the case?
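For anyone who wants to re-run the arithmetic, here is a minimal Python sketch of the same back-of-the-envelope comparison. The SP counts and clocks are the ones quoted in this post; the `naive_throughput` helper and the dictionary layout are just names made up for illustration, and "SP count times clock" is only the crude proxy used here, not a real performance metric.

```python
# Crude proxy from the post: stream processors x shader clock (MHz).
# This ignores everything about how much work one SP does per clock.
cards = {
    "GTX 275": {"sp": 240, "shader_mhz": 1404},  # shaders run at 1404 MHz
    "HD 4890": {"sp": 800, "shader_mhz": 850},   # only one clock quoted: 850 MHz
}


def naive_throughput(sp, shader_mhz):
    """Hypothetical helper: SP count multiplied by shader clock."""
    return sp * shader_mhz


scores = {name: naive_throughput(**spec) for name, spec in cards.items()}
ratio = scores["HD 4890"] / scores["GTX 275"]

for name, score in scores.items():
    print(f"{name}: {score:,}")
print(f"HD 4890 / GTX 275: {ratio:.2f}")
```

If the two cards really do land at about the same performance, that ratio is roughly the per-SP, per-clock gap that needs explaining.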