Originally posted by: Azn
It's pointless to argue with you when I already gave you an explanation why the 8800gt beats the 8800gts 640mb. Although the 8800gt is bandwidth starved it's able to beat 8800gts 640 mb because it has more SP and TMU.
Again all you?re doing is simply confirming what I?ve been saying all along: that despite having reduced memory bandwidth, the 8800 GT is faster because of improvements to the core, thereby proving memory bandwidth isn?t the primary limiting factor.
Do you understand what bandwidth limitation is? It's when a card can't fully stretch it's performance because of bandwidth. Not when the card has more bandwidth so it should be able to beat the card. That's where you base your entire argument on and have a hard time understanding this concept.
Uh, no. My entire argument comes from the fact that you?re claiming memory is the limitation, yet parts still get faster when it?s reduced. Clearly it isn?t the primary limitation.
It?s like narrowing the neck of a bottle but still observing a higher flow of water. Clearly then the neck wasn?t primarily holding things back.
1680x1050
Crysis 4xAF high settings
8800gts @ 756/1836/1026
34.43 fps
CORE REDUCTION 590/1836/1026
32.13 fps -7.2% difference
BANDWIDTH REDUCTION 756/1836/800
29.72 fps -15.8% difference
So you were saying? I suggest you go pick up a G92 and go study up on it. This card severely bottlenecked by bandwidth.
Where to even start with this?
How about the fact that you used a single game, a single benchmark, and a single card and are trying to claim that sole result is somehow the norm when multiple G92 benchmarks disagree with you?
How about the fact that you never even touched the shader clock?
How about the fact that you didn?t even paint any kind of average across a range of scenarios?
Additionally, I?m pretty sure Chizow posted several tests from his G92 that showed the opposite of yours (i.e. memory showed the lowest performance impact). There were even several G92 overclocking threads that showed the same thing on the whole.
So yeah, your figures may be accurate, but they?re also an outlier. You absolutely cannot infer it?s the norm based on your sole result.
Again you have a hard time understanding what bandwidth limitation means. I tried to explain to you in many occasions and gave you plenty of examples. Not my fault you can't seem to understand it spit out rhetoric. All I can hope is that you try.
Again you seem to have a hard time understanding that if you reduce something but performance goes up, it?s not a limitation.
We are trying to find out if G92 is hindered by bandwidth. After all that is the topic on hand.
Again we know it isn?t because G92 parts beat other parts that have more bandwidth.
We also know it isn?t because G92 parts? relative performance increased despite having their bandwidth reduced.
That and you?ve repeatedly agreed that core improvements to the G92 more than offset the reduction in bandwidth because that?s the explanation you gave us as to why it?s faster.
4850 is sure limited by bandwidth. You move the memory slider up and you get better average and minimum frame rates. Not to mention 4850 core clocks are locked with their SP clocks.
Oh, so you use the term ?limited? to describe anything that holds back performance? In that case every DX10 part is basically limited by everything (core/memory/shader/texturing/CPU) since it?s possible to find a scenario where improving one of these facets improves performance to some degree.
Heck, even overclocking the 2900XT?s VRAM can yield some performance improvement in certain situations. I guess using your terminology I can say the 2900XT is limited by bandwidth since moving the memory slider improves performance?
Using the term like that makes it lose meaning. More accurate usage would be based on the lowest proportionate ratio between the slider increase and the actual improvement. In that case the 4850 is clearly limited by the core, not the memory.
Now what you are describing is what gets more gains from the GPU. SP/core vs memory bandwidth. In this case combination of SP and core wins over bandwidth.
That?s my point, namely that the primary limitation is from the core.
LOL... Again with that same rhetoric. I already posted the answers already multiple times.
The only answers you posted yet again prove the 4850 isn?t primarily limited by bandwidth.
But it was techreport who posted 3dmark theoretical figures and even posted a conclusion why bandwidth limits fillrate. I'm just citing what they posted and giving you examples.
But again these ?examples? have little to no basis in the real world. I?m showing you actual game figures while you post 3DMark synthetic tests as ?proof?. Uh-huh.
Then you should do something about your partner since you can't trust 2900xt is faster than 8800gtx in 3dmark.
Uh, no, I don?t need to ?do? anything. Go assign homework exercises to someone who gives a shit.
If you don't know the reason why 2900xt beats 8800gtx in 3dmark 2k6 what makes you so qualified to do any kind of articles and make conclusions?
Is that supposed to incite some kind of a reaction on my part? Try harder.
To answer OP's question. Yes G92 is hindered by memory bandwidth.
Again multiple independent benchmarks disagree with your sole test. If the card was hindered by memory bandwidth then it couldn?t possibly be showing performance gains by reducing it.
In case of g92 memory bandwidth makes the most difference as shown by Crysis benchmark.
Yes, in your sole benchmark, which is neither complete or the norm.
Look at 4870 minimum frame rates compared to gainward 4850 that's clocked to 4870 core and maxed out memory clocks. You see 25fps on the Gainward @ 1920x1200 while 4870 minimum frame rate is 33fps. that's quite a bit of a jump than 10-15% average frame rate gains. That's 25% better minimum frame rates.
Right, but again I?m not trying to imply bandwidth makes zero difference. Not to mention that the Gainward is sometimes faster than the 4870 despite having less bandwidth, clearly indicating driver or benchmarking noise.
I've never said bandwidth is the primary factor with most dx10 parts. What you did say however is that bandwidth is a non-issue among dx10 parts although it is very much an issue.
From my second post I stated
no it?s not, not when it?s readily demonstrated that SP clocks generally have a bigger impact on performance than memory clocks. Again this is something that I?ve tested repeatedly with several parts.
Again my point is that bandwidth is
not the
primary limiting factor on DX10 parts (including the G92), not that bandwidth makes zero difference.
But based on my first comment in this thread I can see I used a poor choice of words to convey this meaning.