itsmydamnation
Diamond Member
- Feb 6, 2011
- 3,072
- 3,897
- 136
You are wrong here, games aren't bandwidth limited at all. But lower memory access times will always help performance. higher speed memory at same amount of cycles to access equals lower latency.I didn't miss them, for gaming the 8C version is useful for games with a good scaling over 4C, that's it. Bandwidth is a bigger factor btw. It can bring a nice boost in some games, faster RAM does it for Dualchannel.
I find it odd people think having 10GB of bandwdith per core ( quad core dual channel ddr4 2400) is bandwidth limited. AFAIK each core (>Haswell) support 16 outstanding memory requests. with ~50ns of memory latency that means in the worst case is 128bits (sse) * 16 per 50ns of time per core.
128 *16 * 1,000,000,000 /50 /8 /1024 /1024 = 4883MB/s per core
128=sse width
*16 = outstanding accesses
*1,000,000,000 = move everything to ns time per second
/50 = reduce by memory access time
/8 = convert bits to bytes
/1024 = convert bytes to kilo bytes
/1024 = convert kbytes to megabytes
This doesn't factor in any hazards accessing the cache system or the dram itself, which will happen and make that numbers even lower. But by lowering memory latency over a given period of time you can have more outstanding requests filled and thus better performance.
This is why those memory bandwidth tests are stupid because they are using streaming prefetchers ( moving large amount of contiguous data into L2/L3) which dont do very much for the types of workloads people then think they matter for ( low ILP code).
People in particular would bash steamroller/execavator over their low steam prefetching without understanding what they were actually bashing.
*now i know we move cache lines but the point still stands