Cogman
Lifer
- Sep 19, 2000
- 10,286
- 145
- 106
Actually in a multithreade FPU environment, we would have an advantage.
In AVX we will have 8 256-bit units
In non-AVX we will have 16 128-bit units.
Compared to everything I have seen on the server Sandybridge, they will have 8 256-bit AVX units, so we are generally tied on AVX code, but on non-AVX code they will only have 8 128-bit units, or half the FP capability.
Remember that most apps will not be recompiled to take advantage of AVX right away, so we have an advantage.
Also, unless they have changed their scheduler, they have 1 that covers 2 integer threads and the FPU. We have one for each integer thread plus one for the FPU, so in a multithreaded environement I would bet on Bulldozer.
So wait, I'm probably just looking at this slide wrong. Will this appear as 1 core or 2? If it is 2, than I stand by my claim that highly threaded FP performance might suffer. But if it is 1 then my claim is off the wall and you can kindly ignore me
