And which compiler did you use? GCC? :hmm:
gcc
Some results for Povray 3.6.1 (which is single thread only) on my Athlon II x4, 2.8 GHz
gcc 4.4.5, -march=barcelona, -ffast-math -unroll-loops
Parse Time: 0 hours 0 minutes 1 seconds (1 seconds)
Photon Time: 0 hours 0 minutes 16 seconds (16 seconds)
Render Time: 0 hours 13 minutes 23 seconds (803 seconds)
Total Time: 0 hours 13 minutes 40 seconds (820 seconds)
Official binary
Parse Time: 0 hours 0 minutes 1 seconds (1 seconds)
Photon Time: 0 hours 0 minutes 28 seconds (28 seconds)
Render Time: 0 hours 23 minutes 3 seconds (1383 seconds)
Total Time: 0 hours 23 minutes 32 seconds (1412 seconds)
And just to show it's instruction scheduling and not the x87's fault:
gcc 4.4.5, -march=barcelona -ffast-math -unroll-loops -mfpmath=387
Parse Time: 0 hours 0 minutes 1 seconds (1 seconds)
Photon Time: 0 hours 0 minutes 21 seconds (21 seconds)
Render Time: 0 hours 15 minutes 32 seconds (932 seconds)
Total Time: 0 hours 15 minutes 54 seconds (954 seconds)
Here's comparison with icc 11.1:
icc 11.1, -march=core2
Parse Time: 0 hours 0 minutes 1 seconds (1 seconds)
Photon Time: 0 hours 0 minutes 18 seconds (18 seconds)
Render Time: 0 hours 13 minutes 26 seconds (806 seconds)
Total Time: 0 hours 13 minutes 45 seconds (825 seconds)