Povray on Quad Xeon E5-4624L and Quad Opteron 6344

I'm not quite understanding what this is testing. Can anyone break it down or distill it?

Performance scaling based on number of threads and cores. That quad Xeon has a total of 40 cores and 80 hardware threads. Ideally, with 40 threads, there should be a 40x speedup in render times compared with one core. As you can see, the max is only about 24x to 29x. With HT on under Windows, it's even less.
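To put numbers on that gap, here is a rough sketch that turns the quoted speedups into parallel efficiency. It also backs out the serial fraction that Amdahl's law would imply. Amdahl's law is my assumption here, not something the article or this thread claims as the cause, and the function names are made up for illustration.

```python
def parallel_efficiency(speedup: float, workers: int) -> float:
    """Fraction of the ideal (linear) speedup actually achieved."""
    return speedup / workers


def amdahl_serial_fraction(speedup: float, workers: int) -> float:
    """Serial fraction s implied by Amdahl's law, S = 1 / (s + (1 - s) / N).

    Solving for s gives s = (N / S - 1) / (N - 1).
    This assumes the shortfall is purely serial work, which may not hold.
    """
    return (workers / speedup - 1) / (workers - 1)


CORES = 40  # quad Xeon core count quoted in the post above

for observed in (24.0, 29.0):  # speedup range quoted in the post above
    eff = parallel_efficiency(observed, CORES)
    serial = amdahl_serial_fraction(observed, CORES)
    print(f"{observed:.0f}x on {CORES} cores: "
          f"efficiency {eff:.1%}, implied serial fraction {serial:.2%}")
```

So 24x to 29x on 40 cores is roughly 60% to 72% of ideal. Under the Amdahl assumption, a serial portion of only about 1% to 2% of the work would be enough to explain it.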
 
OK, and why do you think that is? And why doesn't the Piledriver Opteron have similar results?
 
1) I don't know, but I would guess memory access congestion due to so many threads.

2) The Opterons do have similar results: it's not a 1:1 performance increase at high core counts.
 
They recompiled povray on Linux but didn't do so on Windows. I'm not sure how much of a difference that would make, but it's something to keep in mind.
 
And they use VS10 instead of Intel Composer XE, etc...

----------------------------------------------------------------

The drop-off with HT could be as simple as locking a thread to a logical core; that's not how HT is designed to work.
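For anyone unsure what "locking to a logical core" looks like in practice, here is a minimal sketch of CPU pinning using Python's `os.sched_setaffinity`. That call is only available on some platforms (Linux, for example), so the code guards for it. Nothing here is taken from povray or the article, and whether either build pins threads at all is not confirmed; `pin_to_logical_cpu` is a hypothetical helper name.

```python
import os


def pin_to_logical_cpu(cpu: int):
    """Pin the calling process to a single logical CPU.

    Returns the resulting affinity set, or None when the platform
    does not expose os.sched_setaffinity (e.g. Windows).
    """
    if not hasattr(os, "sched_setaffinity"):
        return None
    os.sched_setaffinity(0, {cpu})  # pid 0 means "the caller"
    return os.sched_getaffinity(0)


if hasattr(os, "sched_getaffinity"):
    allowed = os.sched_getaffinity(0)   # logical CPUs we may run on now
    print("before:", sorted(allowed))
    print("pinned:", pin_to_logical_cpu(min(allowed)))
    os.sched_setaffinity(0, allowed)    # restore the original affinity
```

Once pinned, the scheduler can no longer move the work to an idle physical core, so two busy threads that land on sibling logical cores of the same physical core end up sharing its execution resources.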

I suppose, in reality, none of that really matters to end users who aren't using povray as part of a rendering system where they have their own developers optimizing the executables.
 
"1) I don't know, but I would guess memory access congestion due to so many threads."
Unlikely, given that there's a table showing that under Linux HT brings a small speedup (40 s with 40 cores, 32 s with 40 cores + HT). On top of that, it's ~20% faster, so memory congestion would be even more of an issue.
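As a quick check of the two Linux render times quoted above (40 s and 32 s), the arithmetic can be sketched like this. The helper names are made up for illustration.

```python
def speedup(t_before: float, t_after: float) -> float:
    """How many times faster the second run is."""
    return t_before / t_after


def time_saved(t_before: float, t_after: float) -> float:
    """Fraction of the original render time that was eliminated."""
    return (t_before - t_after) / t_before


t_40_cores = 40.0     # seconds, 40 threads, as quoted above
t_40_cores_ht = 32.0  # seconds, 40 cores + HT, as quoted above

print(f"speedup from HT: {speedup(t_40_cores, t_40_cores_ht):.2f}x, "
      f"render time reduced by {time_saved(t_40_cores, t_40_cores_ht):.0%}")
# prints: speedup from HT: 1.25x, render time reduced by 20%
```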
 