MilkyWay@H - Benchmark thread Winter 2016 on (updated 1-2021) - GPU & CPU times wanted for new WUs

Page 3 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

StefanR5R

Elite Member
Dec 10, 2016
5,498
7,786
136
I'm thinking something a little older and hopefully cheap like a Asus P9X79-Deluxe, Intel i7 3930k would get this card really rolling.

The X79/ socket 2011 platform with Sandy Bridge E is still on PCIe 2.0 though. PCIe 3.0 came to socket 2011 with Ivy Bridge E processors. Would it make a difference? From what I understand, there is at worst a negligible small difference in games, but what about computing?
 

iwajabitw

Senior member
Aug 19, 2014
828
138
106
The X79/ socket 2011 platform with Sandy Bridge E is still on PCIe 2.0 though. PCIe 3.0 came to socket 2011 with Ivy Bridge E processors. Would it make a difference? From what I understand, there is at worst a negligible small difference in games, but what about computing?

The 280x will never see a game, it will be for MW only, just looking at the 6 cores of the 3930 to run multiple gpu tasks with a cpu core for each. May check and see what AMD has to offer as well. Want to keep the cost down.
 

TennesseeTony

Elite Member
Aug 2, 2003
4,209
3,634
136
www.google.com
In the old days (sometime last year), Milkyway required almost NO bandwidth, you could use 1x adapters and not lose any performance. Not sure with these new work units though. Something I could test out (eventually).
 

salvorhardin

Senior member
Jan 30, 2003
389
35
91
On a stock i7-3770K with hyperthreading disabled and all cpus free:
Stock 7950 (860/1250) 56.48s with 11.45s cpu time (20 workunits averaged)
Stock R9 390 (1015/1500) 60.72s with 13.14 cpu time (17 workunits averaged)
 

salvorhardin

Senior member
Jan 30, 2003
389
35
91
I could test GTX 1070 and 1080 (which too have weak FP64), but they are still sitting together in a single PC, and I have not yet researched whether it is possible to peg boinc to a single card. And there are a few CPUs to test, but I'd rather postpone this to after January 13.

To ignore specific cards in boinc you'll need to edit/create a cc_config.xml file in ProgramData/Boinc folder. I attached what should work with nvidia cards, with N being the device number. Just exit boinc and edit/create the file and then start boinc. When you go into you event log in boinc you'll see at the top that one of your cards is ignored while the other will remain active. On my main computer my 390 is considered device 0 and my 7950 is device 1.

Code:
<cc_config>
  <options>
    <use_all_gpus>1</use_all_gpus>
    <ignore_nvidia_dev>N</ignore_nvidia_dev>
  </options>
</cc_config>
 

Assimilator1

Elite Member
Nov 4, 1999
24,120
507
126
Thanks for your times guys :), will add them shortly.

My HD 7970 plays Elite Dangerous Horizons just fine :), although I only have a small screen atm (1680x1050).
 

Assimilator1

Elite Member
Nov 4, 1999
24,120
507
126
This is a Core2Duo E6550 on a Asus mobo MX1333, 32bit install of Vista with 2gb memory, MSI R9 280x Gpu. Preferences set to 50% of CPUs', app_config set for 1 cpu, one gpu, so no cpu tasks running.


1930098169 1409312601 5 Jan 2017, 2:48:14 UTC 5 Jan 2017, 4:04:00 UTC Completed and validated 53.77 25.30 133.66 MilkyWay@Home v1.43 (opencl_ati_101)
1930096392 1409316120 5 Jan 2017, 2:45:00 UTC 5 Jan 2017, 4:00:47 UTC Completed and validated 55.62 25.27 133.66 MilkyWay@Home v1.43 (opencl_ati_101)
1930096201 1409108710 5 Jan 2017, 2:45:00 UTC 5 Jan 2017, 4:00:47 UTC Completed and validated 56.64 25.27 133.66 MilkyWay@Home v1.43 (opencl_ati_101)
1930094208 1409158476 5 Jan 2017, 2:41:46 UTC 5 Jan 2017, 3:55:59 UTC Completed and validated 55.68 26.58 133.66 MilkyWay@Home v1.43 (opencl_ati_101)
1930092637 1409315922 5 Jan 2017, 2:40:14 UTC 5 Jan 2017, 3:55:59 UTC Completed and validated 54.55 25.26 133.66 MilkyWay@Home v1.43 (opencl_ati_101)
1930091251 1409257506 5 Jan 2017, 2:37:00 UTC 5 Jan 2017, 3:52:44 UTC Completed and validated 54.55 25.71 133.66 MilkyWay@Home v1.43 (opencl_ati_101)
1930088218 1409322521 5 Jan 2017, 2:33:40 UTC 5 Jan 2017, 3:49:30 UTC Completed and validated 54.05 24.74 133.66 MilkyWay@Home v1.43 (opencl_ati_101)
1930089251 1409311776 5 Jan 2017, 2:33:40 UTC 5 Jan 2017, 3:49:30 UTC Completed and validated 54.54 25.71 133.66 MilkyWay@Home v1.43 (opencl_ati_101)
1930088326 1409267115 5 Jan 2017, 2:32:03 UTC 5 Jan 2017, 3:47:53 UTC Completed and validated 54.04 24.49 133.66 MilkyWay@Home v1.43 (opencl_ati_101)
1930088156 1409273599 5 Jan 2017, 2:32:03 UTC 5 Jan 2017, 3:47:53 UTC Completed and validated 54.54 24.80 133.66 MilkyWay@Home v1.43 (opencl_ati_101)
1930087477 1408857327 5 Jan 2017, 2:30:26 UTC 5 Jan 2017, 3:46:16 UTC Completed and validated 54.04 24.37 133.66 MilkyWay@Home v1.43 (opencl_ati_101)
1930086377 1409311441 5 Jan 2017, 2:28:49 UTC 5 Jan 2017, 3:44:39 UTC Completed and validated 54.58 24.91 133.66 MilkyWay@Home v1.43 (opencl_ati_101)
1930084450 1409296092 5 Jan 2017, 2:25:36 UTC 5 Jan 2017, 3:41:25 UTC Completed and validated 54.54 24.85 133.66 MilkyWay@Home v1.43 (opencl_ati_101)
1930084238 1409286076 5 Jan 2017, 2:25:36 UTC 5 Jan 2017, 3:41:25 UTC Completed and validated 54.04 24.15 133.66 MilkyWay@Home v1.43 (opencl_ati_101)
1930083864 1409317211 5 Jan 2017, 2:23:58 UTC 5 Jan 2017, 3:39:48 UTC Completed and validated 54.58 25.10 133.66 MilkyWay@Home v1.43 (opencl_ati_101)
1930083173 1409311121 5 Jan 2017, 2:22:21 UTC 5 Jan 2017, 3:38:11 UTC Completed and validated 54.56 25.18 133.66 MilkyWay@Home v1.43 (opencl_ati_101)
1930082322 1409218596 5 Jan 2017, 2:20:45 UTC 5 Jan 2017, 3:36:34 UTC Completed and validated 54.04 24.45 133.66 MilkyWay@Home v1.43 (opencl_ati_101)
1930082384 1409303133 5 Jan 2017, 2:20:45 UTC 5 Jan 2017, 3:36:34 UTC Completed and validated 54.54 24.84 133.66 MilkyWay@Home v1.43 (opencl_ati_101)
1930081665 1409314567 5 Jan 2017, 2:19:09 UTC 5 Jan 2017, 3:34:57 UTC Completed and validated 54.56 25.30 133.66 MilkyWay@Home v1.43 (opencl_ati_101)
1930081529 1409285857 5 Jan 2017, 2:19:09 UTC 5 Jan 2017, 3:34:57 UTC Completed and validated 54.04 24.21 133.66 MilkyWay@Home v1.43 (opencl_ati_101)
Seeing as you didn't give me an average, I'm taking a visual average of 54.3s (I ignored the last 4 results, as they seem to be outliers).
Thanks for your time :)

Is the GPU & CPU at stock clocks?
 

iwajabitw

Senior member
Aug 19, 2014
828
138
106
Sorry man, been swamped. Everything stock and I haven't checked the times at all since the first day, so I'll go with what ever you want to rate it at.
 

crashtech

Lifer
Jan 4, 2013
10,523
2,111
146
Apoogies if this has already been covered. I'd like to submit a result for my 5820K + R9 Fury, but even though I have tried to follow instructions regarding restriction of CPU use, the GPU task persists in displaying ".798 CPUs + 1 AMD/ATI GPU," which seems to indicate that a whole CPU core isn't actually being utilized. It's running other CPU only Milkyway@Home tasks concurrently as well, I don't know if this is a problem or how to stop it while allowing only GPU tasks. I hope I haven't missed anything ridiculously simple. The specific setting which I am using is in Options/Computing Preferences/Computing/Usage Limits: Use at most 83% of the CPUs, Use at most 100% of the CPU time. Further restriction down to 66% (which should leave 2 cores free) also has no effect, the ".798" stays the same.
 

StefanR5R

Elite Member
Dec 10, 2016
5,498
7,786
136
I think this "xyz CPUs + 1 ABC GPU" label on a work unit is only an estimation of how much CPU utilization that GPU WU is going to take. Actual usage could differ more or less.

If you set the menu items "Activity"/ "Suspend" (third entry in this menu) and "Activity"/ "Use GPU always" (fourth entry) together, would this stop all CPU WUs but run the GPU WU? (Can't check myself at the moment because I don't have a suitable project selected right now.)
 

iwajabitw

Senior member
Aug 19, 2014
828
138
106
@crashtech the .798 is going to stay the same unless you change your app_config to something like this.

<app_config>
<app>
<name>milkyway</name>
<max_concurrent>0</max_concurrent>
<gpu_versions>
<gpu_usage>1</gpu_usage>
<cpu_usage>1</cpu_usage>
</gpu_versions>
</app>
</app_config>

This will allow 1 CPU & GPU to run as the benchmark asks. This is a working app_config so feel free to use it, and then adjust later as needed to run more than one gpu task with .5/.25/.75 etc in the <gpu usage>.

With the 6-core 5820, if hyper threading is on, with settings at 100% of the cpu's You should see 11 CPU only tasks and 1 GPU task running, because the config file grabbed a core.
On my 5820, with 2 GTX 980's I have it set at 50% CPU usage, app_config <gpu usage>.5</gpu usage> each card doing 2 tasks. On my task screen I see 4 GPU tasks running and 2 CPU only tasks running. Leaving 6 hyperthreaded cores free for doing normal things on the system. Hope that helps.
 

crashtech

Lifer
Jan 4, 2013
10,523
2,111
146
@crashtech the .798 is going to stay the same unless you change your app_config to something like this.

<app_config>
<app>
<name>milkyway</name>
<max_concurrent>0</max_concurrent>
<gpu_versions>
<gpu_usage>1</gpu_usage>
<cpu_usage>1</cpu_usage>
</gpu_versions>
</app>
</app_config>

This will allow 1 CPU & GPU to run as the benchmark asks. This is a working app_config so feel free to use it, and then adjust later as needed to run more than one gpu task with .5/.25/.75 etc in the <gpu usage>.

With the 6-core 5820, if hyper threading is on, with settings at 100% of the cpu's You should see 11 CPU only tasks and 1 GPU task running, because the config file grabbed a core.
On my 5820, with 2 GTX 980's I have it set at 50% CPU usage, app_config <gpu usage>.5</gpu usage> each card doing 2 tasks. On my task screen I see 4 GPU tasks running and 2 CPU only tasks running. Leaving 6 hyperthreaded cores free for doing normal things on the system. Hope that helps.
Yes, thank you, that's exactly what I needed! I see from actually reading a bit of the manual (fancy that) that this is to be an xml file placed in the specific project subdirectory that you wish to modify, but I don't see one. There are many xml files in the %\ProgramData\BOINC directory, though. That seems like where I would drop it. Is there a good reference resource for this aside from the manual?
 

iwajabitw

Senior member
Aug 19, 2014
828
138
106
I learned about the app_config files over at the Seti forums, they are pretty active, a long time ago, but did have to ask at the Milkyway forums about what settings were good to start with. Mostly trial and error on my part figuring out how much to squeeze out more Wu from the cards and not crash the system.
 

iwajabitw

Senior member
Aug 19, 2014
828
138
106
If you don't know how to make the file just open notepad, copy and paste that one of mine in there. Then save as app_config.xml into the milkyway directory in the post above. Restart boinc, should say 1 CPU & 1 GPU now.
 

crashtech

Lifer
Jan 4, 2013
10,523
2,111
146
@iwajabitw, thanks for helping me sort this out. Now I have a result for the OP that seems consistent, if not very exciting. From the data presented, it seems as if I might be better off with the R9 290 that is laying in my spare parts drawer? Wow.

133_66.png


I get 65.9 seconds out of that.

5820K @ 4.4GHz, 16GB DDR4@2666Mhz, R9 Fury @ 1050MHz (stock).
 

iwajabitw

Senior member
Aug 19, 2014
828
138
106
Cool, glad you got it going. I have a R9 280 I recently got and it's in a old vista system with a Core2Duo. I can only squeeze 4 tasks at a time out of it and it blows the 2 980's away on this project. With that 5820 you can probably crunch 3-5 times that at once.
 

crashtech

Lifer
Jan 4, 2013
10,523
2,111
146
Cool, glad you got it going. I have a R9 280 I recently got and it's in a old vista system with a Core2Duo. I can only squeeze 4 tasks at a time out of it and it blows the 2 980's away on this project. With that 5820 you can probably crunch 3-5 times that at once.
With your last sentence you seem to be saying that I am leaving a lot on the table. I can devote 100% of this machine to M@H most of the time, do you have any suggestions? I could put my spare 290 in a vacant slot or swap it out for the Fury. Would they work in tandem? I don't game nearly as much these days, and what I do play worked on the 290 pretty well.

OP, please let me know if these digressions are unwanted. I'm not sure if my config questions merit a new thread or not...
 

iwajabitw

Senior member
Aug 19, 2014
828
138
106
Yeah I do believe they will both work on the same machine, not in Crossfire/SLI like you would for games since there a different series Fury/290 but if the drivers are the same series, I do believe so. Granted I use Nvidia much more than AMD, so I hope I am telling you right. You could possibly be able to put out 500,000 to a million ppd a day.

Check out this thread at MW's forums. It will give you a lot of info.

https://milkyway.cs.rpi.edu/milkyway/forum_thread.php?id=3118&sort_style=6&start=0
 

StefanR5R

Elite Member
Dec 10, 2016
5,498
7,786
136
Does the CPU application use AVX(-256/-128), or doesn't it?

milkyway_1.40_x86_64-pc-linux-gnu running on all threads of 2x E5-2690 v4 (Broadwell-EP, 2x 14C/28T) on Supermicro X10DAX runs at 2.9 GHz, which is the all-core AVX turbo. Yet,

milkyway_1.42_windows_x86_64.exe running on all threads of i7-6950X (Broadwell-E, 10C/20T) on Asus X99-A runs at the fixed non-AVX clock which I configured in the BIOS, not at the AVX clock for which I specified a negative clock offset. I just checked to be sure: Prime95 reliably pulls this CPU down to the AVX clock.

Is this a Linux <> Windows thing? Or Xeon <> i7-EE?

Edit:
Core temperatures of the Xeons: 53...58 °C, of the i7: 60...69 °C. In contrast, WCG OpenZika caused about 10 degrees higher temperatures on the Xeons and on the i7 but ran the Xeons at their non-AVX all-core turbo.

Edit 2:
Another striking difference is the runtime of the tasks:
E5-2690 v4, HT, 2.9 GHz: 3892 s
(app: MilkyWay@Home v1.40, host: 715668)
i7-6950X, HT, 4.0 GHz: 4099 s
(app MilkyWay@Home v1.42, host: 715667)

(proper report together with other CPUs for inclusion in the OP to follow later)
 
Last edited:

crashtech

Lifer
Jan 4, 2013
10,523
2,111
146
I've brought an R9 290 into production using an old low-end HP workstation. I haven't benchmarked it following the OP's rules, but it's churning out two at a time in about 106 seconds, which would mean 1 is occurring every 53 seconds. If Assimilator1 is interested in the performance of this machine, I will bench it by the rules on request.
 

waffleironhead

Diamond Member
Aug 10, 2005
6,919
429
136
Fired up the laptop for a few units.
M275x at stock: average over five WU= 389.9 seconds.
EDIT: being fed by a I5 4510u. This CPU throttles down to 1.9ghz from its supposed base of 2.1? due to the shared heatsink being overloaded i think.
 
  • Like
Reactions: TennesseeTony

StefanR5R

Elite Member
Dec 10, 2016
5,498
7,786
136
Here are some measurements of uninteresting hardware: CPUs, and of GPUs with weak double precision. Well, at least this documents that gear like this should not be considered for MW@H.

General notes:
  • All values are averages of 20 validated WUs.
  • All samples were taken in steady state, i.e. some time after ramp-up and some time before ramp-down of the work. Linux boxes stood at the login screen; Windows boxes at an empty idle desktop. No screensaver, no svchost.exe fuzzing around.
  • For GPU measurements, one WU at a time was run. No CPU work was run in parallel with GPU work.
  • For CPU measurements, as many WUs as available hardware threads were run. "HT off" means hyperthreading disabled in the BIOS.
  • As noted before, the Linux CPU application v1.40 appears to be much more optimized than the Windows CPU application v1.42: My Broadwell-EP/ Linux was faster per core than my higher-clocked Broadwell-E/ Windows. Unfortunately I have no dual-boot machine ready for direct testing of both applications on identical hardware.
  • COV = σ / µ

CPU times on Linux (MilkyWay@Home v1.40)

Xeon E5-2690 v4 @2.9 GHz (HT off: 14 threads for CPU) .......... 2300s (COV = 0.005)
Xeon E5-2690 v4 @2.9 GHz (HT on: 28 threads for CPU) .......... 3892s (COV = 0.015)
Phenom II X4 905e @2.5 GHz (4 threads for CPU) ...................... 5559s (COV = 0.001)
Core 2 T7600 @2.333 GHz (2 threads for CPU) ........................... 5507s (COV = 0.012)

Notes:
  • Xeon E5 was used in a 2P motherboard, both CPUs loaded with MW@H simultaneously. So, twice as many threads per box as noted above per CPU.
  • Xeon E5 ran at its all-core AVX turbo clock.
  • The T7600 ran a 32 bit OS and application, the others 64 bit OS and application.

CPU times on Windows (MilkyWay@Home v1.42)

i7-6950X @4.0 GHz (HT off: 10 threads for CPU) ........................ 2390s (COV = 0.007)
i7-6950X @4.0 GHz (HT on: 20 threads for CPU) ........................ 4099s (COV = 0.007)
i7-4960X @4.5 GHz (HT off: 6 threads for CPU) .......................... 2461s (COV = 0.003)
i7-4960X @4.5 GHz (HT on: 12 threads for CPU) ........................ 4130s (COV = 0.003)
i7-4900MQ @3.36 GHz (HT off: 4 threads for CPU) .................... 3164s (COV = 0.005)
i7-4900MQ @3.20 GHz (HT on: 8 threads for CPU) .................... 5780s (COV = 0.005)

Notes:
  • i7-4900MQ clock was determined by the laptop BIOS, driving the CPU at its thermal limit.
  • OS: Windows 7 64 bit

GPU times (MilkyWay@Home v1.43)

GTX 1080 @2000 MHz (CPU, i7-6950X @4.0 GHz) ............................................ 116s (COV = 0.005)
GTX 1080 @1607 MHz (CPU, i7-6950X @4.0 GHz) ............................................ 128s (COV = 0.004)
GTX 1070 @2025 MHz (CPU, i7-6950X @4.0 GHz) ............................................ 137s (COV = 0.003)
FirePro W7000 @950 MHz (CPU, i7-4960X @4.5 GHz) ..................................... 179s (COV = 0.0001)

Notes:
  • all GPUs at stock memory clocks
  • FirePro at stock GPU clock
  • GTX 1070 factory-OC'd and with boost clock, GTX 1080 at stock and manually OC'd
  • cards were slightly under-utilized by one WU per GPU:
    • GTX 1080 @2000 MHz: 83 % typical, 72 % average GPU load
    • GTX 1080 @1607 MHz: 92 % typical, 82 % average GPU load
    • GTX 1070 @2025 MHz: 90 % typical, 81 % average GPU load
    • W7000: 99 % typical, 95 % average GPU load
  • OS: Windows 7 64 bit, drivers NVIDIA 373.06 and AMD 1.4.1848
 
  • Like
Reactions: petrusbroder

waffleironhead

Diamond Member
Aug 10, 2005
6,919
429
136
I just put a rx460 in my sons minecraft system. The intel gpu was mostly fine, but had some strange pop in going on. Picked up the fastest amd single slot card I could find. anyway...
EDIT: Being fed by an I5-4460 @ 3.2 Ghz
It is a factory OC RX460 at 1244 mhz .
Average over 5 WU is 240.5 seconds.
Not surprising considering it is 1/16 DP ratio for only 122Gflops.

https://www.amazon.com/gp/product/B01K1JVO86/ref=oh_aui_detailpage_o00_s00?ie=UTF8&psc=1
 
Last edited: