Thanks Stefan!

The LLR2 application makes use of AVX-512. The following projects are built on LLR2:
- All of the PrimeGrid LLR subprojects,
- all SRBase subprojects except the GPU project "TF",
- the LLR2 testing subproject at Private GFN Server.
The PRST application, which is similar to LLR2 and is run at Private GFN Server's subproject of the same name, supports AVX-512 too.
As does the genefer22 application, which is used by
- PrimeGrid GFN-16…GFN-22, if they are run on CPUs instead of a GPU.
All of these projects are concerned with finding primes, or with proving/disproving conjectures connected with primes. So far I haven't heard of other active Distributed Computing projects whose applications benefit from AVX-512.
Perhaps Folding@Home's CPU-only FAHCore_a8 uses AVX-512, perhaps not. The older FAHCore_a7 most likely does not. Both are based on GROMACS, which offers AVX-512 support, but the GROMACS builds in the F@H cores might not have it enabled. If you enable an F@H CPU slot on an AVX-512 capable computer (probably with at most 64 logical CPUs to suit FAHCore_a8's limitations, IIRC), the client log will probably show you which SIMD flavor is being used.
Asteroids@home: New AVX512 application released

We are very proud to announce our new set of optimized applications that will utilize AVX512 instruction set capable engines or, to be precise, those which support AVX512dq instructions! These applications are built to support both Linux and Windows 64bit architecture OS. The development of this version was possible thanks to the great help provided by ahorek's team!

Unfortunately it turns out that BOINC client applications for Windows still do not report all processor options to the server correctly. It is because of a known bug and even after a lot of discussions in BOINC's channels it's still there. The good news is that thanks to ahorek's team a bugfix was already accepted and merged into the BOINC's repository and the fix will be applied when client version 7.26.0 is released. Till then in order to run the AVX512 application you might need to switch to the Anonymous platform.

We'd like to remind you that the BOINC server is capable of finding the best-performing application for every particular system, taking multiple factors into account, but it is only after a while that it will start sending the right one. Which means that even if your CPU supports AVX512dq instructions it still might receive FMA or AVX tasks and there is nothing to be concerned about. In such a case you might want to give a try to the so-called Anonymous platform where your client will explicitly request the AVX512 application.

Happy crunching and thank you for your support!
Asteroids@home's team
More info here - https://asteroidsathome.net/boinc/forum_thread.php?id=988
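Since the announcement hinges on AVX512dq specifically, here is a minimal sketch for checking which AVX-512 subsets a Linux host reports. It assumes the usual /proc/cpuinfo layout with a "flags" line; the helper names and the sample flags line are made up for illustration and are not part of BOINC or of any project application.

```python
# Flag names as they appear on the "flags" line of /proc/cpuinfo on x86 Linux.
AVX512_FLAGS = ("avx512f", "avx512dq", "avx512cd", "avx512bw", "avx512vl")


def avx512_flags(cpuinfo_text):
    """Return the AVX-512 flags found on the first 'flags' line of cpuinfo_text."""
    for line in cpuinfo_text.splitlines():
        if line.startswith("flags"):
            # Everything after the colon is a space-separated list of flags.
            _, _, value = line.partition(":")
            present = set(value.split())
            return [flag for flag in AVX512_FLAGS if flag in present]
    return []


def read_cpuinfo(path="/proc/cpuinfo"):
    """Read the kernel's CPU description (Linux only)."""
    with open(path) as handle:
        return handle.read()


# Hypothetical, shortened flags line, used only to demonstrate the parser.
SAMPLE = "processor\t: 0\nflags\t\t: fpu sse2 avx avx2 fma avx512f avx512dq avx512vl\n"

found = avx512_flags(SAMPLE)
print("AVX-512 flags:", found)
print("avx512dq reported:", "avx512dq" in found)
```

On a real Linux machine, `avx512_flags(read_cpuinfo())` would inspect the actual host. This only tells you what the kernel reports, not what the BOINC client forwards to the project server.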
Other posters in the CPU subforum have linked to it before, but anyway:
mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > Zen4's AVX512 Teardown
An analysis by the author of y-cruncher. One of the points to take home: Although the theoretical peak throughput of Zen3/Zen4 AVX-256 and Zen4 AVX-512 is the same clock-for-clock, moving an application to AVX-512 on Zen4 can reduce bottlenecks in the CPU's frontend (= utilize the execution units better), and also reduce the energy spent in the CPU's frontend (= spend correspondingly more of the overall power budget in the execution units and elsewhere).

The Genefer application for CPUs has got a command line switch which toggles between different instruction sets:
-x <implementation> set a specific implementation (i32, sse2, sse4, avx, fma, 512)
genefer -n 20 -b 2615062
This workunit gets 34,066.53 credit.

tasks x threads, affinity | avg. task duration | tasks/day | points/day | power | efficiency
--------------------------+-----------------------+-----------+------------+-------+------------
8x8, ascending | 5:54:07 = 21247 s | 32.5 | 1,108,218 | 475 W | 2,330 PPD/W
tasks x threads, affinity | avg. task duration | tasks/day | points/day | power | efficiency
--------------------------+-----------------------+-----------+------------+-------+------------
8x8, ascending | 5:09:43 = 18583 s | 37.1 | 1,267,104 | 474 W | 2,670 PPD/W
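
The derived columns in the two tables follow from the average task duration, the number of concurrent tasks, the credit per task and the power draw. A small sketch of that arithmetic, assuming "8x8" means 8 concurrent tasks with 8 threads each and using the 34,066.53 credit quoted for this workunit (the function name is made up for illustration):

```python
SECONDS_PER_DAY = 86_400
CREDIT_PER_TASK = 34_066.53  # credit quoted for this workunit


def throughput(concurrent_tasks, task_seconds, watts, credit=CREDIT_PER_TASK):
    """Return (tasks/day, points/day, points/day per watt) for one configuration."""
    tasks_per_day = concurrent_tasks * SECONDS_PER_DAY / task_seconds
    points_per_day = tasks_per_day * credit
    return tasks_per_day, points_per_day, points_per_day / watts


# Durations and power readings taken from the two table rows.
first = throughput(8, 21247, 475)
second = throughput(8, 18583, 474)

for label, (tpd, ppd, eff) in (("first", first), ("second", second)):
    print(f"{label}: {tpd:.1f} tasks/day, {ppd:,.0f} PPD, {eff:,.0f} PPD/W")

# Ratio of task durations, i.e. how much faster the second run is.
print(f"speedup: {21247 / 18583:.3f}x")
```

The results agree with the tables to within rounding; the small differences presumably come from the task durations being rounded to whole seconds.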
(https://github.com/patnashev/llr2)
On May 1 Pavel Atnashev said:
DEPRECATED: The source code is no longer maintained and no further releases are planned. Replaced by PRST utility.
<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
PRST version 9.0.766, GWnum library version 30.12
Using all-complex AVX-512 FFT length 1152K, Pass1=128, Pass2=9K, clm=4, 4 threads.
Fermat probabilistic test of 62698*5^4906582+1, a = 3, complexity = 11525498.
Gerbicz-Li check enabled, L2 = 319*279.
Saving 128 proof points.
Testing complete.
62698*5^4906582+1 compressed 128 points to 7 products, time: 13.6 s.
Done.
15:14:25 (12780): called boinc_finish(0)
</stderr_txt>
]]>
<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
BOINC PrimeGrid wrapper 2.02 (Nov 23 2020 23:35:38)
running ../../projects/www.primegrid.com/llr2_1.3.0_win64_220821.exe -v
LLR2 Program - Version 1.3.0, using Gwnum Library Version 30.9
running ../../projects/www.primegrid.com/llr2_1.3.0_win64_220821.exe -oGerbicz=1 -oProofName=proof -oProofCount=128 -oProductName=prod -oPietrzak=1 -oCachePoints=0 -pSavePoints -q3*2^22008906-1 -d -t4 -oDiskWriteTime=1
Gerbicz check is requested, switching to PRP.
Starting probable prime test of 3*2^22008906-1
Using AVX-512 FFT length 1200K, Pass1=640, Pass2=1920, clm=2, 4 threads, a = 3, L2 = 539*319, M = 171941
Compressed 128 points to 7 products. Time : 41.962 sec.
Testing complete.
15:03:43 (11464): called boinc_finish(0)
</stderr_txt>
]]>
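
To confirm from a result's stderr which SIMD flavor was actually used, one can look for the "Using ... FFT length" line. A rough sketch, with a regular expression written only against the two log lines shown above; other LLR2/PRST/gwnum versions may word this line differently, and the function name is made up for illustration:

```python
import re

# Matches e.g. "Using all-complex AVX-512 FFT length 1152K, ..., 4 threads".
# The word before the SIMD name (such as "all-complex") is optional.
FFT_LINE = re.compile(
    r"Using (?:(?P<kind>[\w-]+) )?(?P<simd>[\w-]+) FFT length (?P<length>\d+K)"
    r".*?(?P<threads>\d+) threads"
)


def fft_info(stderr_text):
    """Return a dict describing the first FFT line found, or None if absent."""
    match = FFT_LINE.search(stderr_text)
    if match is None:
        return None
    info = match.groupdict()
    info["threads"] = int(info["threads"])
    return info


# The two FFT lines quoted from the stderr output above.
PRST_LINE = "Using all-complex AVX-512 FFT length 1152K, Pass1=128, Pass2=9K, clm=4, 4 threads."
LLR2_LINE = ("Using AVX-512 FFT length 1200K, Pass1=640, Pass2=1920, clm=2, "
             "4 threads, a = 3, L2 = 539*319, M = 171941")

for line in (PRST_LINE, LLR2_LINE):
    print(fft_info(line))
```

Passing the full text between the stderr_txt tags works the same way, since the search stops at the first matching line.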