OK, then, I wouldn't do LLR. It's not an easy problem to solve.
As you may know, I wrote a PrimeGrid GPU sieve application, but it only works on large ranges of K's. I've wanted to write an application that would work on individual K's, like sr2sieve, but for a GPU.
Now, here's how a fixed-K sieve works: for each candidate prime p, you solve a discrete logarithm. sr2sieve uses baby-step giant-step, which is fast, but it needs a large lookup table, and that kind of memory is slow to access on a GPU.
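To make the memory point concrete, here's a rough CPU-side sketch of baby-step giant-step (nothing like sr2sieve's actual code, and all the names are made up). It solves b^n ≡ target (mod p), and the baby-step table is the part that grows like sqrt(p), which is exactly what hurts on a GPU:

/* Baby-step giant-step sketch: find n with b^n ≡ target (mod p).
 * Illustrative only; assumes p is an odd prime below 2^63. */
#include <stdint.h>
#include <stdlib.h>
#include <math.h>

static uint64_t mulmod(uint64_t a, uint64_t b, uint64_t p) {
    return (uint64_t)((unsigned __int128)a * b % p);   /* 128-bit intermediate */
}

static uint64_t powmod(uint64_t b, uint64_t e, uint64_t p) {
    uint64_t r = 1;
    for (; e; e >>= 1) {
        if (e & 1) r = mulmod(r, b, p);
        b = mulmod(b, b, p);
    }
    return r;
}

typedef struct { uint64_t value, index; } entry_t;

static int cmp_entry(const void *x, const void *y) {
    uint64_t a = ((const entry_t *)x)->value, b = ((const entry_t *)y)->value;
    return (a > b) - (a < b);
}

/* returns n with b^n ≡ target (mod p), or -1 if there is none */
static int64_t bsgs(uint64_t b, uint64_t target, uint64_t p) {
    uint64_t m = (uint64_t)sqrt((double)p) + 1;   /* step size, about sqrt(p) */
    while (m * m < p) m++;                        /* guard against rounding error */

    /* baby steps: table of b^j for j = 0..m-1 -- this is the memory hog;
     * for p near 2^63 it would be tens of gigabytes */
    entry_t *baby = malloc(m * sizeof *baby);
    uint64_t cur = 1;
    for (uint64_t j = 0; j < m; j++) {
        baby[j].value = cur;
        baby[j].index = j;
        cur = mulmod(cur, b, p);
    }
    qsort(baby, m, sizeof *baby, cmp_entry);

    /* giant steps: look for target * b^(-i*m) in the table */
    uint64_t factor = powmod(b, (p - 1) - m % (p - 1), p);  /* b^(-m) via Fermat */
    uint64_t gamma = target % p;
    int64_t n = -1;
    for (uint64_t i = 0; i < m && n < 0; i++) {
        entry_t key = { gamma, 0 };
        entry_t *hit = bsearch(&key, baby, m, sizeof *baby, cmp_entry);
        if (hit) n = (int64_t)(i * m + hit->index);   /* then b^(i*m + j) = target */
        gamma = mulmod(gamma, factor, p);
    }
    free(baby);
    return n;
}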
I was thinking of using Pollard rho instead, since it uses very little memory. The problem with Pollard rho is that its runtime is unpredictable, but I was thinking the work could be divided into chunks of roughly the average runtime, and once a p is solved on one process, that p could be swapped out for a new one.
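And here is the kind of low-memory replacement I have in mind: Pollard's rho for discrete logarithms. Again, this is only a sketch with made-up names, not sieve code. Each walk needs just a handful of registers, and the main loop is the part that could be chopped into fixed-size chunks and rotated across different p's:

/* Pollard's rho for discrete logs: find n with g^n ≡ h (mod p).
 * Illustrative sketch; assumes p is an odd prime below 2^63 and that
 * g generates the multiplicative group mod p. */
#include <stdint.h>

static uint64_t mulmod(uint64_t a, uint64_t b, uint64_t p) {
    return (uint64_t)((unsigned __int128)a * b % p);
}

static uint64_t powmod(uint64_t b, uint64_t e, uint64_t p) {
    uint64_t r = 1;
    for (; e; e >>= 1) {
        if (e & 1) r = mulmod(r, b, p);
        b = mulmod(b, b, p);
    }
    return r;
}

static uint64_t gcd64(uint64_t a, uint64_t b) {
    while (b) { uint64_t t = a % b; a = b; b = t; }
    return a;
}

/* inverse of a mod m by extended Euclid; assumes gcd(a, m) == 1, m < 2^63 */
static uint64_t invmod(uint64_t a, uint64_t m) {
    int64_t t = 0, nt = 1, r = (int64_t)m, nr = (int64_t)(a % m);
    while (nr) {
        int64_t q = r / nr, tmp;
        tmp = t - q * nt; t = nt; nt = tmp;
        tmp = r - q * nr; r = nr; nr = tmp;
    }
    return (uint64_t)(t < 0 ? t + (int64_t)m : t);
}

/* classic 3-way partitioned walk: x = g^a * h^b (mod p) */
static void step(uint64_t *x, uint64_t *a, uint64_t *b,
                 uint64_t g, uint64_t h, uint64_t p, uint64_t ord) {
    switch (*x % 3) {
    case 0:  *x = mulmod(*x, *x, p); *a = (*a * 2) % ord; *b = (*b * 2) % ord; break;
    case 1:  *x = mulmod(*x, g, p);  *a = (*a + 1) % ord;                      break;
    default: *x = mulmod(*x, h, p);                       *b = (*b + 1) % ord; break;
    }
}

/* returns n with g^n ≡ h (mod p), or -1 on a bad collision (restart the walk) */
static int64_t rho_dlog(uint64_t g, uint64_t h, uint64_t p) {
    uint64_t ord = p - 1;                      /* group order */
    uint64_t x = 1, a = 0, b = 0;              /* tortoise */
    uint64_t X = 1, A = 0, B = 0;              /* hare */

    /* in a GPU version this loop is what gets cut into fixed-size chunks */
    for (uint64_t i = 0; i < ord; i++) {
        step(&x, &a, &b, g, h, p, ord);
        step(&X, &A, &B, g, h, p, ord);
        step(&X, &A, &B, g, h, p, ord);
        if (x != X) continue;

        /* collision: g^a h^b = g^A h^B, so n*(B - b) ≡ a - A (mod ord) */
        uint64_t r = (B + ord - b) % ord;
        uint64_t s = (a + ord - A) % ord;
        if (r == 0) return -1;                 /* degenerate collision */
        uint64_t d = gcd64(r, ord);
        if (s % d) return -1;                  /* no solution from this collision */
        uint64_t od = ord / d;
        uint64_t n0 = mulmod(s / d, invmod(r / d, od), od);
        /* d candidates n0 + k*od; d is usually tiny, so just verify them */
        for (uint64_t k = 0; k < d && k < 64; k++) {
            uint64_t cand = n0 + k * od;
            if (powmod(g, cand, p) == h % p) return (int64_t)cand;
        }
        return -1;
    }
    return -1;
}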
The other problem with sieves is that you need to do modular multiplication while almost never computing an actual modulus (a division), if you want your program to run fast. I used Montgomery multiplication in my sieve, which I mostly understood. Another guy used Barrett reduction in his factoring program, which is not a sieve; I never quite understood how to use it, but it seemed to require 128-bit numbers.
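For what it's worth, this is roughly what the Montgomery arithmetic looks like on the CPU side in a 64-bit version (again a sketch with made-up names, not my actual sieve code; it assumes an odd p below 2^63). The point is that once values are converted into Montgomery form, every multiplication costs a few multiplies plus one conditional subtraction, and the only real division by p happens in the conversion step. The 128-bit intermediates show up here too, as the a*b product:

/* Montgomery multiplication sketch for odd p < 2^63, with R = 2^64.
 * Illustrative only; a real sieve keeps values in Montgomery form
 * across whole chains of multiplications. */
#include <stdint.h>

/* pinv = -p^(-1) mod 2^64, by Newton's iteration (p must be odd) */
static uint64_t mont_pinv(uint64_t p) {
    uint64_t inv = p;                  /* correct to 3 bits, since p*p ≡ 1 (mod 8) */
    for (int i = 0; i < 5; i++)        /* each pass doubles the number of correct bits */
        inv *= 2 - p * inv;
    return 0 - inv;                    /* negate to get -p^(-1) */
}

/* REDC: return a*b*R^(-1) mod p -- no division by p anywhere */
static uint64_t mont_mul(uint64_t a, uint64_t b, uint64_t p, uint64_t pinv) {
    unsigned __int128 t = (unsigned __int128)a * b;        /* 128-bit product */
    uint64_t m = (uint64_t)t * pinv;                       /* m = t * (-p^-1) mod 2^64 */
    uint64_t r = (uint64_t)((t + (unsigned __int128)m * p) >> 64);
    return r >= p ? r - p : r;                             /* one conditional subtract */
}

/* enter Montgomery form: a -> a*R mod p (the one place a real modulus is computed) */
static uint64_t mont_enter(uint64_t a, uint64_t p) {
    return (uint64_t)(((unsigned __int128)a << 64) % p);
}

/* leave Montgomery form: multiply by 1 to strip the extra factor of R */
static uint64_t mont_exit(uint64_t a, uint64_t p, uint64_t pinv) {
    return mont_mul(a, 1, p, pinv);
}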
Overall, anything in this area seems like it should be a multi-year graduate project, not an undergraduate project. I believe llrCUDA was written by a math professor at a Tokyo university.