8th Annual BOINC Pentathlon

Discussion in 'Distributed Computing' started by TennesseeTony, Apr 7, 2017.

  1. StefanR5R

    StefanR5R Senior member

    Joined:
    Dec 10, 2016
    Messages:
    498
    Likes Received:
    179
    Maybe that accelerator of theirs isn't as well shielded as it should be?
     
  2. crashtech

    crashtech Diamond Member

    Joined:
    Jan 4, 2013
    Messages:
    6,696
    Likes Received:
    254
    Well I agree that this ought to be a case of user error. Here's a screenshot of my settings:

    [​IMG]

    I did change number of CPUs from "1" to "No limit" today, but nothing else was altered because nothing else was wrong. Both machines are Skylakes, and both had VBox installed. Once Vbox was removed, the problem went away. None of the other machines running LHC have experienced this problem.
     
  3. TennesseeTony

    TennesseeTony Elite Member

    Joined:
    Aug 2, 2003
    Messages:
    1,912
    Likes Received:
    361
    Nooo! All my downloaded Cosmo disappeared with the reporting of the tasks! Got more coming in now, pretty sure the 'old' ones weren't due until the 19th. Odd. I freaked at first before I remembered they were quorum of one. Out of work for a bit though.

    Ah, 20 minutes after GPU shutdown and it's already 5F cooler.
     
  4. crashtech

    crashtech Diamond Member

    Joined:
    Jan 4, 2013
    Messages:
    6,696
    Likes Received:
    254
  5. TennesseeTony

    TennesseeTony Elite Member

    Joined:
    Aug 2, 2003
    Messages:
    1,912
    Likes Received:
    361
    Well that is certainly the plan. Then pass them. :) The hopeful plan at any rate.

    I just understood your joke Stefan, about the accelerator. Lol!
     
  6. uallas5

    uallas5 Senior member

    Joined:
    Jun 3, 2005
    Messages:
    457
    Likes Received:
    33
    I just checked the stats page and we're tied with OcUK at 231 points each! :D
     
  7. Ken g6

    Ken g6 Programming Moderator, Elite Member
    Moderator

    Joined:
    Dec 11, 1999
    Messages:
    12,630
    Likes Received:
    291
    Well, we may not be great at swimming, but it looks like it enabled us to pull off The Italian Job. (One of my favorite movies. ;))

    And since we've beaten them in 3 projects (assuming we keep our lead in the Sprint), a tie is a win for us. :)
     
    TennesseeTony likes this.
  8. TennesseeTony

    TennesseeTony Elite Member

    Joined:
    Aug 2, 2003
    Messages:
    1,912
    Likes Received:
    361
    Thanks to Stefan's incredible bunkering ability, we are able to most likely only worry about LHC for the remainder, while others still have to juggle Cosmo into the mix.
     
  9. StefanR5R

    StefanR5R Senior member

    Joined:
    Dec 10, 2016
    Messages:
    498
    Likes Received:
    179
    @Ken g6, because we are not great at swimming, we are allowed to take the speedboat instead?

    [​IMG]
     
    Ken g6 likes this.
  10. crashtech

    crashtech Diamond Member

    Joined:
    Jan 4, 2013
    Messages:
    6,696
    Likes Received:
    254
    My new 1700X is cranking out LHC for all it's worth! It's tough to say how well anything is doing, is it not? Looks like wild variations between WUs.
     
  11. TennesseeTony

    TennesseeTony Elite Member

    Joined:
    Aug 2, 2003
    Messages:
    1,912
    Likes Received:
    361
    Wild variations in LHC SixTrack? What are your talking about? My tasks are all taking exactly somewhere between 10 seconds and 14 hours. Very consistent. :p ;)

    I mean, really though, 10s, 2m, 6m 10m, 12m, 28m, 32m, 38m, 41m, 48m, 54m, 1h10m, 1h50m, 2h, 3h, 3.5h, 4h, 4h20m, 6h, 7h, 9h, 10h, and on and on. LHC must have the people who designed Einstein's website creating the tasks for them.
     
    iwajabitw, crashtech and Ken g6 like this.
  12. TennesseeTony

    TennesseeTony Elite Member

    Joined:
    Aug 2, 2003
    Messages:
    1,912
    Likes Received:
    361
    Other teams are looking around, baffled. Their MP3 player of choice is just fine. So where is TAAT sound coming from?
     
    iwajabitw and crashtech like this.
  13. crashtech

    crashtech Diamond Member

    Joined:
    Jan 4, 2013
    Messages:
    6,696
    Likes Received:
    254
    I think my new Ryzen 1700X has suddenly made my contributions much more substantial. It's a beast!
     
    TennesseeTony likes this.
  14. StefanR5R

    StefanR5R Senior member

    Joined:
    Dec 10, 2016
    Messages:
    498
    Likes Received:
    179
    One of my boxes is not having a lot success at LHC: It received long stretches of those 10-seconds tasks, and there is also quite a high rate of computational errors. The former means that the queue of "ready to start" tasks goes barely deeper than a few minutes work, or often runs dry. The latter means that there are sometimes dozens of error'ed tasks sitting there to be reported, which somehow causes the boinc client to request new work less frequently. Or rather, the LHC server likely takes note of the error rate and replies with "No tasks are available" more often then not, and the client increases its intervals in which it asks for new work. If not monitored closely, these backoff periods soon become hours, or a day.

    (Remember: "No tasks are available" at LHC does not mean that LHC doesn't have any work ready. They have plenty. It only means that LHC denies that particular client any new work for whatever reasons, for whatever duration.)

    LHC's strategy is to send out alpha-quality WUs, and to limit the ensuing damage by keeping clients' task queues shallower than shallow.
     
    #439 StefanR5R, May 16, 2017
    Last edited: May 16, 2017
  15. Kiska

    Kiska Senior member

    Joined:
    Apr 4, 2012
    Messages:
    391
    Likes Received:
    48
    I am not sure what is causing that since all of mine validate fine: https://pastebin.com/nuLa84bQ
    Its a defence mechanism, against faulty computers. I would recommend looking into the issue. Then if the server sees valid work it will slowly increase work limits
     
  16. StefanR5R

    StefanR5R Senior member

    Joined:
    Dec 10, 2016
    Messages:
    498
    Likes Received:
    179
    Those are likely application bugs, not a host-side issue. The box had been doing fine at the other projects so far.
     
  17. StefanR5R

    StefanR5R Senior member

    Joined:
    Dec 10, 2016
    Messages:
    498
    Likes Received:
    179
    Glad to read that the "That company" pseudonym found some appreciation at [H].
     
  18. Kiska

    Kiska Senior member

    Joined:
    Apr 4, 2012
    Messages:
    391
    Likes Received:
    48
    Because here are my task stats:
    State: All (1230) · In progress (378) · Validation pending (115) · Validation inconclusive (7) · Valid (716) · Invalid (0) · Error (14)

    I would love to look at your task list so I can see any issues, but you have your computers set to hidden...
     
  19. StefanR5R

    StefanR5R Senior member

    Joined:
    Dec 10, 2016
    Messages:
    498
    Likes Received:
    179
    [LHC SixTrack]

    My machine with the high error rate is the only one which sets the estimated remaining time of newly downloaded tasks to something on the order of 10 seconds, even though it had been working at 8-hour units lately. The estimation is obviously still trained badly from the first few downloads which were mostly few-seconds tasks.

    I wonder if the watchdog timer which kills a task when elapsed time exceeds this timer, is influenced by the initial estimated runtime of a WU.

    I deinstalled VirtualBox and rebooted now, although presence of VirtualBox should have no influence on SixTrack. Next I will let the current tasks finish, then detach the machine from LHC, then reattach, and hope that the very first batch of downloaded WUs is of a reasonable sort.

    Erroring machine:
    OpenSuse
    Validation pending (378) · Validation inconclusive (17) · Valid (839) · Invalid (0) · Error (486)
    total credit = 18,000​
    Successful machine with same hardware but different OS:
    Gentoo
    Validation pending (83) · Validation inconclusive (9) · Valid (542) · Invalid (0) · Error (0)
    total credit = 46,000​

    Both machines have been doing LHC/SixTrack for about the same time now; actually the good one started a bit earlier. The large difference of total number of tasks (1720 : 634) shows how the former machine had been abused for many of those few-seconds WUs. The latter machine got its share of short WUs too, but evidently not as many.
     
  20. Orange Kid

    Orange Kid Elite Member

    Joined:
    Oct 9, 1999
    Messages:
    3,120
    Likes Received:
    154
    I have one box that has errors also. All of the bad tasks have 5/16 for a date. both received and returned. I am guessing a bad batch of work got out?
     
  21. Orange Kid

    Orange Kid Elite Member

    Joined:
    Oct 9, 1999
    Messages:
    3,120
    Likes Received:
    154
    all the bad work was..
    SixTrack v451.07 (pni)
    windows_intelx86
     
  22. StefanR5R

    StefanR5R Senior member

    Joined:
    Dec 10, 2016
    Messages:
    498
    Likes Received:
    179
    Almost all errors which I had, i.e. hundreds, were from
    SixTrack v451.07 (pni) i686-pc-linux-gnu
    SixTrack v451.07 (pni) x86_64-pc-linux-gnu​
    Only a few (just 4) were from
    SixTrack v451.07 (sse2) x86_64-pc-linux-gnu​

    Removing and re-adding the project did not help.
    Adding LHC to a different client instance on the same box may have helped. I will know more in a few hours, or tomorrow.
     
  23. TennesseeTony

    TennesseeTony Elite Member

    Joined:
    Aug 2, 2003
    Messages:
    1,912
    Likes Received:
    361
    There you go buddy, a little trophy for you to hang on your wall! (Yoyo)

    [​IMG]
     
    crashtech likes this.
  24. StefanR5R

    StefanR5R Senior member

    Joined:
    Dec 10, 2016
    Messages:
    498
    Likes Received:
    179
  25. Ken g6

    Ken g6 Programming Moderator, Elite Member
    Moderator

    Joined:
    Dec 11, 1999
    Messages:
    12,630
    Likes Received:
    291
    Now getting:

    Tue 16 May 2017 04:23:20 PM MDT | LHC@home | No tasks are available for SixTrack