I just noticed I have a few COVID19 GPU tasks
WU Distribution Update
We are working towards resuming a consistent WU supply similar to what we had before the storage system failure. The recent sparsity of OPN1 WU was caused by a batch that has blocked the create-work process for all other projects. We have found and fixed the glitch, and the system is busy creating work for OPN1 right now. We still have an ARP1 backlog of unsent results (see ARP project update ), but we now have a spare capacity for a larger backlog. After OPN1 work units are prepared, the system will prepare ARP1 work units.
On the back end, we still had to finalize setup of the new storage as there was a networking issue that was preventing us from accessing the tape archive. Data center admins have helped to fix it, and the production system on the new storage is being backed up.
We continue to investigate the errors in the BOINC system services, specifically assimilators and validators. Unfortunately, the application is written such that an unexpected error halts the service (which happened when our storage system failed). We are attempting to clear out the problematic data to allow the applications to continue processing other results, but BOINC doesn't seem to have an easy method of flushing specific workunits or results out of its system.
If you have any comments or questions, please leave them in this thread for us to answer. Thank you for your support, patience and understanding.
WCG team
Thanks ! I had no idea there were problems, as I have hundreds processing on 6 different boxes.
Yesterday (i.e. Tuesday) evening, my last uploads finally succeeded. Now the wait for the next ARP1 batch commences.80% of the ARP1 results which I made during the past week are still uploading… very, very slowly.