Seti had another unexpected power-outage...

Rattledagger

Elite Member
Feb 5, 2001
2,994
19
81
February 28, 2005
Around 18:00 UTC we had another unexpected lab-wide power outage. Systems were able to shut down more gracefully than last time, but we are leaving many services off as we survey the damage. The cause of these outages is still unknown.
 

Fardringle

Diamond Member
Oct 23, 2000
9,200
765
126
I think it's the aliens trying to shut down the system before we discover their soap opera broadcasts streaming across deep space. The licensing and royalty costs from that type of network coverage would be staggering! :evil: :laugh:
 

Specabecca

Member
Nov 30, 2004
44
0
0
It's funny, each time I have difficulty connecting, I think 'maybe today is the day they make us migrate to SETI BOINC!'

You'd think I'd wise up. :eek:
 

Dragonbate

Senior member
Mar 1, 2004
324
0
0
Again an unexpected outage to SETI due to an unknown cause.... Where's Scully when I need her?!
 

kamper

Diamond Member
Mar 18, 2003
5,513
0
0
Originally posted by: Fardringle
I think it's the aliens trying to shut down the system before we discover their soap opera broadcasts streaming across deep space. The licensing and royalty costs from that type of network coverage would be staggering! :evil: :laugh:
That's almost exactly what I thought when I read the post (except for the soaps part) :p We must be getting too close!!
 

DaFinn

Diamond Member
Jan 24, 2002
4,725
0
0
Bleh... maybe it's time to shut down SETI altogether. They can't even keep the servers running, how can one expect them to find INTELLIGENT life from outer space???
Have they not heard of UPS? I've had my company servers running almost non-stop since 1998. In fact I just sold another of our old and trusty IBM Netfinity servers that's been running 24/7 since then.

:(
 

Rattledagger

Elite Member
Feb 5, 2001
2,994
19
81
February 28, 2005
Around 18:00 UTC we had another unexpected lab-wide power outage. The cause of these random failures is being investigated by campus, but all our systems/databases survived without any corruption. We will be down as we work to further protect ourselves. More info in Technical News.
February 28, 2005 - 22:30 UTC
So we had another unexpected lab-wide power outage again this morning. This time around we had the BOINC database on battery backup so we were able to shut it down safely. After the power returned we brought the database back up briefly to check it out - and it's in perfect health. You can all thank Court for bringing in his personal UPS (and leaving his own systems unprotected) to put on the BOINC database server until we were able to obtain a new one.

But we shut the BOINC database right back down, and will leave most of the BOINC back-end services off for the time being until we have all our important systems on smart UPS (the systems will shut themselves off once they realize they are on battery power). This has always been the future plan (and please note that our previous configuration allowed for zero or minimal loss in the event of a power failure), but now that frequent random outages are part of the scenario, it would make life easier not to have to do damage control every time.
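For anyone wondering what "the systems will shut themselves off once they realize they are on battery power" could look like in practice, here is a rough Python sketch of the decision logic only. It is not the SETI@home setup: the `UpsStatus` fields, the `should_shut_down` function, and the threshold values are all hypothetical, and a real installation would get these readings from whatever monitoring software ships with the UPS.

```python
from dataclasses import dataclass


@dataclass
class UpsStatus:
    """Hypothetical snapshot of a UPS reading (field names are assumptions)."""
    on_battery: bool            # True when mains power is lost
    seconds_on_battery: float   # how long we have been on battery
    charge_percent: float       # remaining battery charge, 0-100


def should_shut_down(status: UpsStatus,
                     grace_seconds: float = 60.0,
                     min_charge_percent: float = 50.0) -> bool:
    """Decide whether the host should begin a clean shutdown.

    Shut down if we are on battery AND either the outage has lasted
    longer than the grace period or the charge has dropped to the floor.
    Both thresholds are illustrative defaults, not recommendations.
    """
    if not status.on_battery:
        return False
    outage_too_long = status.seconds_on_battery >= grace_seconds
    charge_too_low = status.charge_percent <= min_charge_percent
    return outage_too_long or charge_too_low


if __name__ == "__main__":
    # A short blip on mains loss: ride it out.
    print(should_shut_down(UpsStatus(True, 5.0, 98.0)))
    # A sustained outage: stop the database cleanly before the battery runs out.
    print(should_shut_down(UpsStatus(True, 120.0, 90.0)))
```

The grace period is there so that a brief flicker does not take a database offline, while a sustained outage still triggers a clean stop well before the battery is exhausted.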

We are actually going to take this time off to do additional maintenance. For example, the disk array holding the upload/download directories is 98% full - Jeff discovered a bug in the file_deleter code that left a lot of old workunits around. So we need to get rid of those stale files before anything else.
 

JWMiddleton

Diamond Member
Aug 10, 2000
5,686
172
106
I have a stack of WUs waiting to be sent as well. I am amazed that we are having these problems as this seems to be a new type of problem. Surely they have had UPS units before.
 

Rattledagger

Elite Member
Feb 5, 2001
2,994
19
81
They have UPSes in their server closet, but the new BOINC db-server is currently in their lab until they have shuffled things around in the server closet to make room for it.
 

Unforgiven

Golden Member
May 11, 2001
1,827
0
0
2 suggestions for the seti folks....

1) invest in UPS backups and RTFM.

2) go to a Windows operating system, because obviously they don't know what the hell they are doing with their equipment and need something a little more basic that they can get a handle on.
 

Rattledagger

Elite Member
Feb 5, 2001
2,994
19
81
March 1, 2005
Power Outage Update: Since the cause of the random power outages is still unknown, we are leaving the data server off during the evenings (users will not get workunits/send results at that time). We can handle power outages during the day while we're at the lab, and are working towards a better system to handle outages at night. Meanwhile, campus is trying to diagnose and fix the problem (which affects the entire building - not just us).
 

Polo

Diamond Member
Oct 10, 1999
4,185
0
0
Thanks for the info, RD. :)

175 queued results... and my SETI queue is OK. :)
 

Ken g6

Programming Moderator, Elite Member
Moderator
Dec 11, 1999
16,643
4,581
75
Last updated: 20:05:00 (UTC) on 2005-03-01

Data Server Status

The data server is up and running, but dropping connections
:)
 

Ken g6

Programming Moderator, Elite Member
Moderator
Dec 11, 1999
16,643
4,581
75
Getting data - connecting to server.
Receiving data: 10K
Receiving data: 20K
Receiving data: 30K
Receiving data: 40K
Receiving data: 50K
Receiving data: 60K
Receiving data: 70K
...
:D
 

Ken g6

Programming Moderator, Elite Member
Moderator
Dec 11, 1999
16,643
4,581
75
News

March 1, 2005
Power Outage Update: Since the cause of the random power outages is still unknown, we are leaving the data server off during the evenings (users will not get workunits/send results at that time). We can handle power outages during the day while we're at the lab, and are working towards a better system to handle outages at night. Meanwhile, campus is trying to diagnose and fix the problem (which affects the entire building - not just us).
 

BadThad

Lifer
Feb 22, 2000
12,100
49
91
Originally posted by: Ken_g6
News

March 1, 2005
Power Outage Update: Since the cause of the random power outages is still unknown, we are leaving the data server off during the evenings (users will not get workunits/send results at that time). We can handle power outages during the day while we're at the lab, and are working towards a better system to handle outages at night. Meanwhile, campus is trying to diagnose and fix the problem (which affects the entire building - not just us).

ARRRGGGGGGGG :(

 

Soggysocks

Golden Member
Jun 20, 2001
1,250
0
0
quote:
February 28, 2005 - 22:30 UTC
So we had another unexpected lab-wide power outage again this morning. This time around we had the BOINC database on battery backup so we were able to shut it down safely. After the power returned we brought the database back up briefly to check it out - and it's in perfect health. You can all thank Court for bringing in his personal UPS (and leaving his own systems unprotected) to put on the BOINC database server until we were able to obtain a new one.


If I recall... didn't they have a recent shutdown for a power upgrade on the building there?


I think I'd find a new electrical contractor.