- Oct 28, 1999
- 62,387
- 8,154
- 126
Wednesday the 12th of this month I get a call from a store 1800 miles away saying that the server that runs the retail equipment wasn't booting. After an hour of troubleshooting on the phone, plus an hour and a half with Dell on the phone I decide that's it's bad, and hop a plane down there. Book at 4:30, fly out at 6:30.
I get down there and end up finding out that the Raid-1 volume is gone and unrecognized by the raid controller. Somehow, the RAID card went on the fritz nuking both drives in the array and tosting the volume. D'oh! So I get the the thing rebuilt, go to apply the backups and find out that the database was writing bad backups the whole time. So...after 42 straight hours of work I manage to get the thing rebuilt, three weeks worth of DB work done in 16 hours, and everything running somewhat smooth.
Fastforward to today. I've been on the road for a week on business and have an accountant here jockey tapes for me. So I come in and take a look at the server and see a big fault light flashing on it and one of the drives in the cage is blinking with a fault too.
Fsck me. I almost started crying. The volume degraded to critical but is in the process of rebuilding itself. I'm crossing my fingers and hoping that all is well. The rebuild should finish up shortly but then I have to do a consistency check on the data to get a final judgement on if I'm totally hosed or not.
It's days like this that make regret hoping into this field
Edit - The server that had the Raid-1 volume die was an 8 month old Dell Poweredge 2600
The one right now that is rebuilding is my Exchange box which is a 1.5 year old Poweredge 2500.
I get down there and end up finding out that the Raid-1 volume is gone and unrecognized by the raid controller. Somehow, the RAID card went on the fritz nuking both drives in the array and tosting the volume. D'oh! So I get the the thing rebuilt, go to apply the backups and find out that the database was writing bad backups the whole time. So...after 42 straight hours of work I manage to get the thing rebuilt, three weeks worth of DB work done in 16 hours, and everything running somewhat smooth.
Fastforward to today. I've been on the road for a week on business and have an accountant here jockey tapes for me. So I come in and take a look at the server and see a big fault light flashing on it and one of the drives in the cage is blinking with a fault too.
Fsck me. I almost started crying. The volume degraded to critical but is in the process of rebuilding itself. I'm crossing my fingers and hoping that all is well. The rebuild should finish up shortly but then I have to do a consistency check on the data to get a final judgement on if I'm totally hosed or not.
It's days like this that make regret hoping into this field
Edit - The server that had the Raid-1 volume die was an 8 month old Dell Poweredge 2600
The one right now that is rebuilding is my Exchange box which is a 1.5 year old Poweredge 2500.