So, we had a routine 'power failure' drill at work. The main idea was to test the effects of a mains power failure, and ensure that all critical systems were correctly supported by the backup generators.
All well and good, but a few things struck me as being a bit odd:
Drill was scheduled to start at 0900. Shortly after 0800, instructions went round to cease all computer work, save all data and shut down. By 0830, the core servers started being taken down and powered off prior to the test. Instructions were given not to use any computer until the test was complete and the all clear was given.
All normal work was cancelled, and only 'emergencies' were to be handled using contingency systems (pen and paper). Any appointemnts and meetings were cancelled.
Elevators were all grounded and switched off (20 storey tower blocks FTW)
Once all preparatory work was completed, the generator was started and power switched over. This kept some lights and receptacles in 'critical' areas running. (but not IT systems or elevators)
After 2 hours, the test was stopped after being hailed a resounding success, and IT systems started being restored and elevators were switched back on.
So now we all know we're ready for anything. Yay!
--
Still, better than last time. They tried this a few years ago - same thing. Except that time, the domain servers and other critical servers didn't come back up (apparently some sort of hardware failure - whether it was related to the generator test, or just bad luck when 24/7 critical servers were power cycled). However, that was on a Friday. It took until Monday before the new hardware could be installed. Boy, working that weekend was a real PITA - everything had to be done manually or by phone to departments that had working IT systems.
All well and good, but a few things struck me as being a bit odd:
Drill was scheduled to start at 0900. Shortly after 0800, instructions went round to cease all computer work, save all data and shut down. By 0830, the core servers started being taken down and powered off prior to the test. Instructions were given not to use any computer until the test was complete and the all clear was given.
All normal work was cancelled, and only 'emergencies' were to be handled using contingency systems (pen and paper). Any appointemnts and meetings were cancelled.
Elevators were all grounded and switched off (20 storey tower blocks FTW)
Once all preparatory work was completed, the generator was started and power switched over. This kept some lights and receptacles in 'critical' areas running. (but not IT systems or elevators)
After 2 hours, the test was stopped after being hailed a resounding success, and IT systems started being restored and elevators were switched back on.
So now we all know we're ready for anything. Yay!
--
Still, better than last time. They tried this a few years ago - same thing. Except that time, the domain servers and other critical servers didn't come back up (apparently some sort of hardware failure - whether it was related to the generator test, or just bad luck when 24/7 critical servers were power cycled). However, that was on a Friday. It took until Monday before the new hardware could be installed. Boy, working that weekend was a real PITA - everything had to be done manually or by phone to departments that had working IT systems.