chirping HDD and 3ware 9550SX raid...

Cr0nJ0b

Golden Member
Apr 13, 2004
Hey everyone,

I'm trying to figure something out. I've just put a 9550 into my backup server with 10 x 750GB SATA drives. The drives are in pretty good shape, but they aren't brand new. When I put the drives in and started the controller, I started hearing a chirping from the disks. I'm not sure which disk or disks are chirping, but it worries me a bit. I have the same drive types on an Areca controller with no audible chirps. I'm starting to think that one or more of the drives is going bad with head resets, or the controller is doing something funky. Right now I have the drives set up as a 10-drive RAID 0 stripe, but I'm planning to break that up, rebuild it as JBOD, and do some disk checks... but I wanted to get some input from the experts as I go.

This is for a backup volume, so it's important, but not critical. Once it's set up, I will turn it on once a week or so for a backup/sync and then just shut it down.

thanks
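
For the per-disk checks mentioned above, one option is to read each drive's SMART data through the controller rather than guessing by ear. Here is a minimal sketch that only builds the smartctl command lines. It assumes smartmontools is installed, that the controller appears as /dev/twa0 (usual for the 3w-9xxx driver, but check your system), and that the ten drives sit on ports 0 to 9. The helper name smartctl_commands is made up for illustration.

```python
def smartctl_commands(ports, device="/dev/twa0"):
    """Build one smartctl command per controller port (hypothetical helper).

    "-d 3ware,N" asks smartctl to address the disk on port N behind the
    3ware controller; the device path and port range are assumptions.
    """
    return [["smartctl", "-a", "-d", f"3ware,{port}", device] for port in ports]


# Print the commands instead of running them, so nothing touches the array.
for cmd in smartctl_commands(range(10)):
    print(" ".join(cmd))
```

Run as root, each command prints that drive's SMART attributes. Reallocated and pending sector counts that are non-zero or climbing on one port would point at the suspect disk.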
 

brokendash

Junior Member
Sep 29, 2012
I'm also seeing this. It only seems to be one drive, but I'm not 100% certain. It also seems to happen when I initiate a rebuild of one of the sub-mirrors, although tw_cli shows nothing that appears to indicate a problem. I'm not sure why, but this drive shows up as "Not In Use" when I reboot my machine. The rebuild seems to complete, but it appears that I have to start the sub-mirror rebuild manually from the controller's Alt-3 boot menu. I can only assume that one of the drives is on its way out, since the problem appears to be isolated to an individual port on the controller.

I had to use this 3ware card initially because, for some reason, I was unable to install Debian Sid on the machine's onboard SATA controller, but that was about a year ago. I think the issue was resolved, but since the 3ware RAID set worked I simply left it in the system. The chirping was the only issue I'd had until today, when all of a sudden my machine started throwing I/O errors from the shell for every command. SysRq wouldn't even work, so I had to hard-cycle the power. Everything came up fine; I manually started a rebuild at boot and then noticed the rebuild pause, restart, and finally finish. Anyway, I hope this info helps...

Also, confirming that you're able to dump your backups onto the array is not the best way to make sure it's working. Assuming you work out your hardware issue, you should consider performing the entire process, i.e. perform your backups, then do an actual restore as practice. But that's just my opinion... :) Cheers
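
To make the restore test concrete, here is a rough sketch of one way to check a practice restore: hash every file in the original tree and in the restored copy, then list what is missing or different. The function names and the two directory arguments are made up for illustration, and this only compares file contents, not permissions or timestamps.

```python
import hashlib
from pathlib import Path


def tree_digests(root):
    """Map each file path (relative to root) to its SHA-256 hex digest."""
    root = Path(root)
    digests = {}
    for path in sorted(root.rglob("*")):
        if path.is_file():
            h = hashlib.sha256()
            with path.open("rb") as f:
                # Read in 1 MiB chunks so large backup files fit in memory.
                for chunk in iter(lambda: f.read(1 << 20), b""):
                    h.update(chunk)
            digests[path.relative_to(root).as_posix()] = h.hexdigest()
    return digests


def compare_trees(source, restored):
    """Return (missing, changed): files absent from, or different in, the restore."""
    src, dst = tree_digests(source), tree_digests(restored)
    missing = sorted(set(src) - set(dst))
    changed = sorted(p for p in src if p in dst and src[p] != dst[p])
    return missing, changed
```

Calling compare_trees on the live data and a scratch restore directory should give two empty lists if the backup round-trips cleanly, as long as the source has not changed since the backup ran.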

My Setup:
ASUS M5A78L-MLX
8G Corsair Value Select
3ware 9500S
4 Seagate 400G SATA

-- show diag --
### Time Stamp: 00:11:30 29-Sep-2012
### Host Name: srv1
### Host Architecture: x86_64 (64 bit)
### OS Version: Linux 3.2.0-3-amd64
### Model: 9500S-4LP
### Serial #: D19004A5360654
### Controller ID: 0
### CLI Version: 2.00.11.020
### API Version: 2.08.00.023
### Driver Version: 2.26.02.014
### Firmware Version: FE9X 2.08.00.009
### BIOS Version: BE9X 2.03.01.052
### Available Memory: 112MB


Rollcall, Begin : find drives, read DCBs
--Port[ 0]-
DIT status: DRV_PRESENT (0xFF)
Model #: ST3400633AS
Serial #: 3PM03XK4
Drv FW #: 3.AAD
Capacity: 781422768 (0x2E9390B0)
Features: SMART: 1, Security: 1, 48-bit addr: 1, Acoustic: 0,
Feat. Ext: TimeLimited R/W: 0, WDMA FUA: 0, Stream: 0
Security: Status=0x1 (disabled, unlocked)
SATA NCQ: 1
Udma Mode: 0x5 (UDMA-100)
Pwr Cycles: 597
--Port[ 1]-
DIT status: DRV_PRESENT (0xFF)
Model #: ST3400832AS
Serial #: 4NF0RWPQ
Drv FW #: 3.06
Capacity: 781422768 (0x2E9390B0)
Features: SMART: 1, Security: 1, 48-bit addr: 1, Acoustic: 0,
Feat. Ext: TimeLimited R/W: 0, WDMA FUA: 0, Stream: 0
Security: Status=0x3 (ENABLED, unlocked)
SATA NCQ: 1
Udma Mode: 0x5 (UDMA-100)
Pwr Cycles: 615
--Port[ 2]-
DIT status: DRV_PRESENT (0xFF)
Model #: ST3400633AS
Serial #: 3PM01C4A
Drv FW #: 3.AAD
Capacity: 781422768 (0x2E9390B0)
Features: SMART: 1, Security: 1, 48-bit addr: 1, Acoustic: 0,
Feat. Ext: TimeLimited R/W: 0, WDMA FUA: 0, Stream: 0
Security: Status=0x3 (ENABLED, unlocked)
SATA NCQ: 1
Udma Mode: 0x5 (UDMA-100)
Pwr Cycles: 617
--Port[ 3]-
DIT status: DRV_PRESENT (0xFF)
Model #: ST3400832AS
Serial #: 4NF0NYW8
Drv FW #: 3.03
Capacity: 781422768 (0x2E9390B0)
Features: SMART: 1, Security: 1, 48-bit addr: 1, Acoustic: 0,
Feat. Ext: TimeLimited R/W: 0, WDMA FUA: 0, Stream: 0
Security: Status=0x3 (ENABLED, unlocked)
SATA NCQ: 1
Udma Mode: 0x5 (UDMA-100)
Pwr Cycles: 623


UNIT: 0, capacity: 1562456064, is ONline
Stripelet Size=64k
Normal RAID0 (80) cap:1562456064 (task: 0%) (Unclean=0)
Normal TWINSTOR (80) cap:781228032 (task: 0%) (Unclean=0)
CBOD (0)[1] cap: 781228032 (lbaOff 0)
CBOD (1)[3] cap: 781228032 (lbaOff 0)
Degrade TWINSTOR (81) cap:781228032 (task: 0%) (Unclean=0)
+> CBOD (2)[0] cap: 781228032 (lbaOff 0)
CBOD (3)[2] cap: 781228032 (lbaOff 0)
Cache Settings: WCE=1 - RCD=1
LUN 0x0 Start LBA 0x0 Size 0x5d213000

Updating cache settings for unit: 0
Synchronizing cache write data...
DcbMgr::UpdateStatus: UNIT 0 (time 144614681)
DcbMgr::WriteSegment(map=0xE, segID=0x8, events=6, error=0x0)
DcbMgr::UpdateStatus: (finish 144614684)
Send AEN (code, time): 0x5e, 09/28/2012 12:02:40
Cache synchronized after power fail
(EC:0x5e, SK=0x00, ASC=0x00, ASCQ=0x00, SEV=03, Type=0x71)
unit=0
D: 0x0
P: 0x0
Send AEN (code, time): 0x1, 09/28/2012 12:02:40
Controller reset occurred
(EC:0x01, SK=0x06, ASC=0x29, ASCQ=0x00, SEV=03, Type=0x71)
resets=30
InitConnect credits: 256
Rollcall, Begin : find drives, read DCBs

Controller reset occurred
(EC:0x01, SK=0x06, ASC=0x29, ASCQ=0x00, SEV=03, Type=0x71)
resets=30
InitConnect credits: 256
Rollcall, Begin : find drives, read DCBs

E=0208 I=00925AFC T=12:02:45 : Not ready error
E=0208 I=00925AFC T=12:02:45 P=2 : Soft reset drive
ata task file read back : st dh ch cl sn sc er
: 80 00 00 00 00 00 00
E=0207 I=00925AFC T=12:02:45 P=2 : Reset failed
E=0208 I=00925AFC T=12:02:45 P=2 : Sata bridge reset
ata task file read back : st dh ch cl sn sc er
: 50 00 00 00 01 01 01
E=0208 I=00925AFC T=12:02:45 P=2 : Unlock drive
E=0208 I=00925AFC T=12:02:45 P=2 : Retrying chain

E=0208 I=00925AD4 T=12:02:45 : Not ready error
E=0208 I=00925AD4 T=12:02:45 P=2 : Soft reset drive
ata task file read back : st dh ch cl sn sc er
: 50 00 00 00 01 01 01
E=0208 I=00925AD4 T=12:02:45 P=2 : Retrying chain
Auto Clean: 0
DcbMgr::UpdateStatus: UNIT 0 (time 144615269)
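
The event lines at the end of the diag output carry a port tag (P=n), so they can be tallied per port to see where the resets cluster. Here is a small sketch, assuming the line layout seen in this particular dump (E=code I=id T=time, optional P=port, then a message). The regular expression is inferred from these lines only, not taken from any 3ware documentation.

```python
import re
from collections import Counter

# Layout inferred from the dump above; the P= field is optional.
EVENT_RE = re.compile(
    r"^E=(?P<code>[0-9A-Fa-f]+)\s+I=\S+\s+T=(?P<time>[\d:]+)"
    r"(?:\s+P=(?P<port>\d+))?\s*:\s*(?P<msg>.+)$"
)


def events_per_port(lines):
    """Count (port, message) pairs; lines without a P= tag are skipped."""
    counts = Counter()
    for line in lines:
        m = EVENT_RE.match(line.strip())
        if m and m.group("port") is not None:
            counts[(int(m.group("port")), m.group("msg").strip())] += 1
    return counts


SAMPLE = """\
E=0208 I=00925AFC T=12:02:45 : Not ready error
E=0208 I=00925AFC T=12:02:45 P=2 : Soft reset drive
E=0207 I=00925AFC T=12:02:45 P=2 : Reset failed
E=0208 I=00925AFC T=12:02:45 P=2 : Sata bridge reset
E=0208 I=00925AFC T=12:02:45 P=2 : Unlock drive
E=0208 I=00925AFC T=12:02:45 P=2 : Retrying chain
E=0208 I=00925AD4 T=12:02:45 : Not ready error
E=0208 I=00925AD4 T=12:02:45 P=2 : Soft reset drive
E=0208 I=00925AD4 T=12:02:45 P=2 : Retrying chain
"""

for (port, msg), n in sorted(events_per_port(SAMPLE.splitlines()).items()):
    print(f"port {port}: {msg} x{n}")
```

In this excerpt every port-tagged event is on P=2, which fits the impression that the trouble is isolated to a single port.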



--dmesg--
[ 461.918207] 3w-9xxx: scsi0: AEN: INFO (0x04:0x000B): Rebuild started:unit=0.
[ 1506.767714] 3w-9xxx: scsi0: AEN: INFO (0x04:0x003B): Rebuild paused:unit=0.
[ 1516.973169] 3w-9xxx: scsi0: AEN: INFO (0x04:0x000B): Rebuild started:unit=0.
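
The dmesg timestamps also show how long the rebuild ran before it paused. Here is a quick sketch that parses the three AEN lines above and prints the gap between consecutive events. The pattern is written against these lines only and may need adjusting for other message types.

```python
import re

# Pattern inferred from the three dmesg lines above.
AEN_RE = re.compile(
    r"^\[\s*(?P<ts>\d+\.\d+)\]\s+3w-9xxx: scsi\d+: AEN: \w+ "
    r"\(0x[0-9A-Fa-f]+:0x[0-9A-Fa-f]+\): (?P<msg>.+?):unit=(?P<unit>\d+)\.?$"
)


def parse_aen(lines):
    """Return (seconds_since_boot, message, unit) for each matching line."""
    events = []
    for line in lines:
        m = AEN_RE.match(line.strip())
        if m:
            events.append((float(m.group("ts")), m.group("msg"), int(m.group("unit"))))
    return events


def gaps(events):
    """Seconds elapsed between each pair of consecutive events."""
    return [(a[1], b[1], b[0] - a[0]) for a, b in zip(events, events[1:])]


DMESG = """\
[ 461.918207] 3w-9xxx: scsi0: AEN: INFO (0x04:0x000B): Rebuild started:unit=0.
[ 1506.767714] 3w-9xxx: scsi0: AEN: INFO (0x04:0x003B): Rebuild paused:unit=0.
[ 1516.973169] 3w-9xxx: scsi0: AEN: INFO (0x04:0x000B): Rebuild started:unit=0.
"""

for before, after, seconds in gaps(parse_aen(DMESG.splitlines())):
    print(f"{before} -> {after}: {seconds:.1f} s")
```

For these lines the rebuild ran for roughly 1045 seconds (about 17 minutes) before pausing, then restarted about 10 seconds later.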