Ok, I need your guys help to convince my local tech-support guy that we have a problem
First, the system...
Tyan S2721GNN - E7501 chipset MB, 533MHz FSB
2x2.6GHz Xeon
4GB PC2100 Registered DDR
3xU160, 73GB, 10K RPM SCSI drives
Mylex RAID controller with 32MB cache
RedHat Linux 8.0
Drives are configured in RAID5 as a single partition
The problem is that the drive performance SUCKS.
Bonnie++ results
Version 1.01d ------Sequential Output------ --Sequential Input- --Random-
-Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks--
Machine Size K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP
localhost.local 14G 4536 16 4495 2 3869 2 23837 74 110626 29 280.0 1
------Sequential Create------ --------Random Create--------
-Create-- --Read--- -Delete-- -Create-- --Read--- -Delete--
files:max:min /sec %CP /sec %CP /sec %CP /sec %CP /sec %CP /sec %CP
localhost.locald 16 2387 86 +++++ +++ +++++ +++ 2604 91 +++++ +++ 6695 100
95 100
Compare to a similar machine, only with 2GB ram and a single 7200RPM IDE drive:
Version 1.03 ------Sequential Output------ --Sequential Input- --Random-
-Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks--
Machine Size K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP
ssao3.aero.org 4G 21846 79 21285 13 7828 3 21298 65 45388 7 114.6 0
------Sequential Create------ --------Random Create--------
-Create-- --Read--- -Delete-- -Create-- --Read--- -Delete--
files /sec %CP /sec %CP /sec %CP /sec %CP /sec %CP /sec %CP
16 2634 96 +++++ +++ +++++ +++ 2742 97 +++++ +++ 7137 99
4G,21846,79,21285,13,7828,3,21298,65,45388,7,114.6,0,16,2634,96,+++++,+++,+++++,+++,2742,97,+++++,+++,7137,99
7137,99
per character & block output (writing to disk) is more then 4x better on the IDE machine, and rewrite is > 2x better!
The last straw came today as I'm trying to move a database onto this machine ... One one of the big tables, the network connection kept failing, so I dumped the table to a file and FTPed it over. The file is about 1GB, and has approx 6 million records and a few indexes. FWIW, this is MySQL we're dealing with.
Now I'm trying to cat the file straight into the database ... it's been going for a few hours now and I have about 60% of the data in the database at this point. Here's what top looks like:
5:45pm up 34 days, 4:10, 4 users, load average: 142.66, 138.09, 126.78
404 processes: 403 sleeping, 1 running, 0 zombie, 0 stopped
CPU0 states: 0.0% user, 0.2% system, 0.0% nice, 99.3% idle
CPU1 states: 0.0% user, 2.1% system, 0.0% nice, 97.3% idle
CPU2 states: 0.0% user, 0.2% system, 0.0% nice, 99.3% idle
CPU3 states: 0.1% user, 0.5% system, 0.0% nice, 98.4% idle
Mem: 3870836K av, 3855188K used, 15648K free, 0K shrd, 137492K buff
Swap: 17864240K av, 15028K used, 17849212K free 3476796K cached
PID USER PRI NI SIZE RSS SHARE STAT %CPU %MEM TIME COMMAND
2051 ******** 16 0 1224 1224 796 R 0.9 0.0 28:40 top
12 root 16 0 0 0 0 DW 0.3 0.0 18:16 bdflush
13 root 15 0 0 0 0 DW 0.3 0.0 0:58 kupdated
24 root 15 0 0 0 0 DW 0.1 0.0 13:02 kjournald
757 root 15 0 3136 728 624 S 0.1 0.0 1:21 ypserv
1 root 15 0 472 436 424 S 0.0 0.0 0:22 init
As you can see, the CPUs are basically idle, but the system load is astronomical, and of course, the performance is dismal.
When I ask one of our support guys about it he says its because of the RAID5 and suggest we do RAID0 and more frequent backups
Or says it might be because we don't have a seperate OS drive (OS is on the RAID partition). I'm not buying it. There is no way this should be that slow. Agreed?? Suggestions??
First, the system...
Tyan S2721GNN - E7501 chipset MB, 533MHz FSB
2x2.6GHz Xeon
4GB PC2100 Registered DDR
3xU160, 73GB, 10K RPM SCSI drives
Mylex RAID controller with 32MB cache
RedHat Linux 8.0
Drives are configured in RAID5 as a single partition
The problem is that the drive performance SUCKS.
Bonnie++ results
Version 1.01d ------Sequential Output------ --Sequential Input- --Random-
-Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks--
Machine Size K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP
localhost.local 14G 4536 16 4495 2 3869 2 23837 74 110626 29 280.0 1
------Sequential Create------ --------Random Create--------
-Create-- --Read--- -Delete-- -Create-- --Read--- -Delete--
files:max:min /sec %CP /sec %CP /sec %CP /sec %CP /sec %CP /sec %CP
localhost.locald 16 2387 86 +++++ +++ +++++ +++ 2604 91 +++++ +++ 6695 100
95 100
Compare to a similar machine, only with 2GB ram and a single 7200RPM IDE drive:
Version 1.03 ------Sequential Output------ --Sequential Input- --Random-
-Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks--
Machine Size K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP
ssao3.aero.org 4G 21846 79 21285 13 7828 3 21298 65 45388 7 114.6 0
------Sequential Create------ --------Random Create--------
-Create-- --Read--- -Delete-- -Create-- --Read--- -Delete--
files /sec %CP /sec %CP /sec %CP /sec %CP /sec %CP /sec %CP
16 2634 96 +++++ +++ +++++ +++ 2742 97 +++++ +++ 7137 99
4G,21846,79,21285,13,7828,3,21298,65,45388,7,114.6,0,16,2634,96,+++++,+++,+++++,+++,2742,97,+++++,+++,7137,99
7137,99
per character & block output (writing to disk) is more then 4x better on the IDE machine, and rewrite is > 2x better!
The last straw came today as I'm trying to move a database onto this machine ... One one of the big tables, the network connection kept failing, so I dumped the table to a file and FTPed it over. The file is about 1GB, and has approx 6 million records and a few indexes. FWIW, this is MySQL we're dealing with.
Now I'm trying to cat the file straight into the database ... it's been going for a few hours now and I have about 60% of the data in the database at this point. Here's what top looks like:
5:45pm up 34 days, 4:10, 4 users, load average: 142.66, 138.09, 126.78
404 processes: 403 sleeping, 1 running, 0 zombie, 0 stopped
CPU0 states: 0.0% user, 0.2% system, 0.0% nice, 99.3% idle
CPU1 states: 0.0% user, 2.1% system, 0.0% nice, 97.3% idle
CPU2 states: 0.0% user, 0.2% system, 0.0% nice, 99.3% idle
CPU3 states: 0.1% user, 0.5% system, 0.0% nice, 98.4% idle
Mem: 3870836K av, 3855188K used, 15648K free, 0K shrd, 137492K buff
Swap: 17864240K av, 15028K used, 17849212K free 3476796K cached
PID USER PRI NI SIZE RSS SHARE STAT %CPU %MEM TIME COMMAND
2051 ******** 16 0 1224 1224 796 R 0.9 0.0 28:40 top
12 root 16 0 0 0 0 DW 0.3 0.0 18:16 bdflush
13 root 15 0 0 0 0 DW 0.3 0.0 0:58 kupdated
24 root 15 0 0 0 0 DW 0.1 0.0 13:02 kjournald
757 root 15 0 3136 728 624 S 0.1 0.0 1:21 ypserv
1 root 15 0 472 436 424 S 0.0 0.0 0:22 init
As you can see, the CPUs are basically idle, but the system load is astronomical, and of course, the performance is dismal.
When I ask one of our support guys about it he says its because of the RAID5 and suggest we do RAID0 and more frequent backups
