I'm at a loss: what keeps crashing my system?

Red Squirrel

OK, long story short: a month or so ago we had a power outage. The UPS kicked in and eventually shut the system down cleanly. The outage lasted several hours. When the power came back, I turned everything back on.

As I was loading my VMs, I lost a drive, then lost another, and the RAID 5 array was broken. After messing with it for a while I managed to get it back online, but I did lose one drive, and the array stayed in that state.

I ordered new backplanes and all new drives, and replaced all the cables. That seemed to fix the problem for about a month. Today it started crashing again, with tons of these errors:

Code:
INFO: task kjournald:1202 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
kjournald     D ffff88021a845040     0  1202      2
 ffff8802188a5ca0 0000000000000046 0000000000000086 ffff8802195e8000
 ffffffff8162a500 ffffffff8162a500 ffff88021956adc0 ffff8801001b2dc0
 ffff88021956b108 00000003a0286a3c ffff88021aad96c8 ffff88021956b108
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0de1>] __wait_on_bit+0x47/0x79
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0e7d>] out_of_line_wait_on_bit+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff8113cb81>] ? submit_bio+0xe0/0xe9
 [<ffffffff810e150a>] __wait_on_buffer+0x36/0x3a
 [<ffffffffa0024deb>] wait_on_buffer+0x41/0x45 [jbd]
 [<ffffffffa002540e>] journal_commit_transaction+0x55d/0xf2f [jbd]
 [<ffffffff81049a0d>] ? try_to_del_timer_sync+0x58/0x63
 [<ffffffffa0028ad8>] kjournald+0xe3/0x23a [jbd]
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffffa00289f5>] ? kjournald+0x0/0x23a [jbd]
 [<ffffffff810534bf>] kthread+0x49/0x76
 [<ffffffff81011719>] child_rip+0xa/0x11
 [<ffffffff81010a37>] ? restore_args+0x0/0x30
 [<ffffffff81053476>] ? kthread+0x0/0x76
 [<ffffffff8101170f>] ? child_rip+0x0/0x11

INFO: task VirtualBox:25932 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
VirtualBox    D 0000000000000002     0 25932  25560
 ffff8801d9e51928 0000000000000086 0000000000000096 ffff88021b4505d0
 ffffffff8162a500 ffffffff8162a500 ffff8800c2a20000 ffff88021b9fadc0
 ffff8800c2a20348 00000000a0286a3c ffff88021aad96c8 ffff8800c2a20348
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0de1>] __wait_on_bit+0x47/0x79
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0e7d>] out_of_line_wait_on_bit+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff8113cb81>] ? submit_bio+0xe0/0xe9
 [<ffffffff810e150a>] __wait_on_buffer+0x36/0x3a
 [<ffffffff810e154f>] wait_on_buffer+0x41/0x45
 [<ffffffff810e1b53>] sync_dirty_buffer+0x58/0xa7
 [<ffffffffa0023eec>] journal_dirty_data+0xed/0x1cb [jbd]
 [<ffffffffa00375d4>] ext3_journal_dirty_data+0x18/0x40 [ext3]
 [<ffffffffa003557b>] walk_page_buffers+0x6e/0x96 [ext3]
 [<ffffffffa00375bc>] ? ext3_journal_dirty_data+0x0/0x40 [ext3]
 [<ffffffffa0036f75>] ext3_ordered_write_end+0x7d/0x127 [ext3]
 [<ffffffff8108e167>] generic_file_buffered_write+0x1b8/0x643
 [<ffffffff810d623b>] ? mnt_drop_write+0x82/0x143
 [<ffffffff810d453d>] ? mnt_want_write+0x77/0x8d
 [<ffffffff8108e9e7>] __generic_file_aio_write_nolock+0x25e/0x292
 [<ffffffff8108f233>] generic_file_aio_write+0x67/0xc3
 [<ffffffffa003443f>] ext3_file_write+0x1e/0x9f [ext3]
 [<ffffffff810beb88>] do_sync_write+0xe7/0x12d
 [<ffffffffa03d88a2>] ? RTMemFree+0x1e/0x20 [vboxdrv]
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffff812c20fa>] ? _spin_lock+0x9/0xc
 [<ffffffff81031103>] ? need_resched+0x1e/0x28
 [<ffffffff81120e84>] ? security_file_permission+0x11/0x13
 [<ffffffff810bf444>] vfs_write+0xab/0x105
 [<ffffffff810bf562>] sys_write+0x47/0x6f
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task VirtualBox:25970 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
VirtualBox    D 0000000000000002     0 25970   3154
 ffff8801d6ed9948 0000000000000086 0000000000000092 ffff8802195e8000
 ffffffff8162a500 ffffffff8162a500 ffff8801f8c75b80 ffff88021b9fadc0
 ffff8801f8c75ec8 00000000a0286a3c ffff88021aad96c8 ffff8801f8c75ec8
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0cdc>] __wait_on_bit_lock+0x45/0x8c
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0d8d>] out_of_line_wait_on_bit_lock+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff810e1a7c>] __lock_buffer+0x25/0x27
 [<ffffffffa002361f>] lock_buffer+0x35/0x3a [jbd]
 [<ffffffffa0024149>] do_get_write_access+0x72/0x404 [jbd]
 [<ffffffffa0027cfb>] ? journal_add_journal_head+0xc7/0x14e [jbd]
 [<ffffffffa00244fd>] journal_get_write_access+0x22/0x33 [jbd]
 [<ffffffffa0042806>] __ext3_journal_get_write_access+0x1f/0x48 [ext3]
 [<ffffffffa0036391>] ext3_reserve_inode_write+0x3f/0x76 [ext3]
 [<ffffffffa0036403>] ext3_mark_inode_dirty+0x3b/0x58 [ext3]
 [<ffffffffa0036564>] ext3_dirty_inode+0x6c/0x83 [ext3]
 [<ffffffff810dc8f7>] __mark_inode_dirty+0x33/0x190
 [<ffffffff810d13e0>] file_update_time+0xbe/0x102
 [<ffffffff8108e8f1>] __generic_file_aio_write_nolock+0x168/0x292
 [<ffffffff8108f233>] generic_file_aio_write+0x67/0xc3
 [<ffffffffa003443f>] ext3_file_write+0x1e/0x9f [ext3]
 [<ffffffff810beb88>] do_sync_write+0xe7/0x12d
 [<ffffffffa03d88a2>] ? RTMemFree+0x1e/0x20 [vboxdrv]
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffff81031103>] ? need_resched+0x1e/0x28
 [<ffffffff81120e84>] ? security_file_permission+0x11/0x13
 [<ffffffff810bf444>] vfs_write+0xab/0x105
 [<ffffffff810bf562>] sys_write+0x47/0x6f
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task VirtualBox:25990 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
VirtualBox    D ffff880183ffd6c0     0 25990   3145
 ffff880179ba1948 0000000000000086 0000000000000092 ffff8802195e8000
 ffffffff8162a500 ffffffff8162a500 ffff88021151adc0 ffff8801001b2dc0
 ffff88021151b108 00000003a0286a3c ffff88021aad96c8 ffff88021151b108
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0cdc>] __wait_on_bit_lock+0x45/0x8c
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0d8d>] out_of_line_wait_on_bit_lock+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff810e1a7c>] __lock_buffer+0x25/0x27
 [<ffffffffa002361f>] lock_buffer+0x35/0x3a [jbd]
 [<ffffffffa0024149>] do_get_write_access+0x72/0x404 [jbd]
 [<ffffffffa0027cfb>] ? journal_add_journal_head+0xc7/0x14e [jbd]
 [<ffffffffa00244fd>] journal_get_write_access+0x22/0x33 [jbd]
 [<ffffffffa0042806>] __ext3_journal_get_write_access+0x1f/0x48 [ext3]
 [<ffffffffa0036391>] ext3_reserve_inode_write+0x3f/0x76 [ext3]
 [<ffffffffa0036403>] ext3_mark_inode_dirty+0x3b/0x58 [ext3]
 [<ffffffffa0036564>] ext3_dirty_inode+0x6c/0x83 [ext3]
 [<ffffffff810dc8f7>] __mark_inode_dirty+0x33/0x190
 [<ffffffff810d13e0>] file_update_time+0xbe/0x102
 [<ffffffff8108e8f1>] __generic_file_aio_write_nolock+0x168/0x292
 [<ffffffff8108f233>] generic_file_aio_write+0x67/0xc3
 [<ffffffffa003443f>] ext3_file_write+0x1e/0x9f [ext3]
 [<ffffffff810beb88>] do_sync_write+0xe7/0x12d
 [<ffffffffa03d88a2>] ? RTMemFree+0x1e/0x20 [vboxdrv]
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffff81031103>] ? need_resched+0x1e/0x28
 [<ffffffff81120e84>] ? security_file_permission+0x11/0x13
 [<ffffffff810bf444>] vfs_write+0xab/0x105
 [<ffffffff810bf562>] sys_write+0x47/0x6f
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task VirtualBox:20126 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
VirtualBox    D 0000000000000002     0 20126   3145
 ffff880090a2b9a8 0000000000000082 0000000000000092 ffff8802195e8000
 ffffffff8162a500 ffffffff8162a500 ffff8801d6075b80 ffff88021b9fadc0
 ffff8801d6075ec8 00000000a0286a3c ffff88021aad96c8 ffff8801d6075ec8
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0cdc>] __wait_on_bit_lock+0x45/0x8c
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0d8d>] out_of_line_wait_on_bit_lock+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff810e1a7c>] __lock_buffer+0x25/0x27
 [<ffffffffa002361f>] lock_buffer+0x35/0x3a [jbd]
 [<ffffffffa0024149>] do_get_write_access+0x72/0x404 [jbd]
 [<ffffffffa0027cfb>] ? journal_add_journal_head+0xc7/0x14e [jbd]
 [<ffffffffa00244fd>] journal_get_write_access+0x22/0x33 [jbd]
 [<ffffffffa0042806>] __ext3_journal_get_write_access+0x1f/0x48 [ext3]
 [<ffffffffa0036391>] ext3_reserve_inode_write+0x3f/0x76 [ext3]
 [<ffffffffa0036403>] ext3_mark_inode_dirty+0x3b/0x58 [ext3]
 [<ffffffffa0036564>] ext3_dirty_inode+0x6c/0x83 [ext3]
 [<ffffffff810dc8f7>] __mark_inode_dirty+0x33/0x190
 [<ffffffff810d1536>] touch_atime+0x112/0x11d
 [<ffffffff8108ef55>] generic_file_aio_read+0x53a/0x595
 [<ffffffff810becb5>] do_sync_read+0xe7/0x12d
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffff8103c847>] ? finish_task_switch+0x31/0xc9
 [<ffffffff812c0812>] ? thread_return+0xab/0xd9
 [<ffffffff81031103>] ? need_resched+0x1e/0x28
 [<ffffffff81120e84>] ? security_file_permission+0x11/0x13
 [<ffffffff810bf632>] vfs_read+0xa8/0x102
 [<ffffffff810bf750>] sys_read+0x47/0x6e
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task rsync:11446 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
rsync         D ffff8801ef08d380     0 11446  11438
 ffff880111159948 0000000000000082 0000000000000092 ffff8802195e8000
 ffffffff8162a500 ffffffff8162a500 ffff880201ec16e0 ffff8801006c96e0
 ffff880201ec1a28 00000003a0286a3c ffff88021aad96c8 ffff880201ec1a28
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0cdc>] __wait_on_bit_lock+0x45/0x8c
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0d8d>] out_of_line_wait_on_bit_lock+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff810e1a7c>] __lock_buffer+0x25/0x27
 [<ffffffffa002361f>] lock_buffer+0x35/0x3a [jbd]
 [<ffffffffa0024149>] do_get_write_access+0x72/0x404 [jbd]
 [<ffffffffa0027cfb>] ? journal_add_journal_head+0xc7/0x14e [jbd]
 [<ffffffffa00244fd>] journal_get_write_access+0x22/0x33 [jbd]
 [<ffffffffa0042806>] __ext3_journal_get_write_access+0x1f/0x48 [ext3]
 [<ffffffffa0036391>] ext3_reserve_inode_write+0x3f/0x76 [ext3]
 [<ffffffffa0036403>] ext3_mark_inode_dirty+0x3b/0x58 [ext3]
 [<ffffffffa0036564>] ext3_dirty_inode+0x6c/0x83 [ext3]
 [<ffffffff810dc8f7>] __mark_inode_dirty+0x33/0x190
 [<ffffffff810d13e0>] file_update_time+0xbe/0x102
 [<ffffffff8108e8f1>] __generic_file_aio_write_nolock+0x168/0x292
 [<ffffffff812342ca>] ? __sock_recvmsg+0x6d/0x7a
 [<ffffffff8108f233>] generic_file_aio_write+0x67/0xc3
 [<ffffffffa003443f>] ext3_file_write+0x1e/0x9f [ext3]
 [<ffffffff810beb88>] do_sync_write+0xe7/0x12d
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffff8103c847>] ? finish_task_switch+0x31/0xc9
 [<ffffffff812c20fa>] ? _spin_lock+0x9/0xc
 [<ffffffff81120e84>] ? security_file_permission+0x11/0x13
 [<ffffffff810bf444>] vfs_write+0xab/0x105
 [<ffffffff810bf562>] sys_write+0x47/0x6f
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task smbd:26021 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
smbd          D 0000000000000002     0 26021   2498
 ffff88017e9a5bb8 0000000000000086 ffff88021a8c3a00 0000000000000000
 ffffffff8162a500 ffffffff8162a500 ffff88019eceadc0 ffff88021b9fadc0
 ffff88019eceb108 000000022801e568 ffff88017e9a5b88 ffff88019eceb108
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff8108d285>] sync_page+0x51/0x58
 [<ffffffff812c0de1>] __wait_on_bit+0x47/0x79
 [<ffffffff8108d234>] ? sync_page+0x0/0x58
 [<ffffffff8108d4f1>] wait_on_page_bit+0x6e/0x75
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff81031103>] ? need_resched+0x1e/0x28
 [<ffffffff810965d2>] wait_on_page_writeback+0x2a/0x2e
 [<ffffffff81096e3c>] truncate_inode_pages_range+0x2e1/0x361
 [<ffffffff8109e06e>] ? unmap_mapping_range+0x21c/0x22c
 [<ffffffff810b7ac7>] ? virt_to_head_page+0x31/0x41
 [<ffffffff81096ec9>] truncate_inode_pages+0xd/0x10
 [<ffffffff8109e1f3>] vmtruncate+0x96/0xe4
 [<ffffffff810d29c8>] inode_setattr+0x2b/0x125
 [<ffffffffa003670b>] ext3_setattr+0x190/0x1f7 [ext3]
 [<ffffffff810d2c58>] notify_change+0x196/0x2ee
 [<ffffffff810be03d>] do_truncate+0x63/0x81
 [<ffffffff810be13c>] sys_ftruncate+0xe1/0xfe
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task smbd:26023 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
smbd          D ffff8801c5ab5a00     0 26023   2498
 ffff8801d28c1af8 0000000000000086 0000000000000092 ffff8802195e8000
 ffffffff8162a500 ffffffff8162a500 ffff8801232516e0 ffff8800617444a0
 ffff880123251a28 00000002a0286a3c ffff88021aad96c8 ffff880123251a28
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0cdc>] __wait_on_bit_lock+0x45/0x8c
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0d8d>] out_of_line_wait_on_bit_lock+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff81031dd9>] ? __wake_up_common+0x46/0x76
 [<ffffffff810e1a7c>] __lock_buffer+0x25/0x27
 [<ffffffffa002361f>] lock_buffer+0x35/0x3a [jbd]
 [<ffffffffa00236bb>] journal_invalidatepage+0x97/0x2a6 [jbd]
 [<ffffffffa003736f>] ext3_invalidatepage+0x3c/0x3e [ext3]
 [<ffffffff8109656d>] do_invalidatepage+0x20/0x22
 [<ffffffff81096b31>] truncate_complete_page+0x2a/0x54
 [<ffffffff81096c4f>] truncate_inode_pages_range+0xf4/0x361
 [<ffffffff8109e06e>] ? unmap_mapping_range+0x21c/0x22c
 [<ffffffff810b7ac7>] ? virt_to_head_page+0x31/0x41
 [<ffffffff81096ec9>] truncate_inode_pages+0xd/0x10
 [<ffffffff8109e1f3>] vmtruncate+0x96/0xe4
 [<ffffffff810d29c8>] inode_setattr+0x2b/0x125
 [<ffffffffa003670b>] ext3_setattr+0x190/0x1f7 [ext3]
 [<ffffffff810d2c58>] notify_change+0x196/0x2ee
 [<ffffffff810be03d>] do_truncate+0x63/0x81
 [<ffffffff810be13c>] sys_ftruncate+0xe1/0xfe
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task spamd:8628 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
spamd         D ffff88005c966700     0  8628   8626
 ffff880101fad6d8 0000000000000086 0000000000000092 ffff8802195e8000
 ffffffff8162a500 ffffffff8162a500 ffff88010060db80 ffff8800012396e0
 ffff88010060dec8 00000000a0286a3c ffff88021aad96c8 ffff88010060dec8
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0de1>] __wait_on_bit+0x47/0x79
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0e7d>] out_of_line_wait_on_bit+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff8113cb81>] ? submit_bio+0xe0/0xe9
 [<ffffffff810e150a>] __wait_on_buffer+0x36/0x3a
 [<ffffffff810e154f>] wait_on_buffer+0x41/0x45
 [<ffffffff810e34fc>] __bread+0x5e/0x7e
 [<ffffffffa0036911>] ext3_get_branch+0x76/0xea [ext3]
 [<ffffffffa00376ca>] ext3_get_blocks_handle+0x9d/0x858 [ext3]
 [<ffffffff810935af>] ? __alloc_pages_internal+0xfe/0x457
 [<ffffffff810b1323>] ? alloc_page_vma+0xc1/0xc6
 [<ffffffff81099ca5>] ? __inc_zone_page_state+0x25/0x27
 [<ffffffff810a6dfd>] ? page_add_new_anon_rmap+0x4c/0x4e
 [<ffffffff8109ec50>] ? handle_mm_fault+0x856/0x890
 [<ffffffff810448a3>] ? raise_softirq+0x41/0x58
 [<ffffffffa0037f43>] ext3_get_block+0xbe/0xfc [ext3]
 [<ffffffff811504a3>] ? __up_read+0x7a/0x83
 [<ffffffff810e7c1e>] do_mpage_readpage+0x1a8/0x4d5
 [<ffffffff8114fafd>] ? radix_tree_insert+0x186/0x1ca
 [<ffffffff81099ca5>] ? __inc_zone_page_state+0x25/0x27
 [<ffffffff8108d627>] ? add_to_page_cache_locked+0x9a/0xae
 [<ffffffff81096290>] ? lru_cache_add+0x2b/0x5c
 [<ffffffff810e804e>] mpage_readpages+0xb1/0xf4
 [<ffffffffa0037e85>] ? ext3_get_block+0x0/0xfc [ext3]
 [<ffffffff810935af>] ? __alloc_pages_internal+0xfe/0x457
 [<ffffffffa0037e85>] ? ext3_get_block+0x0/0xfc [ext3]
 [<ffffffff810d518e>] ? mntput_no_expire+0x31/0x144
 [<ffffffffa00375a5>] ext3_readpages+0x1a/0x1c [ext3]
 [<ffffffff81095490>] __do_page_cache_readahead+0xfc/0x172
 [<ffffffff81095804>] ondemand_readahead+0x178/0x18a
 [<ffffffff810958af>] page_cache_sync_readahead+0x17/0x1c
 [<ffffffff8108ec4a>] generic_file_aio_read+0x22f/0x595
 [<ffffffff810becb5>] do_sync_read+0xe7/0x12d
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffff812c20fa>] ? _spin_lock+0x9/0xc
 [<ffffffff81120e84>] ? security_file_permission+0x11/0x13
 [<ffffffff810bf632>] vfs_read+0xa8/0x102
 [<ffffffff810bf6e8>] sys_pread64+0x5c/0x7d
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task pdflush:10196 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
pdflush       D ffff8801c5ab4000     0 10196      2
 ffff8801d71f1ad0 0000000000000046 ffff88021a8c3a00 0000000000000000
 ffffffff8162a500 ffffffff8162a500 ffff8802114f96e0 ffff8801f8c0db80
 ffff8802114f9a28 000000022801b628 ffff8801d71f1aa0 ffff8802114f9a28
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff8108d285>] sync_page+0x51/0x58
 [<ffffffff812c0cdc>] __wait_on_bit_lock+0x45/0x8c
 [<ffffffff8108d234>] ? sync_page+0x0/0x58
 [<ffffffff8108d1e7>] __lock_page+0x63/0x6a
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff81031103>] ? need_resched+0x1e/0x28
 [<ffffffff810943e2>] write_cache_pages+0x1eb/0x3b4
 [<ffffffff81010a37>] ? restore_args+0x0/0x30
 [<ffffffff81093998>] ? __writepage+0x0/0x2f
 [<ffffffff810945ca>] generic_writepages+0x1f/0x25
 [<ffffffff810945ff>] do_writepages+0x2f/0x38
 [<ffffffff810dbf95>] __writeback_single_inode+0x1a2/0x332
 [<ffffffff81033569>] ? __dequeue_entity+0x61/0x6a
 [<ffffffff8100e717>] ? __switch_to+0xb9/0x3e0
 [<ffffffff810dc526>] generic_sync_sb_inodes+0x245/0x390
 [<ffffffff810dc86b>] writeback_inodes+0xa4/0xfd
 [<ffffffff8109474e>] wb_kupdate+0xa3/0x119
 [<ffffffff81095163>] pdflush+0x16e/0x231
 [<ffffffff810946ab>] ? wb_kupdate+0x0/0x119
 [<ffffffff81094ff5>] ? pdflush+0x0/0x231
 [<ffffffff81094ff5>] ? pdflush+0x0/0x231
 [<ffffffff810534bf>] kthread+0x49/0x76
 [<ffffffff81011719>] child_rip+0xa/0x11
 [<ffffffff81010a37>] ? restore_args+0x0/0x30
 [<ffffffff81053476>] ? kthread+0x0/0x76
 [<ffffffff8101170f>] ? child_rip+0x0/0x11

INFO: task VirtualBox:25951 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
VirtualBox    D ffff8800709fc680     0 25951  25560
 ffff880148fc16e8 0000000000000082 0000000000000096 ffff8802195e8000
 ffffffff8162a500 ffffffff8162a500 ffff8801e514adc0 ffff8801f8c0db80
 ffff8801e514b108 00000003a0286a3c ffff88021aad96c8 ffff8801e514b108
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0de1>] __wait_on_bit+0x47/0x79
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0e7d>] out_of_line_wait_on_bit+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff8113cb81>] ? submit_bio+0xe0/0xe9
 [<ffffffff810e150a>] __wait_on_buffer+0x36/0x3a
 [<ffffffff810e154f>] wait_on_buffer+0x41/0x45
 [<ffffffff810e34fc>] __bread+0x5e/0x7e
 [<ffffffffa0036911>] ext3_get_branch+0x76/0xea [ext3]
 [<ffffffffa00376ca>] ext3_get_blocks_handle+0x9d/0x858 [ext3]
 [<ffffffffa0037f43>] ext3_get_block+0xbe/0xfc [ext3]
 [<ffffffff810e7c1e>] do_mpage_readpage+0x1a8/0x4d5
 [<ffffffff8114fafd>] ? radix_tree_insert+0x186/0x1ca
 [<ffffffff81099ca5>] ? __inc_zone_page_state+0x25/0x27
 [<ffffffff8108d627>] ? add_to_page_cache_locked+0x9a/0xae
 [<ffffffff81096290>] ? lru_cache_add+0x2b/0x5c
 [<ffffffff810e804e>] mpage_readpages+0xb1/0xf4
 [<ffffffffa0037e85>] ? ext3_get_block+0x0/0xfc [ext3]
 [<ffffffff810935af>] ? __alloc_pages_internal+0xfe/0x457
 [<ffffffffa0037e85>] ? ext3_get_block+0x0/0xfc [ext3]
 [<ffffffffa00375a5>] ext3_readpages+0x1a/0x1c [ext3]
 [<ffffffff81095490>] __do_page_cache_readahead+0xfc/0x172
 [<ffffffff81095804>] ondemand_readahead+0x178/0x18a
 [<ffffffff810958af>] page_cache_sync_readahead+0x17/0x1c
 [<ffffffff8108ec4a>] generic_file_aio_read+0x22f/0x595
 [<ffffffff810becb5>] do_sync_read+0xe7/0x12d
 [<ffffffffa03d88a2>] ? RTMemFree+0x1e/0x20 [vboxdrv]
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffff81031103>] ? need_resched+0x1e/0x28
 [<ffffffff81120e84>] ? security_file_permission+0x11/0x13
 [<ffffffff810bf632>] vfs_read+0xa8/0x102
 [<ffffffff810bf750>] sys_read+0x47/0x6e
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task kjournald:1202 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
kjournald     D ffff88021a845040     0  1202      2
 ffff8802188a5ca0 0000000000000046 0000000000000086 ffff8802195e8000
 ffffffff8162a500 ffffffff8162a500 ffff88021956adc0 ffff8801001b2dc0
 ffff88021956b108 00000003a0286a3c ffff88021aad96c8 ffff88021956b108
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0de1>] __wait_on_bit+0x47/0x79
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0e7d>] out_of_line_wait_on_bit+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff8113cb81>] ? submit_bio+0xe0/0xe9
 [<ffffffff810e150a>] __wait_on_buffer+0x36/0x3a
 [<ffffffffa0024deb>] wait_on_buffer+0x41/0x45 [jbd]
 [<ffffffffa002540e>] journal_commit_transaction+0x55d/0xf2f [jbd]
 [<ffffffff81049a0d>] ? try_to_del_timer_sync+0x58/0x63
 [<ffffffffa0028ad8>] kjournald+0xe3/0x23a [jbd]
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffffa00289f5>] ? kjournald+0x0/0x23a [jbd]
 [<ffffffff810534bf>] kthread+0x49/0x76
 [<ffffffff81011719>] child_rip+0xa/0x11
 [<ffffffff81010a37>] ? restore_args+0x0/0x30
 [<ffffffff81053476>] ? kthread+0x0/0x76
 [<ffffffff8101170f>] ? child_rip+0x0/0x11

INFO: task VirtualBox:25932 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
VirtualBox    D 0000000000000002     0 25932  25560
 ffff8801d9e51928 0000000000000086 0000000000000096 ffff88021b4505d0
 ffffffff8162a500 ffffffff8162a500 ffff8800c2a20000 ffff88021b9fadc0
 ffff8800c2a20348 00000000a0286a3c ffff88021aad96c8 ffff8800c2a20348
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0de1>] __wait_on_bit+0x47/0x79
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0e7d>] out_of_line_wait_on_bit+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff8113cb81>] ? submit_bio+0xe0/0xe9
 [<ffffffff810e150a>] __wait_on_buffer+0x36/0x3a
 [<ffffffff810e154f>] wait_on_buffer+0x41/0x45
 [<ffffffff810e1b53>] sync_dirty_buffer+0x58/0xa7
 [<ffffffffa0023eec>] journal_dirty_data+0xed/0x1cb [jbd]
 [<ffffffffa00375d4>] ext3_journal_dirty_data+0x18/0x40 [ext3]
 [<ffffffffa003557b>] walk_page_buffers+0x6e/0x96 [ext3]
 [<ffffffffa00375bc>] ? ext3_journal_dirty_data+0x0/0x40 [ext3]
 [<ffffffffa0036f75>] ext3_ordered_write_end+0x7d/0x127 [ext3]
 [<ffffffff8108e167>] generic_file_buffered_write+0x1b8/0x643
 [<ffffffff810d623b>] ? mnt_drop_write+0x82/0x143
 [<ffffffff810d453d>] ? mnt_want_write+0x77/0x8d
 [<ffffffff8108e9e7>] __generic_file_aio_write_nolock+0x25e/0x292
 [<ffffffff8108f233>] generic_file_aio_write+0x67/0xc3
 [<ffffffffa003443f>] ext3_file_write+0x1e/0x9f [ext3]
 [<ffffffff810beb88>] do_sync_write+0xe7/0x12d
 [<ffffffffa03d88a2>] ? RTMemFree+0x1e/0x20 [vboxdrv]
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffff812c20fa>] ? _spin_lock+0x9/0xc
 [<ffffffff81031103>] ? need_resched+0x1e/0x28
 [<ffffffff81120e84>] ? security_file_permission+0x11/0x13
 [<ffffffff810bf444>] vfs_write+0xab/0x105
 [<ffffffff810bf562>] sys_write+0x47/0x6f
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task VirtualBox:25970 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
VirtualBox    D 0000000000000002     0 25970   3154
 ffff8801d6ed9948 0000000000000086 0000000000000092 ffff8802195e8000
 ffffffff8162a500 ffffffff8162a500 ffff8801f8c75b80 ffff88021b9fadc0
 ffff8801f8c75ec8 00000000a0286a3c ffff88021aad96c8 ffff8801f8c75ec8
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0cdc>] __wait_on_bit_lock+0x45/0x8c
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0d8d>] out_of_line_wait_on_bit_lock+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff810e1a7c>] __lock_buffer+0x25/0x27
 [<ffffffffa002361f>] lock_buffer+0x35/0x3a [jbd]
 [<ffffffffa0024149>] do_get_write_access+0x72/0x404 [jbd]
 [<ffffffffa0027cfb>] ? journal_add_journal_head+0xc7/0x14e [jbd]
 [<ffffffffa00244fd>] journal_get_write_access+0x22/0x33 [jbd]
 [<ffffffffa0042806>] __ext3_journal_get_write_access+0x1f/0x48 [ext3]
 [<ffffffffa0036391>] ext3_reserve_inode_write+0x3f/0x76 [ext3]
 [<ffffffffa0036403>] ext3_mark_inode_dirty+0x3b/0x58 [ext3]
 [<ffffffffa0036564>] ext3_dirty_inode+0x6c/0x83 [ext3]
 [<ffffffff810dc8f7>] __mark_inode_dirty+0x33/0x190
 [<ffffffff810d13e0>] file_update_time+0xbe/0x102
 [<ffffffff8108e8f1>] __generic_file_aio_write_nolock+0x168/0x292
 [<ffffffff8108f233>] generic_file_aio_write+0x67/0xc3
 [<ffffffffa003443f>] ext3_file_write+0x1e/0x9f [ext3]
 [<ffffffff810beb88>] do_sync_write+0xe7/0x12d
 [<ffffffffa03d88a2>] ? RTMemFree+0x1e/0x20 [vboxdrv]
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffff81031103>] ? need_resched+0x1e/0x28
 [<ffffffff81120e84>] ? security_file_permission+0x11/0x13
 [<ffffffff810bf444>] vfs_write+0xab/0x105
 [<ffffffff810bf562>] sys_write+0x47/0x6f
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task VirtualBox:25990 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
VirtualBox    D ffff880183ffd6c0     0 25990   3145
 ffff880179ba1948 0000000000000086 0000000000000092 ffff8802195e8000
 ffffffff8162a500 ffffffff8162a500 ffff88021151adc0 ffff8801001b2dc0
 ffff88021151b108 00000003a0286a3c ffff88021aad96c8 ffff88021151b108
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0cdc>] __wait_on_bit_lock+0x45/0x8c
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0d8d>] out_of_line_wait_on_bit_lock+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff810e1a7c>] __lock_buffer+0x25/0x27
 [<ffffffffa002361f>] lock_buffer+0x35/0x3a [jbd]
 [<ffffffffa0024149>] do_get_write_access+0x72/0x404 [jbd]
 [<ffffffffa0027cfb>] ? journal_add_journal_head+0xc7/0x14e [jbd]
 [<ffffffffa00244fd>] journal_get_write_access+0x22/0x33 [jbd]
 [<ffffffffa0042806>] __ext3_journal_get_write_access+0x1f/0x48 [ext3]
 [<ffffffffa0036391>] ext3_reserve_inode_write+0x3f/0x76 [ext3]
 [<ffffffffa0036403>] ext3_mark_inode_dirty+0x3b/0x58 [ext3]
 [<ffffffffa0036564>] ext3_dirty_inode+0x6c/0x83 [ext3]
 [<ffffffff810dc8f7>] __mark_inode_dirty+0x33/0x190
 [<ffffffff810d13e0>] file_update_time+0xbe/0x102
 [<ffffffff8108e8f1>] __generic_file_aio_write_nolock+0x168/0x292
 [<ffffffff8108f233>] generic_file_aio_write+0x67/0xc3
 [<ffffffffa003443f>] ext3_file_write+0x1e/0x9f [ext3]
 [<ffffffff810beb88>] do_sync_write+0xe7/0x12d
 [<ffffffffa03d88a2>] ? RTMemFree+0x1e/0x20 [vboxdrv]
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffff81031103>] ? need_resched+0x1e/0x28
 [<ffffffff81120e84>] ? security_file_permission+0x11/0x13
 [<ffffffff810bf444>] vfs_write+0xab/0x105
 [<ffffffff810bf562>] sys_write+0x47/0x6f
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task VirtualBox:20126 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
VirtualBox    D 0000000000000002     0 20126   3145
 ffff880090a2b9a8 0000000000000082 0000000000000092 ffff8802195e8000
 ffffffff8162a500 ffffffff8162a500 ffff8801d6075b80 ffff88021b9fadc0
 ffff8801d6075ec8 00000000a0286a3c ffff88021aad96c8 ffff8801d6075ec8
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0cdc>] __wait_on_bit_lock+0x45/0x8c
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0d8d>] out_of_line_wait_on_bit_lock+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff810e1a7c>] __lock_buffer+0x25/0x27
 [<ffffffffa002361f>] lock_buffer+0x35/0x3a [jbd]
 [<ffffffffa0024149>] do_get_write_access+0x72/0x404 [jbd]
 [<ffffffffa0027cfb>] ? journal_add_journal_head+0xc7/0x14e [jbd]
 [<ffffffffa00244fd>] journal_get_write_access+0x22/0x33 [jbd]
 [<ffffffffa0042806>] __ext3_journal_get_write_access+0x1f/0x48 [ext3]
 [<ffffffffa0036391>] ext3_reserve_inode_write+0x3f/0x76 [ext3]
 [<ffffffffa0036403>] ext3_mark_inode_dirty+0x3b/0x58 [ext3]
 [<ffffffffa0036564>] ext3_dirty_inode+0x6c/0x83 [ext3]
 [<ffffffff810dc8f7>] __mark_inode_dirty+0x33/0x190
 [<ffffffff810d1536>] touch_atime+0x112/0x11d
 [<ffffffff8108ef55>] generic_file_aio_read+0x53a/0x595
 [<ffffffff810becb5>] do_sync_read+0xe7/0x12d
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffff8103c847>] ? finish_task_switch+0x31/0xc9
 [<ffffffff812c0812>] ? thread_return+0xab/0xd9
 [<ffffffff81031103>] ? need_resched+0x1e/0x28
 [<ffffffff81120e84>] ? security_file_permission+0x11/0x13
 [<ffffffff810bf632>] vfs_read+0xa8/0x102
 [<ffffffff810bf750>] sys_read+0x47/0x6e
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task smbd:32220 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
smbd          D ffff880183ffc340     0 32220   2498
 ffff8800c3771b78 0000000000000082 0000000000000096 ffff8802195e8000
 ffffffff8162a500 ffffffff8162a500 ffff8800010ddb80 ffff8801001b2dc0
 ffff8800010ddec8 00000003a0286a3c ffff88021aad96c8 ffff8800010ddec8
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0de1>] __wait_on_bit+0x47/0x79
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0e7d>] out_of_line_wait_on_bit+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff810e150a>] __wait_on_buffer+0x36/0x3a
 [<ffffffffa0035cc7>] wait_on_buffer+0x41/0x45 [ext3]
 [<ffffffffa0038160>] ext3_bread+0x4c/0x76 [ext3]
 [<ffffffffa003cb48>] htree_dirblock_to_tree+0x33/0x13d [ext3]
 [<ffffffffa003ccc4>] ext3_htree_fill_tree+0x72/0x1ea [ext3]
 [<ffffffff810c817b>] ? path_walk+0xb7/0xc4
 [<ffffffff810b7ac7>] ? virt_to_head_page+0x31/0x41
 [<ffffffff810b7ac7>] ? virt_to_head_page+0x31/0x41
 [<ffffffffa0033e6b>] ext3_readdir+0x188/0x5c8 [ext3]
 [<ffffffff810caa56>] ? filldir+0x0/0xc5
 [<ffffffff810ccdcd>] ? locks_free_lock+0x4a/0x4e
 [<ffffffff810cdd41>] ? fcntl_setlk+0x29a/0x2ae
 [<ffffffff81031103>] ? need_resched+0x1e/0x28
 [<ffffffff810caa56>] ? filldir+0x0/0xc5
 [<ffffffff810cac3d>] vfs_readdir+0x79/0xaf
 [<ffffffff810cadb2>] sys_getdents+0x7d/0xc4
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task rsync:11446 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
rsync         D ffff8801ef08d380     0 11446  11438
 ffff880111159948 0000000000000082 0000000000000092 ffff8802195e8000
 ffffffff8162a500 ffffffff8162a500 ffff880201ec16e0 ffff8801006c96e0
 ffff880201ec1a28 00000003a0286a3c ffff88021aad96c8 ffff880201ec1a28
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0cdc>] __wait_on_bit_lock+0x45/0x8c
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0d8d>] out_of_line_wait_on_bit_lock+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff810e1a7c>] __lock_buffer+0x25/0x27
 [<ffffffffa002361f>] lock_buffer+0x35/0x3a [jbd]
 [<ffffffffa0024149>] do_get_write_access+0x72/0x404 [jbd]
 [<ffffffffa0027cfb>] ? journal_add_journal_head+0xc7/0x14e [jbd]
 [<ffffffffa00244fd>] journal_get_write_access+0x22/0x33 [jbd]
 [<ffffffffa0042806>] __ext3_journal_get_write_access+0x1f/0x48 [ext3]
 [<ffffffffa0036391>] ext3_reserve_inode_write+0x3f/0x76 [ext3]
 [<ffffffffa0036403>] ext3_mark_inode_dirty+0x3b/0x58 [ext3]
 [<ffffffffa0036564>] ext3_dirty_inode+0x6c/0x83 [ext3]
 [<ffffffff810dc8f7>] __mark_inode_dirty+0x33/0x190
 [<ffffffff810d13e0>] file_update_time+0xbe/0x102
 [<ffffffff8108e8f1>] __generic_file_aio_write_nolock+0x168/0x292
 [<ffffffff812342ca>] ? __sock_recvmsg+0x6d/0x7a
 [<ffffffff8108f233>] generic_file_aio_write+0x67/0xc3
 [<ffffffffa003443f>] ext3_file_write+0x1e/0x9f [ext3]
 [<ffffffff810beb88>] do_sync_write+0xe7/0x12d
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffff8103c847>] ? finish_task_switch+0x31/0xc9
 [<ffffffff812c20fa>] ? _spin_lock+0x9/0xc
 [<ffffffff81120e84>] ? security_file_permission+0x11/0x13
 [<ffffffff810bf444>] vfs_write+0xab/0x105
 [<ffffffff810bf562>] sys_write+0x47/0x6f
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task smbd:26021 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
smbd          D 0000000000000002     0 26021   2498
 ffff88017e9a5bb8 0000000000000086 ffff88021a8c3a00 0000000000000000
 ffffffff8162a500 ffffffff8162a500 ffff88019eceadc0 ffff88021b9fadc0
 ffff88019eceb108 000000022801e568 ffff88017e9a5b88 ffff88019eceb108
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff8108d285>] sync_page+0x51/0x58
 [<ffffffff812c0de1>] __wait_on_bit+0x47/0x79
 [<ffffffff8108d234>] ? sync_page+0x0/0x58
 [<ffffffff8108d4f1>] wait_on_page_bit+0x6e/0x75
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff81031103>] ? need_resched+0x1e/0x28
 [<ffffffff810965d2>] wait_on_page_writeback+0x2a/0x2e
 [<ffffffff81096e3c>] truncate_inode_pages_range+0x2e1/0x361
 [<ffffffff8109e06e>] ? unmap_mapping_range+0x21c/0x22c
 [<ffffffff810b7ac7>] ? virt_to_head_page+0x31/0x41
 [<ffffffff81096ec9>] truncate_inode_pages+0xd/0x10
 [<ffffffff8109e1f3>] vmtruncate+0x96/0xe4
 [<ffffffff810d29c8>] inode_setattr+0x2b/0x125
 [<ffffffffa003670b>] ext3_setattr+0x190/0x1f7 [ext3]
 [<ffffffff810d2c58>] notify_change+0x196/0x2ee
 [<ffffffff810be03d>] do_truncate+0x63/0x81
 [<ffffffff810be13c>] sys_ftruncate+0xe1/0xfe
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task smbd:26023 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
smbd          D ffff8801c5ab5a00     0 26023   2498
 ffff8801d28c1af8 0000000000000086 0000000000000092 ffff8802195e8000
 ffffffff8162a500 ffffffff8162a500 ffff8801232516e0 ffff8800617444a0
 ffff880123251a28 00000002a0286a3c ffff88021aad96c8 ffff880123251a28
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0cdc>] __wait_on_bit_lock+0x45/0x8c
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0d8d>] out_of_line_wait_on_bit_lock+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff81031dd9>] ? __wake_up_common+0x46/0x76
 [<ffffffff810e1a7c>] __lock_buffer+0x25/0x27
 [<ffffffffa002361f>] lock_buffer+0x35/0x3a [jbd]
 [<ffffffffa00236bb>] journal_invalidatepage+0x97/0x2a6 [jbd]
 [<ffffffffa003736f>] ext3_invalidatepage+0x3c/0x3e [ext3]
 [<ffffffff8109656d>] do_invalidatepage+0x20/0x22
 [<ffffffff81096b31>] truncate_complete_page+0x2a/0x54
 [<ffffffff81096c4f>] truncate_inode_pages_range+0xf4/0x361
 [<ffffffff8109e06e>] ? unmap_mapping_range+0x21c/0x22c
 [<ffffffff810b7ac7>] ? virt_to_head_page+0x31/0x41
 [<ffffffff81096ec9>] truncate_inode_pages+0xd/0x10
 [<ffffffff8109e1f3>] vmtruncate+0x96/0xe4
 [<ffffffff810d29c8>] inode_setattr+0x2b/0x125
 [<ffffffffa003670b>] ext3_setattr+0x190/0x1f7 [ext3]
 [<ffffffff810d2c58>] notify_change+0x196/0x2ee
 [<ffffffff810be03d>] do_truncate+0x63/0x81
 [<ffffffff810be13c>] sys_ftruncate+0xe1/0xfe
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task spamd:8628 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
spamd         D ffff88005c966700     0  8628   8626
 ffff880101fad6d8 0000000000000086 0000000000000092 ffff8802195e8000
 ffffffff8162a500 ffffffff8162a500 ffff88010060db80 ffff8800012396e0
 ffff88010060dec8 00000000a0286a3c ffff88021aad96c8 ffff88010060dec8
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0de1>] __wait_on_bit+0x47/0x79
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0e7d>] out_of_line_wait_on_bit+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff8113cb81>] ? submit_bio+0xe0/0xe9
 [<ffffffff810e150a>] __wait_on_buffer+0x36/0x3a
 [<ffffffff810e154f>] wait_on_buffer+0x41/0x45
 [<ffffffff810e34fc>] __bread+0x5e/0x7e
 [<ffffffffa0036911>] ext3_get_branch+0x76/0xea [ext3]
 [<ffffffffa00376ca>] ext3_get_blocks_handle+0x9d/0x858 [ext3]
 [<ffffffff810935af>] ? __alloc_pages_internal+0xfe/0x457
 [<ffffffff810b1323>] ? alloc_page_vma+0xc1/0xc6
 [<ffffffff81099ca5>] ? __inc_zone_page_state+0x25/0x27
 [<ffffffff810a6dfd>] ? page_add_new_anon_rmap+0x4c/0x4e
 [<ffffffff8109ec50>] ? handle_mm_fault+0x856/0x890
 [<ffffffff810448a3>] ? raise_softirq+0x41/0x58
 [<ffffffffa0037f43>] ext3_get_block+0xbe/0xfc [ext3]
 [<ffffffff811504a3>] ? __up_read+0x7a/0x83
 [<ffffffff810e7c1e>] do_mpage_readpage+0x1a8/0x4d5
 [<ffffffff8114fafd>] ? radix_tree_insert+0x186/0x1ca
 [<ffffffff81099ca5>] ? __inc_zone_page_state+0x25/0x27
 [<ffffffff8108d627>] ? add_to_page_cache_locked+0x9a/0xae
 [<ffffffff81096290>] ? lru_cache_add+0x2b/0x5c
 [<ffffffff810e804e>] mpage_readpages+0xb1/0xf4
 [<ffffffffa0037e85>] ? ext3_get_block+0x0/0xfc [ext3]
 [<ffffffff810935af>] ? __alloc_pages_internal+0xfe/0x457
 [<ffffffffa0037e85>] ? ext3_get_block+0x0/0xfc [ext3]
 [<ffffffff810d518e>] ? mntput_no_expire+0x31/0x144
 [<ffffffffa00375a5>] ext3_readpages+0x1a/0x1c [ext3]
 [<ffffffff81095490>] __do_page_cache_readahead+0xfc/0x172
 [<ffffffff81095804>] ondemand_readahead+0x178/0x18a
 [<ffffffff810958af>] page_cache_sync_readahead+0x17/0x1c
 [<ffffffff8108ec4a>] generic_file_aio_read+0x22f/0x595
 [<ffffffff810becb5>] do_sync_read+0xe7/0x12d
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffff812c20fa>] ? _spin_lock+0x9/0xc
 [<ffffffff81120e84>] ? security_file_permission+0x11/0x13
 [<ffffffff810bf632>] vfs_read+0xa8/0x102
 [<ffffffff810bf6e8>] sys_pread64+0x5c/0x7d
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task pdflush:10196 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
pdflush       D ffff8801c5ab4000     0 10196      2
 ffff8801d71f1ad0 0000000000000046 ffff88021a8c3a00 0000000000000000
 ffffffff8162a500 ffffffff8162a500 ffff8802114f96e0 ffff8801f8c0db80
 ffff8802114f9a28 000000022801b628 ffff8801d71f1aa0 ffff8802114f9a28
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff8108d285>] sync_page+0x51/0x58
 [<ffffffff812c0cdc>] __wait_on_bit_lock+0x45/0x8c
 [<ffffffff8108d234>] ? sync_page+0x0/0x58
 [<ffffffff8108d1e7>] __lock_page+0x63/0x6a
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff81031103>] ? need_resched+0x1e/0x28
 [<ffffffff810943e2>] write_cache_pages+0x1eb/0x3b4
 [<ffffffff81010a37>] ? restore_args+0x0/0x30
 [<ffffffff81093998>] ? __writepage+0x0/0x2f
 [<ffffffff810945ca>] generic_writepages+0x1f/0x25
 [<ffffffff810945ff>] do_writepages+0x2f/0x38
 [<ffffffff810dbf95>] __writeback_single_inode+0x1a2/0x332
 [<ffffffff81033569>] ? __dequeue_entity+0x61/0x6a
 [<ffffffff8100e717>] ? __switch_to+0xb9/0x3e0
 [<ffffffff810dc526>] generic_sync_sb_inodes+0x245/0x390
 [<ffffffff810dc86b>] writeback_inodes+0xa4/0xfd
 [<ffffffff8109474e>] wb_kupdate+0xa3/0x119
 [<ffffffff81095163>] pdflush+0x16e/0x231
 [<ffffffff810946ab>] ? wb_kupdate+0x0/0x119
 [<ffffffff81094ff5>] ? pdflush+0x0/0x231
 [<ffffffff81094ff5>] ? pdflush+0x0/0x231
 [<ffffffff810534bf>] kthread+0x49/0x76
 [<ffffffff81011719>] child_rip+0xa/0x11
 [<ffffffff81010a37>] ? restore_args+0x0/0x30
 [<ffffffff81053476>] ? kthread+0x0/0x76
 [<ffffffff8101170f>] ? child_rip+0x0/0x11

INFO: task VirtualBox:25951 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
VirtualBox    D ffff8800709fc680     0 25951  25560
 ffff880148fc16e8 0000000000000082 0000000000000096 ffff8802195e8000
 ffffffff8162a500 ffffffff8162a500 ffff8801e514adc0 ffff8801f8c0db80
 ffff8801e514b108 00000003a0286a3c ffff88021aad96c8 ffff8801e514b108
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0de1>] __wait_on_bit+0x47/0x79
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0e7d>] out_of_line_wait_on_bit+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff8113cb81>] ? submit_bio+0xe0/0xe9
 [<ffffffff810e150a>] __wait_on_buffer+0x36/0x3a
 [<ffffffff810e154f>] wait_on_buffer+0x41/0x45
 [<ffffffff810e34fc>] __bread+0x5e/0x7e
 [<ffffffffa0036911>] ext3_get_branch+0x76/0xea [ext3]
 [<ffffffffa00376ca>] ext3_get_blocks_handle+0x9d/0x858 [ext3]
 [<ffffffffa0037f43>] ext3_get_block+0xbe/0xfc [ext3]
 [<ffffffff810e7c1e>] do_mpage_readpage+0x1a8/0x4d5
 [<ffffffff8114fafd>] ? radix_tree_insert+0x186/0x1ca
 [<ffffffff81099ca5>] ? __inc_zone_page_state+0x25/0x27
 [<ffffffff8108d627>] ? add_to_page_cache_locked+0x9a/0xae
 [<ffffffff81096290>] ? lru_cache_add+0x2b/0x5c
 [<ffffffff810e804e>] mpage_readpages+0xb1/0xf4
 [<ffffffffa0037e85>] ? ext3_get_block+0x0/0xfc [ext3]
 [<ffffffff810935af>] ? __alloc_pages_internal+0xfe/0x457
 [<ffffffffa0037e85>] ? ext3_get_block+0x0/0xfc [ext3]
 [<ffffffffa00375a5>] ext3_readpages+0x1a/0x1c [ext3]
 [<ffffffff81095490>] __do_page_cache_readahead+0xfc/0x172
 [<ffffffff81095804>] ondemand_readahead+0x178/0x18a
 [<ffffffff810958af>] page_cache_sync_readahead+0x17/0x1c
 [<ffffffff8108ec4a>] generic_file_aio_read+0x22f/0x595
 [<ffffffff810becb5>] do_sync_read+0xe7/0x12d
 [<ffffffffa03d88a2>] ? RTMemFree+0x1e/0x20 [vboxdrv]
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffff81031103>] ? need_resched+0x1e/0x28
 [<ffffffff81120e84>] ? security_file_permission+0x11/0x13
 [<ffffffff810bf632>] vfs_read+0xa8/0x102
 [<ffffffff810bf750>] sys_read+0x47/0x6e
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

I was getting those previously as well, after I changed the backplanes and drives, but they just went away. Now they're back. This causes all the Linux VMs to crap out with disk I/O errors. The Windows VMs are fine. The system is Fedora Core 9 running VirtualBox 3.2.12. I know there are newer versions, but that's beside the point. Everything was OK before the power outage.

I put over 1k into replacing everything I could in hopes it would go away, and no luck I guess. I thought I was doing good given it's been running OK for a while. :( Any suggestions on what else to try? Motherboard? SATA controller? Right now I'm using the mobo + 2 PCIe controllers. They make controllers with 8+ ports now, so I'm wondering if I should get one of those. It's hard to find ones that don't do RAID though. I had bought one but learned the hard way that if it has RAID you HAVE to use it; it won't present the drives normally to the OS before they are initialized. I'm running out of money here, but this system just has to work.
 

Nothinman

Elite Member
Sep 14, 2001
30,672
0
0
Technically those aren't crashes, it even says INFO in the text:

INFO: task VirtualBox:25951 blocked for more than 120 seconds.

Which just means that process was stuck waiting on I/O for over 2min, which apparently the Linux VMs don't handle well.
 

Red Squirrel

That still can't be normal though. What would cause it? The apps that are shown in the log are not always the same. I think it's whatever happens to be using more CPU at the time of the crash. It's almost like the CPU suddenly locks up for a minute or something. I've caught this while using the system's GUI, and what basically happens is everything completely locks up. I also ran memtest and there were no errors, but could there still be a RAM issue memtest won't catch?

The only drive I did not change is the OS drive; could that maybe be going bad? Maybe the swap file tries to write to a bad sector or something? Just speculating. Is there such a thing as too big a swap file? I set it really big (something stupid like 160GB) just because I had the space. Should I reduce it? Now that I think of it, I'm almost thinking it may be that.
 

lxskllr

No Lifer
Nov 30, 2004
59,426
9,944
126
Bad memory will crash the machine. It either works or it doesn't. Errors are generally disastrous. The swap file shouldn't matter unless you made it too small. I'd look at the hard drive (possibly failing) and how the machine interacts with it (I have no idea regarding that aspect).
 

Nothinman

It's not the same process every time because it varies depending on what was doing I/O at the time. It's almost certainly not related to the CPU at all and instead the disks are either overworked or not working properly.

Too large a swap file is just a waste of space; it wouldn't have this effect.
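
To see how the blocked tasks are spread out, here is a rough sketch that tallies the "blocked for more than" lines by task name. It assumes the dmesg output is piped in on stdin, and the function name summarize_hung_tasks is made up for illustration.

```shell
# Count hung-task reports per task name, most frequent first.
# Reads kernel log text on stdin; prints "<task> <count>" per line.
# summarize_hung_tasks is a hypothetical helper name, not an existing tool.
summarize_hung_tasks() {
    # Pull the task name out of lines such as
    #   INFO: task smbd:32220 blocked for more than 120 seconds.
    sed -n 's/.*INFO: task \(.*\):[0-9][0-9]* blocked for more than.*/\1/p' |
        sort | uniq -c | sort -rn | awk '{ print $2, $1 }'
}
```

Usage would be something like `dmesg | summarize_hung_tasks`. If the list is a mix of unrelated programs, that fits the idea that whatever touched the disk at the time got stuck, rather than one program misbehaving.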
 

Nothinman

Red Squirrel said: "All the data drives were changed. Could it be the controller? I guess I could start by changing the OS drive and go from there."

Do you have anything like Cacti set up to show you disk activity graphs so you can see the history? Have you run something like 'vmstat 1' while it was happening?
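
If you leave 'vmstat 1' running into a filter, it's easier to spot the bad moments afterwards. A minimal sketch, assuming the usual vmstat header with "b" (tasks blocked on I/O) and "wa" (CPU time waiting on I/O) columns; the function name flag_io_stalls and the default threshold of 50 are arbitrary choices for illustration.

```shell
# Print only the vmstat samples that look I/O-bound.
# $1 = minimum iowait percentage to flag (illustrative default: 50).
# flag_io_stalls is a hypothetical helper name.
flag_io_stalls() {
    awk -v wa_min="${1:-50}" '
        $1 == "r" {                          # header row: map column names to positions
            for (i = 1; i <= NF; i++) col[$i] = i
            next
        }
        $1 ~ /^[0-9]+$/ && ("wa" in col) && ("b" in col) {
            b  = $(col["b"])  + 0            # tasks in uninterruptible sleep
            wa = $(col["wa"]) + 0            # percent of CPU time spent waiting on I/O
            if (wa >= wa_min + 0 || b > 0)
                printf "blocked=%d iowait=%d%%\n", b, wa
        }
    '
}
```

Something like `vmstat 1 | flag_io_stalls 50 >> /tmp/io_stalls.log` overnight would then only record the samples worth looking at. The log path is just an example.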
 

Red Squirrel

It's rare that I manage to catch it. It usually happens overnight, but that's when all the backup jobs run, so it's definitely during a period of high disk I/O. I can cause all the disk I/O I want but can't seem to reproduce it on demand. I've even tried running bonnie++ and stuff to try to make it happen. I can go days and be fine; other times it may happen within the same day. It seems very sporadic.
 

Nothinman

So if you know with a reasonable amount of certainty that it's the backups, why don't you try to fix that?
 

Red Squirrel

Well, why would rsync cause the error to happen in the first place? That's what I'm trying to figure out. That BSOD-like error is not normal operation of rsync or any other program. The backups are normal procedure and have always been. This only started recently.
 

Nothinman

It's not BSOD-like and it's not even an error, the kernel is just letting you know a task was stuck waiting on I/O for >120s and giving you a backtrace so that a developer can see how it got to that point.

Do you have multiple rsyncs running at once? And you never mentioned whether or not you have any monitoring of the system's vitals set up.
 

Red Squirrel

Well anything that crashes a system is an error imo. I never got around to setting up any kind of monitoring, been looking at possibly setting up Pandora or Nagios. But no there's not anything currently. If it helps, I can confirm the IO on that box is usually quite high at any given time, especially at night when the backups start to kick in. But I can't see why that should cause crashes. Slow down, yes, but not actually crash.

There are always a few separate backup jobs that run overnight, so it's normal to have multiple rsyncs running. Is there a way to have the date go into dmesg? It would help pinpoint the time it happens. Even if I did have monitoring I would not really know when the crash happened. If I run all my jobs manually it does not necessarily cause the crash, so I don't think it's the backups directly that are doing it.

I went ahead and ordered a new SATA controller, I'll start with that I guess. I'll also replace the OS drive while I'm at it. Hopefully that might solve the issue.
 

Nothinman

And "the system" isn't crashing, it's just giving you a warning because of the high I/O latency. If you run "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" the message won't appear any more and your system won't be "crashing" any more.

But if the I/O is usually high at any given time and then you add on backups on top of that, I'm not surprised that you're seeing those messages. Although I am surprised the Windows VMs handle I/O lag of those extremes.
 

lxskllr

Can't you stagger the backups so they run consecutively, and not concurrently?
 

Red Squirrel

Hiding the message won't really do me any good. This crash is causing most of the VMs to hang and need a hard reset. It's more than just a warning. Anything that causes negative effects on a system and requires user intervention to get it going is a crash imo. It might be a "warning", but it's telling me that something went wrong. What? That's what I'm trying to figure out.

I could stagger the backups or try to minimize load, but that's just a bandage. The system should not be crashing even if it's under high load. We stress our systems at work much more than this, and they do fine.

Guess I'll wait for that disk controller to come in and try that, then go from there.

I'm even wondering if it might be a software issue. Maybe upgrading would be worth a shot. It's a huge, tedious task to reinstall everything and reconfigure it all, but it may be worth a try. There's some stuff, such as converting to RAID 6, that I've been wanting to do which my current kernel and mdadm version don't support, so if I upgrade I'll get that functionality. Though these issues point more towards hardware, so I'm hoping changing the card will do the trick.
 

Nothinman

Red Squirrel said: "Hiding the message won't really do me any good. This crash is causing most of the VMs to hang and need a hard reset. It's more than just a warning. Anything that causes negative effects on a system and requires user intervention to get it going is a crash imo. It might be a 'warning', but it's telling me that something went wrong. What? That's what I'm trying to figure out."

It might make you stop categorizing the issue as a crash when it's not. If the VMs are crashing from the high I/O load that's on them, that's because processes aren't guaranteed any I/O latency; you're not running a real-time system. What's wrong is that your drives can't satisfy the I/O load you've placed on them, it's that simple. Whether that's because of a bad drive, a shitty controller or just a hugely overcommitted system is up to you to find out and not really something we can deduce without access to the system.

Red Squirrel said: "I could stagger the backups or try to minimize load, but that's just a bandage. The system should not be crashing even if it's under high load. We stress our systems at work much more than this, and they do fine."

No, that's not a band-aid. Your disks have an upper limit on throughput, IOPS, latency, etc., and if you're overcommitting them too much then you need to fix that. You can't just claim that you can run whatever you want concurrently and have it magically be true.

Red Squirrel said: "Guess I'll wait for that disk controller to come in and try that, then go from there. I'm even wondering if it might be a software issue. Maybe upgrading would be worth a shot. Huge tedious task to reinstall everything and reconfigure it all though, but may be worth a try. There's some stuff such as converting to RAID 6 that I've been wanting to do which my current kernel and mdadm version don't support, so if I upgrade I'll get that functionality. Though, these issues point more towards hardware, so I'm hoping changing the card will do the trick."

Why would you even consider upgrading software until you've ruled out the actual disk I/O? I mean, you really shouldn't be clinging to FC9 anymore anyway, but changing out the software won't change anything if the hardware simply can't handle what you're asking it to do.
 

Red Squirrel

Is there a way to allocate more I/O then? It seems odd that everything is running fine and then suddenly it would just crap out. It's not like it gets very slow or unusable; no, it just randomly decides to completely crap out. Otherwise the VMs are working fine. There are two instances of a game server, no lag whatsoever. I'm not really pushing the hardware all that much, even while the backups are running. I have even started every single backup job at once to see if it does it, but it won't. It's random and seems to only happen overnight, maybe between the hours of 1am and 6am or so.
 

hf2046

Junior Member
Sep 23, 2011
18
0
0
Red Squirrel said: "Is there a way to allocate more I/O then?"

You can find out what I/O schedulers your system supports by typing at the command line:

cat /sys/block/{dev_name}/queue/scheduler

Just replace {dev_name} with the device name (e.g. sda). The scheduler currently in use is the one shown in square brackets.

If you're running a hardware RAID, I believe the advice is to use 'noop' and let the controller schedule reads/writes. To change the scheduler you can use the following command:

echo noop > /sys/block/{dev_name}/queue/scheduler

http://en.wikipedia.org/wiki/Noop_scheduler
http://en.wikipedia.org/wiki/CFQ
http://en.wikipedia.org/wiki/Deadline_scheduler

Hope this helps.
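
If you have a lot of drives, a small loop saves typing. This is only a sketch: it assumes each scheduler file holds one line like "noop anticipatory deadline [cfq]" with the active entry in brackets, and the function names are made up for illustration.

```shell
# Extract the active (bracketed) scheduler from a line such as
#   noop anticipatory deadline [cfq]
# active_scheduler and list_schedulers are hypothetical helper names.
active_scheduler() {
    sed -n 's/.*\[\(.*\)\].*/\1/p'
}

# Print "<device>: <active scheduler>" for every block device that exposes one.
list_schedulers() {
    for f in /sys/block/*/queue/scheduler; do
        [ -r "$f" ] || continue            # skip if the glob matched nothing
        dev=${f#/sys/block/}               # strip the leading path
        dev=${dev%%/*}                     # keep only the device name
        printf '%s: %s\n' "$dev" "$(active_scheduler < "$f")"
    done
}
```

Running `list_schedulers` prints one line per device. Devices whose file has no bracketed entry will show an empty value.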
 

Nothinman

There are some knobs to turn, but you can't allocate more I/O than the hardware can physically sustain. Is mdadm possibly doing an array verification when the backups are running?
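
One way to check, as a sketch: /proc/mdstat shows a progress line (check, resync, recovery or reshape) under any array that is busy, and /sys/block/md0/md/sync_action reports the current action, where md0 is only an example array name. The helper below parses mdstat-style text from stdin; the name md_sync_activity is made up for illustration.

```shell
# Report md arrays that currently show a check/resync/recovery/reshape line.
# Reads /proc/mdstat-style text on stdin; prints "<array> <action>".
# md_sync_activity is a hypothetical helper name.
md_sync_activity() {
    awk '
        /^md[0-9]+ :/ { array = $1 }                 # remember which array we are in
        match($0, /(check|resync|recovery|reshape) *=/) {
            action = substr($0, RSTART, RLENGTH)
            sub(/ *=$/, "", action)                  # drop the trailing " ="
            print array, action
        }
    '
}
```

Usage would be `md_sync_activity < /proc/mdstat`, for example from a cron job that appends to a log with a timestamp. Writing "check" into the array's sync_action file as root should start a verification by hand, which would be one way to test whether a check plus backups triggers the stall.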
 

Red Squirrel

The hardware can handle it fine. Well, the load is around 4ish at most times due to F@H, but when I run all backups at once and all VMs at once, it will go up to around 10 or so. Thing is, the VMs are fully responsive. I have two game environments, I can log in and don't get any lag. So I know the hardware is handling it no problem.

How would I tell if mdadm is verifying? Perhaps that could be a trigger to look for. Is there a way I can force it? I'd be curious to force it while stressing the system to see what happens.

Also, would I be better off separating disk I/O from CPU I/O? Basically, offload the RAID to a completely separate box and just use iSCSI or NFS? I do want to get one of those big 24 bay enclosures some day and build a SAN.
 

hf2046

Red Squirrel said: "Also, would I be better off separating disk I/O from CPU I/O? Basically, offload the RAID to a completely separate box and just use iSCSI or NFS? I do want to get one of those big 24 bay enclosures some day and build a SAN."

Disk I/O and CPU I/O are linked. Every time a process / thread requests data from disk, it blocks until the CPU receives an interrupt that the data is ready to be read. Your processes are getting starved for data because somewhere along the disk I/O chain, reads and writes are not getting processed in a 'decent' amount of time (from the log messages, 120 seconds).

It's possible that your VMs are starving the other processes for I/O if they run at a high enough priority. This would be why you can still interact with the VM while other parts of your system are unresponsive. You should probably try running your backups while the VMs are turned off and see if you get the same errors.
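
To see which processes are stuck waiting on the disk at a given moment, you can look for the "D" (uninterruptible sleep) state in ps output, the same state shown in the hung-task messages. A minimal sketch; the function name list_blocked_tasks is made up for illustration.

```shell
# List processes in uninterruptible sleep (state D).
# Expects the output of `ps -eo state,pid,comm` on stdin; prints "<pid> <command>".
# list_blocked_tasks is a hypothetical helper name.
list_blocked_tasks() {
    awk 'NR > 1 && $1 ~ /^D/ { print $2, $3 }'   # NR > 1 skips the header row
}
```

Usage would be `ps -eo state,pid,comm | list_blocked_tasks`, ideally run a few times during a stall to see whether the VMs, the backups, or everything at once is waiting.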
 

Nothinman

The load average has nothing to do with I/O, it only covers processes.
 

Red Squirrel

I just had it happen again, this time I was actually around to witness it as it happened. I was just coding, so really low disk I/O, just the occasional file save. I went to save a file, then everything just locked up, I went to do dir /raid1 and that is still frozen. My Linux VMs are all crashing. I still don't get why Windows VMs don't get affected by this.

The error is now spamming in dmesg again.

Code:
INFO: task kjournald:1398 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
kjournald     D ffff88021b3ca700     0  1398      2
 ffff880218ecbca0 0000000000000046 ffff88021b5e9ed8 ffff88021ab50000
 ffffffff8162a500 ffffffff8162a500 ffff88021a90db80 ffff8801cf458000
 ffff88021a90dec8 00000003a02daa3c ffff880219914e98 ffff88021a90dec8
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0de1>] __wait_on_bit+0x47/0x79
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0e7d>] out_of_line_wait_on_bit+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff8113cb81>] ? submit_bio+0xe0/0xe9
 [<ffffffff810e150a>] __wait_on_buffer+0x36/0x3a
 [<ffffffffa0024deb>] wait_on_buffer+0x41/0x45 [jbd]
 [<ffffffffa002540e>] journal_commit_transaction+0x55d/0xf2f [jbd]
 [<ffffffff81049a0d>] ? try_to_del_timer_sync+0x58/0x63
 [<ffffffffa0028ad8>] kjournald+0xe3/0x23a [jbd]
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffffa00289f5>] ? kjournald+0x0/0x23a [jbd]
 [<ffffffff810534bf>] kthread+0x49/0x76
 [<ffffffff81011719>] child_rip+0xa/0x11
 [<ffffffff81010a37>] ? restore_args+0x0/0x30
 [<ffffffff81053476>] ? kthread+0x0/0x76
 [<ffffffff8101170f>] ? child_rip+0x0/0x11

INFO: task VirtualBox:27793 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
VirtualBox    D ffff8801c11fc340     0 27793  13748
 ffff88016eb898f8 0000000000000082 0000000000000092 ffff88021ab50798
 ffffffff8162a500 ffffffff8162a500 ffff88013b088000 ffff88010074adc0
 ffff88013b088348 00000001a02daa3c ffff88021ab41e60 ffff88013b088348
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0de1>] __wait_on_bit+0x47/0x79
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0e7d>] out_of_line_wait_on_bit+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff810e150a>] __wait_on_buffer+0x36/0x3a
 [<ffffffff810e154f>] wait_on_buffer+0x41/0x45
 [<ffffffff810e22fe>] __block_prepare_write+0x2ba/0x2fd
 [<ffffffffa0037e85>] ? ext3_get_block+0x0/0xfc [ext3]
 [<ffffffff8108d627>] ? add_to_page_cache_locked+0x9a/0xae
 [<ffffffff810e24b6>] block_write_begin+0x86/0xd8
 [<ffffffffa00374c3>] ext3_write_begin+0xdf/0x1a7 [ext3]
 [<ffffffffa0037e85>] ? ext3_get_block+0x0/0xfc [ext3]
 [<ffffffff8108e0fa>] generic_file_buffered_write+0x14b/0x643
 [<ffffffff810d623b>] ? mnt_drop_write+0x82/0x143
 [<ffffffff810d453d>] ? mnt_want_write+0x77/0x8d
 [<ffffffff8108e9e7>] __generic_file_aio_write_nolock+0x25e/0x292
 [<ffffffffa042fde9>] ? rtSemEventWait+0xed/0xfe [vboxdrv]
 [<ffffffffa04308fd>] ? RTSpinlockRelease+0xd/0x10 [vboxdrv]
 [<ffffffffa0427785>] ? SUPR0ObjRelease+0x17e/0x1d7 [vboxdrv]
 [<ffffffff8108f233>] generic_file_aio_write+0x67/0xc3
 [<ffffffffa003443f>] ext3_file_write+0x1e/0x9f [ext3]
 [<ffffffff810beb88>] do_sync_write+0xe7/0x12d
 [<ffffffffa042c8a2>] ? RTMemFree+0x1e/0x20 [vboxdrv]
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffff81031103>] ? need_resched+0x1e/0x28
 [<ffffffff81120e84>] ? security_file_permission+0x11/0x13
 [<ffffffff810bf444>] vfs_write+0xab/0x105
 [<ffffffff810bf562>] sys_write+0x47/0x6f
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task md0_raid5:1327 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
md0_raid5     D ffff8801c11fc340     0  1327      2
 ffff88021b5e99b0 0000000000000046 ffff88021b5e9900 000000108108f44b
 ffffffff8162a500 ffffffff8162a500 ffff88021a5d44a0 ffff88021a5adb80
 ffff88021a5d47e8 0000000200000001 ffff880206ab5a60 ffff88021a5d47e8
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff8113dca9>] get_request_wait+0xc1/0x152
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffff8113a64f>] ? elv_merge+0x163/0x185
 [<ffffffff8113e07e>] __make_request+0x344/0x3dd
 [<ffffffff8113ca66>] generic_make_request+0x27f/0x2ba
 [<ffffffffa02dcbd7>] ops_run_io+0x1f7/0x246 [raid456]
 [<ffffffffa02deae9>] handle_stripe5+0x114b/0x1190 [raid456]
 [<ffffffff811591a7>] ? swiotlb_map_sg_attrs+0x116/0x135
 [<ffffffffa02dfb11>] handle_stripe+0xfe3/0x1014 [raid456]
 [<ffffffff8100e717>] ? __switch_to+0xb9/0x3e0
 [<ffffffff812c22f4>] ? _spin_unlock_irqrestore+0x27/0x3e
 [<ffffffff81032ff6>] ? __wake_up+0x43/0x4f
 [<ffffffff81212174>] ? md_wakeup_thread+0x27/0x29
 [<ffffffffa02dac0b>] ? __release_stripe+0x176/0x17f [raid456]
 [<ffffffffa02e18fa>] raid5d+0x432/0x488 [raid456]
 [<ffffffff812c0bb4>] ? schedule_timeout+0x22/0xb4
 [<ffffffff81217012>] md_thread+0x11d/0x13b
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffff81216ef5>] ? md_thread+0x0/0x13b
 [<ffffffff810534bf>] kthread+0x49/0x76
 [<ffffffff81011719>] child_rip+0xa/0x11
 [<ffffffff81010a37>] ? restore_args+0x0/0x30
 [<ffffffff81053476>] ? kthread+0x0/0x76
 [<ffffffff8101170f>] ? child_rip+0x0/0x11

INFO: task VirtualBox:4014 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
VirtualBox    D ffff8801e30156c0     0  4014   3991
 ffff8801df9af8f8 0000000000000086 0000000000000092 ffff88021ab50798
 ffffffff8162a500 ffffffff8162a500 ffff8801cf45adc0 ffff8801a1c196e0
 ffff8801cf45b108 00000003a02daa3c ffff88021ab41e60 ffff8801cf45b108
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0de1>] __wait_on_bit+0x47/0x79
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0e7d>] out_of_line_wait_on_bit+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff810e150a>] __wait_on_buffer+0x36/0x3a
 [<ffffffff810e154f>] wait_on_buffer+0x41/0x45
 [<ffffffff810e22fe>] __block_prepare_write+0x2ba/0x2fd
 [<ffffffffa0037e85>] ? ext3_get_block+0x0/0xfc [ext3]
 [<ffffffff8108d627>] ? add_to_page_cache_locked+0x9a/0xae
 [<ffffffff810e24b6>] block_write_begin+0x86/0xd8
 [<ffffffffa00374c3>] ext3_write_begin+0xdf/0x1a7 [ext3]
 [<ffffffffa0037e85>] ? ext3_get_block+0x0/0xfc [ext3]
 [<ffffffff8108e0fa>] generic_file_buffered_write+0x14b/0x643
 [<ffffffff810d623b>] ? mnt_drop_write+0x82/0x143
 [<ffffffff810d453d>] ? mnt_want_write+0x77/0x8d
 [<ffffffff8108e9e7>] __generic_file_aio_write_nolock+0x25e/0x292
 [<ffffffff8108f233>] generic_file_aio_write+0x67/0xc3
 [<ffffffffa003443f>] ext3_file_write+0x1e/0x9f [ext3]
 [<ffffffff810beb88>] do_sync_write+0xe7/0x12d
 [<ffffffffa042c8a2>] ? RTMemFree+0x1e/0x20 [vboxdrv]
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffff81031103>] ? need_resched+0x1e/0x28
 [<ffffffff81120e84>] ? security_file_permission+0x11/0x13
 [<ffffffff810bf444>] vfs_write+0xab/0x105
 [<ffffffff810bf562>] sys_write+0x47/0x6f
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task smbd:12214 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
smbd          D ffff88021a949380     0 12214   2711
 ffff8801145edbb8 0000000000000082 ffff880219914800 0000000000000004
 ffffffff8162a500 ffffffff8162a500 ffff8801897216e0 ffff8801037016e0
 ffff880189721a28 00000003280216e8 ffff8801145edb88 ffff880189721a28
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff8108d285>] sync_page+0x51/0x58
 [<ffffffff812c0de1>] __wait_on_bit+0x47/0x79
 [<ffffffff8108d234>] ? sync_page+0x0/0x58
 [<ffffffff8108d4f1>] wait_on_page_bit+0x6e/0x75
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff81031103>] ? need_resched+0x1e/0x28
 [<ffffffff810965d2>] wait_on_page_writeback+0x2a/0x2e
 [<ffffffff81096e3c>] truncate_inode_pages_range+0x2e1/0x361
 [<ffffffff8109e06e>] ? unmap_mapping_range+0x21c/0x22c
 [<ffffffff810b7ac7>] ? virt_to_head_page+0x31/0x41
 [<ffffffff81096ec9>] truncate_inode_pages+0xd/0x10
 [<ffffffff8109e1f3>] vmtruncate+0x96/0xe4
 [<ffffffff810d29c8>] inode_setattr+0x2b/0x125
 [<ffffffffa003670b>] ext3_setattr+0x190/0x1f7 [ext3]
 [<ffffffff810d2c58>] notify_change+0x196/0x2ee
 [<ffffffff810be03d>] do_truncate+0x63/0x81
 [<ffffffff810be13c>] sys_ftruncate+0xe1/0xfe
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task FahCore_a3.exe:314 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
FahCore_a3.ex D ffff88002806b9f0     0   314   2797
 ffff8801a6cbfd08 0000000000000082 0000000000000001 000000000110cecc
 ffffffff8162a500 ffffffff8162a500 ffff8801a1c196e0 ffff88021f1216e0
 ffff8801a1c19a28 0000000300000000 ffff8801a6cbfce8 ffff8801a1c19a28
Call Trace:
 [<ffffffffa00285e0>] log_wait_commit+0xbd/0x116 [jbd]
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffff81094370>] ? write_cache_pages+0x179/0x3b4
 [<ffffffffa0023c01>] journal_stop+0x189/0x1c1 [jbd]
 [<ffffffffa0024d09>] journal_force_commit+0x23/0x26 [jbd]
 [<ffffffffa003e668>] ext3_force_commit+0x26/0x28 [ext3]
 [<ffffffffa0035c47>] ext3_write_inode+0x39/0x3f [ext3]
 [<ffffffff810dbfd1>] __writeback_single_inode+0x1de/0x332
 [<ffffffff810dc14d>] sync_inode+0x28/0x40
 [<ffffffffa0034562>] ext3_sync_file+0xa2/0xb0 [ext3]
 [<ffffffff810df337>] do_fsync+0x55/0x8a
 [<ffffffff810df39a>] __do_fsync+0x2e/0x44
 [<ffffffff810df3cb>] sys_fsync+0xb/0xd
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task smbd:3288 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
smbd          D ffff8801fe90c9c0     0  3288   2711
 ffff8801e142dbb8 0000000000000086 ffff880219914800 0000000000000004
 ffffffff8162a500 ffffffff8162a500 ffff8801e6de2dc0 ffff88021a5adb80
 ffff8801e6de3108 000000032801cdc8 ffff8801e142db88 ffff8801e6de3108
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff8108d285>] sync_page+0x51/0x58
 [<ffffffff812c0de1>] __wait_on_bit+0x47/0x79
 [<ffffffff8108d234>] ? sync_page+0x0/0x58
 [<ffffffff8108d4f1>] wait_on_page_bit+0x6e/0x75
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff81031103>] ? need_resched+0x1e/0x28
 [<ffffffff810965d2>] wait_on_page_writeback+0x2a/0x2e
 [<ffffffff81096e3c>] truncate_inode_pages_range+0x2e1/0x361
 [<ffffffff810b7ac7>] ? virt_to_head_page+0x31/0x41
 [<ffffffff81096ec9>] truncate_inode_pages+0xd/0x10
 [<ffffffff8109e1f3>] vmtruncate+0x96/0xe4
 [<ffffffff810d29c8>] inode_setattr+0x2b/0x125
 [<ffffffffa003670b>] ext3_setattr+0x190/0x1f7 [ext3]
 [<ffffffff810d2c58>] notify_change+0x196/0x2ee
 [<ffffffff810be03d>] do_truncate+0x63/0x81
 [<ffffffff810be13c>] sys_ftruncate+0xe1/0xfe
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task dir:12842 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
dir           D ffff8801aecdb400     0 12842  13353
 ffff880135925b58 0000000000000086 0000000000000096 ffff88021ab50798
 ffffffff8162a500 ffffffff8162a500 ffff88007c6616e0 ffff88010074adc0
 ffff88007c661a28 00000001a02daa3c ffff88021ab41e60 ffff88007c661a28
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0de1>] __wait_on_bit+0x47/0x79
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0e7d>] out_of_line_wait_on_bit+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff8113cb81>] ? submit_bio+0xe0/0xe9
 [<ffffffff810e150a>] __wait_on_buffer+0x36/0x3a
 [<ffffffffa0035cc7>] wait_on_buffer+0x41/0x45 [ext3]
 [<ffffffffa0035f4a>] __ext3_get_inode_loc+0x27f/0x2d8 [ext3]
 [<ffffffff812c1edb>] ? __down_read+0x3d/0xbd
 [<ffffffffa0036350>] ext3_get_inode_loc+0x15/0x17 [ext3]
 [<ffffffffa0043dad>] ext3_xattr_get+0x48/0x262 [ext3]
 [<ffffffff810c5fc1>] ? path_to_nameidata+0x16/0x39
 [<ffffffffa0044e86>] ext3_xattr_security_get+0x21/0x23 [ext3]
 [<ffffffff810d9d36>] generic_getxattr+0x62/0x66
 [<ffffffff810da5e9>] vfs_getxattr+0xa1/0xb3
 [<ffffffff810da697>] getxattr+0x9c/0xfb
 [<ffffffff810c8e6e>] ? putname+0x30/0x39
 [<ffffffff810c99a9>] ? user_path_at+0x5d/0x8c
 [<ffffffff810da7a5>] sys_lgetxattr+0x4a/0x67
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task kjournald:1398 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
kjournald     D ffff88021b3ca700     0  1398      2
 ffff880218ecbca0 0000000000000046 ffff88021b5e9ed8 ffff88021ab50000
 ffffffff8162a500 ffffffff8162a500 ffff88021a90db80 ffff8801cf458000
 ffff88021a90dec8 00000003a02daa3c ffff880219914e98 ffff88021a90dec8
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0de1>] __wait_on_bit+0x47/0x79
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0e7d>] out_of_line_wait_on_bit+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff8113cb81>] ? submit_bio+0xe0/0xe9
 [<ffffffff810e150a>] __wait_on_buffer+0x36/0x3a
 [<ffffffffa0024deb>] wait_on_buffer+0x41/0x45 [jbd]
 [<ffffffffa002540e>] journal_commit_transaction+0x55d/0xf2f [jbd]
 [<ffffffff81049a0d>] ? try_to_del_timer_sync+0x58/0x63
 [<ffffffffa0028ad8>] kjournald+0xe3/0x23a [jbd]
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffffa00289f5>] ? kjournald+0x0/0x23a [jbd]
 [<ffffffff810534bf>] kthread+0x49/0x76
 [<ffffffff81011719>] child_rip+0xa/0x11
 [<ffffffff81010a37>] ? restore_args+0x0/0x30
 [<ffffffff81053476>] ? kthread+0x0/0x76
 [<ffffffff8101170f>] ? child_rip+0x0/0x11

INFO: task VirtualBox:4175 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
VirtualBox    D ffff88021b3ca700     0  4175   3991
 ffff8801492556e8 0000000000000086 0000000000000096 ffff88021ab50798
 ffffffff8162a500 ffffffff8162a500 ffff8801a1d216e0 ffff88021a5adb80
 ffff8801a1d21a28 00000001a02daa3c ffff88021ab41e60 ffff8801a1d21a28
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0de1>] __wait_on_bit+0x47/0x79
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0e7d>] out_of_line_wait_on_bit+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff8113cb81>] ? submit_bio+0xe0/0xe9
 [<ffffffff810e150a>] __wait_on_buffer+0x36/0x3a
 [<ffffffff810e154f>] wait_on_buffer+0x41/0x45
 [<ffffffff810e34fc>] __bread+0x5e/0x7e
 [<ffffffffa0036911>] ext3_get_branch+0x76/0xea [ext3]
 [<ffffffffa00376ca>] ext3_get_blocks_handle+0x9d/0x858 [ext3]
 [<ffffffff811552b7>] ? __sg_alloc_table+0x6f/0xf8
 [<ffffffff81140e0c>] ? blk_rq_map_sg+0x131/0x27d
 [<ffffffff811591a7>] ? swiotlb_map_sg_attrs+0x116/0x135
 [<ffffffffa0109f24>] ? sil24_qc_prep+0x151/0x174 [sata_sil24]
 [<ffffffffa008ba56>] ? ata_qc_issue+0x27e/0x2bb [libata]
 [<ffffffffa0037f43>] ext3_get_block+0xbe/0xfc [ext3]
 [<ffffffff810e7c1e>] do_mpage_readpage+0x1a8/0x4d5
 [<ffffffff8114fafd>] ? radix_tree_insert+0x186/0x1ca
 [<ffffffff81099ca5>] ? __inc_zone_page_state+0x25/0x27
 [<ffffffff8108d627>] ? add_to_page_cache_locked+0x9a/0xae
 [<ffffffff81096290>] ? lru_cache_add+0x2b/0x5c
 [<ffffffff810e804e>] mpage_readpages+0xb1/0xf4
 [<ffffffffa0037e85>] ? ext3_get_block+0x0/0xfc [ext3]
 [<ffffffff810935af>] ? __alloc_pages_internal+0xfe/0x457
 [<ffffffffa0037e85>] ? ext3_get_block+0x0/0xfc [ext3]
 [<ffffffffa00375a5>] ext3_readpages+0x1a/0x1c [ext3]
 [<ffffffff81095490>] __do_page_cache_readahead+0xfc/0x172
 [<ffffffff81095804>] ondemand_readahead+0x178/0x18a
 [<ffffffff810958af>] page_cache_sync_readahead+0x17/0x1c
 [<ffffffff8108ec4a>] generic_file_aio_read+0x22f/0x595
 [<ffffffff810becb5>] do_sync_read+0xe7/0x12d
 [<ffffffffa042c8a2>] ? RTMemFree+0x1e/0x20 [vboxdrv]
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffff81031103>] ? need_resched+0x1e/0x28
 [<ffffffff81120e84>] ? security_file_permission+0x11/0x13
 [<ffffffff810bf632>] vfs_read+0xa8/0x102
 [<ffffffff810bf750>] sys_read+0x47/0x6e
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b

INFO: task VirtualBox:27793 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
VirtualBox    D ffff8801c11fc340     0 27793  13748
 ffff88016eb898f8 0000000000000082 0000000000000092 ffff88021ab50798
 ffffffff8162a500 ffffffff8162a500 ffff88013b088000 ffff88010074adc0
 ffff88013b088348 00000001a02daa3c ffff88021ab41e60 ffff88013b088348
Call Trace:
 [<ffffffff8101686f>] ? read_tsc+0xe/0x24
 [<ffffffff810590b6>] ? getnstimeofday+0x54/0xb0
 [<ffffffff812c08a3>] io_schedule+0x63/0xa5
 [<ffffffff810e15e1>] sync_buffer+0x3b/0x3f
 [<ffffffff812c0de1>] __wait_on_bit+0x47/0x79
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff810e15a6>] ? sync_buffer+0x0/0x3f
 [<ffffffff812c0e7d>] out_of_line_wait_on_bit+0x6a/0x77
 [<ffffffff81053861>] ? wake_bit_function+0x0/0x2a
 [<ffffffff810e150a>] __wait_on_buffer+0x36/0x3a
 [<ffffffff810e154f>] wait_on_buffer+0x41/0x45
 [<ffffffff810e22fe>] __block_prepare_write+0x2ba/0x2fd
 [<ffffffffa0037e85>] ? ext3_get_block+0x0/0xfc [ext3]
 [<ffffffff8108d627>] ? add_to_page_cache_locked+0x9a/0xae
 [<ffffffff810e24b6>] block_write_begin+0x86/0xd8
 [<ffffffffa00374c3>] ext3_write_begin+0xdf/0x1a7 [ext3]
 [<ffffffffa0037e85>] ? ext3_get_block+0x0/0xfc [ext3]
 [<ffffffff8108e0fa>] generic_file_buffered_write+0x14b/0x643
 [<ffffffff810d623b>] ? mnt_drop_write+0x82/0x143
 [<ffffffff810d453d>] ? mnt_want_write+0x77/0x8d
 [<ffffffff8108e9e7>] __generic_file_aio_write_nolock+0x25e/0x292
 [<ffffffffa042fde9>] ? rtSemEventWait+0xed/0xfe [vboxdrv]
 [<ffffffffa04308fd>] ? RTSpinlockRelease+0xd/0x10 [vboxdrv]
 [<ffffffffa0427785>] ? SUPR0ObjRelease+0x17e/0x1d7 [vboxdrv]
 [<ffffffff8108f233>] generic_file_aio_write+0x67/0xc3
 [<ffffffffa003443f>] ext3_file_write+0x1e/0x9f [ext3]
 [<ffffffff810beb88>] do_sync_write+0xe7/0x12d
 [<ffffffffa042c8a2>] ? RTMemFree+0x1e/0x20 [vboxdrv]
 [<ffffffff81053829>] ? autoremove_wake_function+0x0/0x38
 [<ffffffff81031103>] ? need_resched+0x1e/0x28
 [<ffffffff81120e84>] ? security_file_permission+0x11/0x13
 [<ffffffff810bf444>] vfs_write+0xab/0x105
 [<ffffffff810bf562>] sys_write+0x47/0x6f
 [<ffffffff8101027a>] system_call_fastpath+0x16/0x1b


A few minutes later, everything is back to normal. I was able to try saving again and it's as fast as usual. This crash seems to be a momentary thing, then everything recovers. I can start the crashed VMs back up after doing a hard reset, and it's like it never happened.
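
With this much output it can help to tally which tasks the kernel reported as hung, rather than reading every trace. Below is a small Python sketch that counts the `INFO: task ... blocked for more than N seconds` lines per task. The function name `tally_hung_tasks` is made up, and it assumes the dmesg text has already been captured into a string (for example from a saved file).

```python
import re
from collections import Counter

# One hung-task report starts with a line such as:
#   INFO: task kjournald:1398 blocked for more than 120 seconds.
_HUNG = re.compile(r"INFO: task (.+):(\d+) blocked for more than (\d+) seconds\.")

def tally_hung_tasks(dmesg_text):
    """Count hung-task reports per (task name, pid) in dmesg output."""
    counts = Counter()
    for line in dmesg_text.splitlines():
        match = _HUNG.search(line)
        if match:
            counts[(match.group(1), int(match.group(2)))] += 1
    return counts
```

Run over the output above, it would list kjournald, md0_raid5, VirtualBox, smbd, FahCore_a3.exe and dir, which makes it easier to compare one freeze with the next.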

The SATA controller came in a week ago, but it's not compatible with my system, which royally sucks. I was going to swap out the existing one to see if the controller was the issue, but the system would not POST at all with the card in there, so that idea is out. Anything else worth trying, or is it pretty much time to pack it in and just build a new server from scratch? If it's a motherboard or CPU issue, then I have to get a whole new system anyway, as the other parts won't be compatible with whatever I buy.

Guess this may be an excuse to build a Sandy Bridge or i7 system. Not that I have the money.
 

Red Squirrel

No Lifer
May 24, 2003
70,166
13,573
126
www.anyf.ca
Hmm, that's good to know. I wonder if KVM/QEMU would be better. I need a type 2 hypervisor, as that machine has other tasks and also does not have ESX-supported hardware.

It's odd that all these issues started right after a power outage though, but maybe it's just the fact that I rebooted and a module reloaded or something. I had upgraded it maybe a year before and it had not been rebooted since.

I'm also tempted to pull out the OS drive, do a fresh install of a newer distro, and set everything up from scratch to rule out a software issue. Maybe the fact that all this started after a power outage is just a coincidence. I've been itching to upgrade that FC9 install, but it's a pain to redo everything. If I keep the old drive, at least I'll have peace of mind that I can easily roll back if it turns out to be too much work.