kernel-packages team mailing list archive
-
kernel-packages team
-
Mailing list archive
-
Message #21515
[Bug 652812] Re: task blocked for more than 120 seconds on server kernel
[Expired for linux (Ubuntu) because there has been no activity for 60
days.]
** Changed in: linux (Ubuntu)
Status: Incomplete => Expired
--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/652812
Title:
task blocked for more than 120 seconds on server kernel
Status in “linux” package in Ubuntu:
Expired
Bug description:
Hi,
this is about a ubuntu server version.
The server consists mainly of fast HDDs and 2 external attached LTO-3 tape drives in a changer.
It's purpose is to sync with other servers and then write ewverything onto both tape drives in parallel overnight.
The following is our main problem:
[ 1081.590063] INFO: task mbuffer1:2589 blocked for more than 120 seconds.
[ 1081.590577] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1081.591151] mbuffer1 D 0000000000000000 0 2589 2560 0x00000000
[ 1081.591162] ffff88080cee9c18 0000000000000082 0000000000015bc0 0000000000015bc0
[ 1081.591173] ffff8803f87ac890 ffff88080cee9fd8 0000000000015bc0 ffff8803f87ac4d0
[ 1081.591181] 0000000000015bc0 ffff88080cee9fd8 0000000000015bc0 ffff8803f87ac890
[ 1081.591189] Call Trace:
[ 1081.591208] [<ffffffff815583ad>] schedule_timeout+0x22d/0x300
[ 1081.591220] [<ffffffff812b4567>] ? kobject_put+0x27/0x60
[ 1081.591228] [<ffffffff81559f45>] ? _spin_lock_irq+0x15/0x20
[ 1081.591238] [<ffffffff8138a90a>] ? scsi_request_fn+0xda/0x5e0
[ 1081.591246] [<ffffffff81557656>] wait_for_common+0xd6/0x180
[ 1081.591256] [<ffffffff8129de33>] ? __generic_unplug_device+0x33/0x40
[ 1081.591266] [<ffffffff8105a350>] ? default_wake_function+0x0/0x20
[ 1081.591286] [<ffffffffa015c4d8>] ? T.945+0x158/0x170 [st]
[ 1081.591294] [<ffffffff815577bd>] wait_for_completion+0x1d/0x20
[ 1081.591305] [<ffffffffa015c637>] T.944+0x127/0x270 [st]
[ 1081.591315] [<ffffffffa0162092>] st_write+0x5a2/0xc70 [st]
[ 1081.591324] [<ffffffff8105a380>] ? wake_up_state+0x10/0x20
[ 1081.591334] [<ffffffff81143aa8>] vfs_write+0xb8/0x1a0
[ 1081.591342] [<ffffffff81144311>] sys_write+0x51/0x80
[ 1081.591351] [<ffffffff810121b2>] system_call_fastpath+0x16/0x1b
[ 1081.591358] INFO: task mbuffer2:2608 blocked for more than 120 seconds.
[ 1081.591800] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1081.592374] mbuffer2 D 0000000000000000 0 2608 2591 0x00000000
[ 1081.592383] ffff8800df895c18 0000000000000082 0000000000015bc0 0000000000015bc0
[ 1081.592392] ffff8803f87a9ab0 ffff8800df895fd8 0000000000015bc0 ffff8803f87a96f0
[ 1081.592400] 0000000000015bc0 ffff8800df895fd8 0000000000015bc0 ffff8803f87a9ab0
[ 1081.592408] Call Trace:
[ 1081.592417] [<ffffffff815583ad>] schedule_timeout+0x22d/0x300
[ 1081.592425] [<ffffffff812b4567>] ? kobject_put+0x27/0x60
[ 1081.592432] [<ffffffff81559f45>] ? _spin_lock_irq+0x15/0x20
[ 1081.592439] [<ffffffff8138a90a>] ? scsi_request_fn+0xda/0x5e0
[ 1081.592448] [<ffffffff81557656>] wait_for_common+0xd6/0x180
[ 1081.592456] [<ffffffff8129de33>] ? __generic_unplug_device+0x33/0x40
[ 1081.592464] [<ffffffff8105a350>] ? default_wake_function+0x0/0x20
[ 1081.592474] [<ffffffffa015c4d8>] ? T.945+0x158/0x170 [st]
[ 1081.592482] [<ffffffff815577bd>] wait_for_completion+0x1d/0x20
[ 1081.592492] [<ffffffffa015c637>] T.944+0x127/0x270 [st]
[ 1081.592502] [<ffffffffa0162092>] st_write+0x5a2/0xc70 [st]
[ 1081.592510] [<ffffffff8105a380>] ? wake_up_state+0x10/0x20
[ 1081.592518] [<ffffffff81143aa8>] vfs_write+0xb8/0x1a0
[ 1081.592525] [<ffffffff81144311>] sys_write+0x51/0x80
[ 1081.592533] [<ffffffff810121b2>] system_call_fastpath+0x16/0x1b
After the 5th 120s delay the following aborts the backup:
[ 1818.980059] mptscsih: ioc1: attempting task abort! (sc=ffff880057bb7000)
[ 1818.980067] st 6:0:4:0: CDB: Write(6): 0a 00 04 00 00 00
[ 1829.300042] mptscsih: ioc1: WARNING - Issuing Reset from mptscsih_IssueTaskMgmt!!
[ 1831.280030] mptscsih: ioc1: task abort: SUCCESS (sc=ffff880057bb7000)
[ 1831.282296] mptscsih: ioc1: attempting task abort! (sc=ffff880057bb6a00)
[ 1831.282302] st 6:0:5:0: CDB: Write(6): 0a 00 04 00 00 00
[ 1831.282321] mptscsih: ioc1: task abort: SUCCESS (sc=ffff880057bb6a00)
[ 1831.284945] st0: Error 80000 (driver bt 0x0, host bt 0x8).
[ 1831.285106] st1: Error 80000 (driver bt 0x0, host bt 0x8).
[ 1831.490044] scsi target6:0:4: Beginning Domain Validation
[ 1831.637097] scsi target6:0:4: Ending Domain Validation
[ 1831.637208] scsi target6:0:4: FAST-80 WIDE SCSI 160.0 MB/s DT (12.5 ns, offset 64)
[ 1834.150032] scsi target6:0:5: Beginning Domain Validation
[ 1834.297533] scsi target6:0:5: Ending Domain Validation
[ 1834.297649] scsi target6:0:5: FAST-160 WIDE SCSI 320.0 MB/s DT IU RTI PCOMP (6.25 ns, offset 64)
[ 1910.340056] scsi target6:0:5: Beginning Domain Validation
[ 1910.729074] scsi target6:0:5: Ending Domain Validation
[ 1910.729194] scsi target6:0:5: FAST-160 WIDE SCSI 320.0 MB/s DT IU RTI PCOMP (6.25 ns, offset 64)
This is with the SAS-LSI driver manually updated to version:
# cat /sys/module/mptbase/version
4.24.00.00
because I get lost connections to SATA drives with the driver supplied
with the kernel (was with 2.6.32-23).
This is a really serious bug for this server! It prevents it from doing backups.
Please also read Bug 494476
regards
Lars
ProblemType: Bug
DistroRelease: Ubuntu 10.04
Package: linux-image-2.6.32-25-server 2.6.32-25.44 [modified: lib/modules/2.6.32-25-server/kernel/drivers/message/fusion/mptbase.ko lib/modules/2.6.32-25-server/kernel/drivers/message/fusion/mptctl.ko lib/modules/2.6.32-25-server/kernel/drivers/message/fusion/mptfc.ko lib/modules/2.6.32-25-server/kernel/drivers/message/fusion/mptlan.ko lib/modules/2.6.32-25-server/kernel/drivers/message/fusion/mptsas.ko lib/modules/2.6.32-25-server/kernel/drivers/message/fusion/mptscsih.ko lib/modules/2.6.32-25-server/kernel/drivers/message/fusion/mptspi.ko]
Regression: No
Reproducible: Yes
ProcVersionSignature: Ubuntu 2.6.32-25.44-server 2.6.32.21+drm33.7
Uname: Linux 2.6.32-25-server x86_64
AlsaDevices: Error: command ['ls', '-l', '/dev/snd/'] failed with exit code 2: ls: cannot access /dev/snd/: No such file or directory
AplayDevices: Error: [Errno 2] No such file or directory
Architecture: amd64
ArecordDevices: Error: [Errno 2] No such file or directory
CurrentDmesg:
Date: Fri Oct 1 10:20:57 2010
MachineType: Supermicro H8DI3+
PciMultimedia:
ProcCmdLine: BOOT_IMAGE=/boot/vmlinuz-2.6.32-25-server root=LABEL=WURZEL ro elevator=noop quiet splash
ProcEnviron:
LANG=de_DE.UTF-8
SHELL=/bin/bash
SourcePackage: linux
dmi.bios.date: 12/07/2009
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: 1.0b
dmi.board.asset.tag: To Be Filled By O.E.M.
dmi.board.name: H8DI3+
dmi.board.vendor: Supermicro
dmi.board.version: 1234567890
dmi.chassis.asset.tag: To Be Filled By O.E.M.
dmi.chassis.type: 3
dmi.chassis.vendor: Supermicro
dmi.chassis.version: 1234567890
dmi.modalias: dmi:bvnAmericanMegatrendsInc.:bvr1.0b:bd12/07/2009:svnSupermicro:pnH8DI3+:pvr1234567890:rvnSupermicro:rnH8DI3+:rvr1234567890:cvnSupermicro:ct3:cvr1234567890:
dmi.product.name: H8DI3+
dmi.product.version: 1234567890
dmi.sys.vendor: Supermicro
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/652812/+subscriptions