
kernel-packages team mailing list archive

[Bug 652812] Re: task blocked for more than 120 seconds on server kernel

 

Lars, this bug was reported a while ago and has seen no recent activity. Is it still an issue? If so, please test the latest upstream kernel, following https://wiki.ubuntu.com/KernelMainlineBuilds. This will allow additional upstream developers to examine the issue. Please do not test a kernel from the daily folder; use the one at the very bottom of the list. Once you have tested the upstream kernel, please comment with the specific kernel version you tested. If this bug is fixed in the mainline kernel, please add the following tags:
kernel-fixed-upstream
kernel-fixed-upstream-VERSION-NUMBER

where VERSION-NUMBER is the version number of the kernel you tested. For example:
kernel-fixed-upstream-v3.11-rc5

To add tags, click the yellow circle with a black pencil icon next to the word Tags at the bottom of the bug description. Please also remove the tag:
needs-upstream-testing

If the mainline kernel does not fix this bug, please add the following tags:
kernel-bug-exists-upstream
kernel-bug-exists-upstream-VERSION-NUMBER

Please also remove the tag:
needs-upstream-testing

If you are unable to test the mainline kernel, please comment as to why specifically you were unable to test it and add the following tags:
kernel-unable-to-test-upstream
kernel-unable-to-test-upstream-VERSION-NUMBER

Once testing of the upstream kernel is complete, please mark this bug's
Status as Confirmed. Please let us know your results. Thank you for your
understanding.
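
The tag naming scheme above can be sketched in a few lines of Python. This is only an illustration of how the tag pairs are formed; the function name `upstream_tags` and the result keys `fixed`, `exists`, and `unable` are hypothetical and are not part of Launchpad.

```python
def upstream_tags(result, version):
    """Return the pair of tags for a mainline-kernel test result.

    result  -- hypothetical key: "fixed", "exists", or "unable"
    version -- the kernel version that was tested, e.g. "v3.11-rc5"
    """
    prefixes = {
        "fixed": "kernel-fixed-upstream",
        "exists": "kernel-bug-exists-upstream",
        "unable": "kernel-unable-to-test-upstream",
    }
    prefix = prefixes[result]
    # Each outcome gets a generic tag plus a version-specific tag.
    return [prefix, f"{prefix}-{version}"]


print(upstream_tags("fixed", "v3.11-rc5"))
```

In every case the tag needs-upstream-testing is removed separately, as described above.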

** Tags added: latest-bios-1.0b regression-potential

** Changed in: linux (Ubuntu)
       Status: Confirmed => Incomplete

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/652812

Title:
  task blocked for more than 120 seconds on server kernel

Status in “linux” package in Ubuntu:
  Incomplete

Bug description:
  Hi,

  This is about the server edition of Ubuntu.
  The server consists mainly of fast HDDs and two externally attached LTO-3 tape drives in a changer.
  Its purpose is to sync with other servers and then write everything onto both tape drives in parallel overnight.

  The following is our main problem:
  [ 1081.590063] INFO: task mbuffer1:2589 blocked for more than 120 seconds.
  [ 1081.590577] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
  [ 1081.591151] mbuffer1      D 0000000000000000     0  2589   2560 0x00000000
  [ 1081.591162]  ffff88080cee9c18 0000000000000082 0000000000015bc0 0000000000015bc0
  [ 1081.591173]  ffff8803f87ac890 ffff88080cee9fd8 0000000000015bc0 ffff8803f87ac4d0
  [ 1081.591181]  0000000000015bc0 ffff88080cee9fd8 0000000000015bc0 ffff8803f87ac890
  [ 1081.591189] Call Trace:
  [ 1081.591208]  [<ffffffff815583ad>] schedule_timeout+0x22d/0x300
  [ 1081.591220]  [<ffffffff812b4567>] ? kobject_put+0x27/0x60
  [ 1081.591228]  [<ffffffff81559f45>] ? _spin_lock_irq+0x15/0x20
  [ 1081.591238]  [<ffffffff8138a90a>] ? scsi_request_fn+0xda/0x5e0
  [ 1081.591246]  [<ffffffff81557656>] wait_for_common+0xd6/0x180
  [ 1081.591256]  [<ffffffff8129de33>] ? __generic_unplug_device+0x33/0x40
  [ 1081.591266]  [<ffffffff8105a350>] ? default_wake_function+0x0/0x20
  [ 1081.591286]  [<ffffffffa015c4d8>] ? T.945+0x158/0x170 [st]
  [ 1081.591294]  [<ffffffff815577bd>] wait_for_completion+0x1d/0x20
  [ 1081.591305]  [<ffffffffa015c637>] T.944+0x127/0x270 [st]
  [ 1081.591315]  [<ffffffffa0162092>] st_write+0x5a2/0xc70 [st]
  [ 1081.591324]  [<ffffffff8105a380>] ? wake_up_state+0x10/0x20
  [ 1081.591334]  [<ffffffff81143aa8>] vfs_write+0xb8/0x1a0
  [ 1081.591342]  [<ffffffff81144311>] sys_write+0x51/0x80
  [ 1081.591351]  [<ffffffff810121b2>] system_call_fastpath+0x16/0x1b
  [ 1081.591358] INFO: task mbuffer2:2608 blocked for more than 120 seconds.
  [ 1081.591800] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
  [ 1081.592374] mbuffer2      D 0000000000000000     0  2608   2591 0x00000000
  [ 1081.592383]  ffff8800df895c18 0000000000000082 0000000000015bc0 0000000000015bc0
  [ 1081.592392]  ffff8803f87a9ab0 ffff8800df895fd8 0000000000015bc0 ffff8803f87a96f0
  [ 1081.592400]  0000000000015bc0 ffff8800df895fd8 0000000000015bc0 ffff8803f87a9ab0
  [ 1081.592408] Call Trace:
  [ 1081.592417]  [<ffffffff815583ad>] schedule_timeout+0x22d/0x300
  [ 1081.592425]  [<ffffffff812b4567>] ? kobject_put+0x27/0x60
  [ 1081.592432]  [<ffffffff81559f45>] ? _spin_lock_irq+0x15/0x20
  [ 1081.592439]  [<ffffffff8138a90a>] ? scsi_request_fn+0xda/0x5e0
  [ 1081.592448]  [<ffffffff81557656>] wait_for_common+0xd6/0x180
  [ 1081.592456]  [<ffffffff8129de33>] ? __generic_unplug_device+0x33/0x40
  [ 1081.592464]  [<ffffffff8105a350>] ? default_wake_function+0x0/0x20
  [ 1081.592474]  [<ffffffffa015c4d8>] ? T.945+0x158/0x170 [st]
  [ 1081.592482]  [<ffffffff815577bd>] wait_for_completion+0x1d/0x20
  [ 1081.592492]  [<ffffffffa015c637>] T.944+0x127/0x270 [st]
  [ 1081.592502]  [<ffffffffa0162092>] st_write+0x5a2/0xc70 [st]
  [ 1081.592510]  [<ffffffff8105a380>] ? wake_up_state+0x10/0x20
  [ 1081.592518]  [<ffffffff81143aa8>] vfs_write+0xb8/0x1a0
  [ 1081.592525]  [<ffffffff81144311>] sys_write+0x51/0x80
  [ 1081.592533]  [<ffffffff810121b2>] system_call_fastpath+0x16/0x1b
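
For anyone triaging similar logs, the hung-task reports above can be pulled out of saved dmesg output with a small script. The following is a minimal sketch, not a tool used in this report: the function name `find_hung_tasks` is hypothetical, and the pattern assumes the message format shown in the trace above and task names without spaces.

```python
import re

# Matches lines such as:
# [ 1081.590063] INFO: task mbuffer1:2589 blocked for more than 120 seconds.
HUNG_TASK = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+INFO: task (?P<name>\S+):(?P<pid>\d+) "
    r"blocked for more than (?P<secs>\d+) seconds\."
)


def find_hung_tasks(log_text):
    """Return (timestamp, task name, pid, timeout) for each hung-task report."""
    found = []
    for line in log_text.splitlines():
        m = HUNG_TASK.search(line)
        if m:
            found.append(
                (float(m["ts"]), m["name"], int(m["pid"]), int(m["secs"]))
            )
    return found


sample = """\
[ 1081.590063] INFO: task mbuffer1:2589 blocked for more than 120 seconds.
[ 1081.590577] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1081.591358] INFO: task mbuffer2:2608 blocked for more than 120 seconds.
"""

for entry in find_hung_tasks(sample):
    print(entry)
```

The sample text is copied from the log above; in practice the input would be the full dmesg output saved to a file.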

  
  After the fifth 120-second delay, the following aborts the backup:
  [ 1818.980059] mptscsih: ioc1: attempting task abort! (sc=ffff880057bb7000)
  [ 1818.980067] st 6:0:4:0: CDB: Write(6): 0a 00 04 00 00 00
  [ 1829.300042] mptscsih: ioc1: WARNING - Issuing Reset from mptscsih_IssueTaskMgmt!!
  [ 1831.280030] mptscsih: ioc1: task abort: SUCCESS (sc=ffff880057bb7000)
  [ 1831.282296] mptscsih: ioc1: attempting task abort! (sc=ffff880057bb6a00)
  [ 1831.282302] st 6:0:5:0: CDB: Write(6): 0a 00 04 00 00 00
  [ 1831.282321] mptscsih: ioc1: task abort: SUCCESS (sc=ffff880057bb6a00)
  [ 1831.284945] st0: Error 80000 (driver bt 0x0, host bt 0x8).
  [ 1831.285106] st1: Error 80000 (driver bt 0x0, host bt 0x8).
  [ 1831.490044] scsi target6:0:4: Beginning Domain Validation
  [ 1831.637097] scsi target6:0:4: Ending Domain Validation
  [ 1831.637208] scsi target6:0:4: FAST-80 WIDE SCSI 160.0 MB/s DT (12.5 ns, offset 64)
  [ 1834.150032] scsi target6:0:5: Beginning Domain Validation
  [ 1834.297533] scsi target6:0:5: Ending Domain Validation
  [ 1834.297649] scsi target6:0:5: FAST-160 WIDE SCSI 320.0 MB/s DT IU RTI PCOMP (6.25 ns, offset 64)
  [ 1910.340056] scsi target6:0:5: Beginning Domain Validation
  [ 1910.729074] scsi target6:0:5: Ending Domain Validation
  [ 1910.729194] scsi target6:0:5: FAST-160 WIDE SCSI 320.0 MB/s DT IU RTI PCOMP (6.25 ns, offset 64)

  
  This is with the SAS-LSI driver manually updated to version:
  # cat /sys/module/mptbase/version 
  4.24.00.00

  I updated the driver because I lose connections to SATA drives with the
  driver supplied with the kernel (observed with 2.6.32-23).

  This is a really serious bug for this server: it prevents the server from doing backups.
  Please also see Bug 494476.

  
  regards
  Lars

  ProblemType: Bug
  DistroRelease: Ubuntu 10.04
  Package: linux-image-2.6.32-25-server 2.6.32-25.44 [modified: lib/modules/2.6.32-25-server/kernel/drivers/message/fusion/mptbase.ko lib/modules/2.6.32-25-server/kernel/drivers/message/fusion/mptctl.ko lib/modules/2.6.32-25-server/kernel/drivers/message/fusion/mptfc.ko lib/modules/2.6.32-25-server/kernel/drivers/message/fusion/mptlan.ko lib/modules/2.6.32-25-server/kernel/drivers/message/fusion/mptsas.ko lib/modules/2.6.32-25-server/kernel/drivers/message/fusion/mptscsih.ko lib/modules/2.6.32-25-server/kernel/drivers/message/fusion/mptspi.ko]
  Regression: No
  Reproducible: Yes
  ProcVersionSignature: Ubuntu 2.6.32-25.44-server 2.6.32.21+drm33.7
  Uname: Linux 2.6.32-25-server x86_64
  AlsaDevices: Error: command ['ls', '-l', '/dev/snd/'] failed with exit code 2: ls: cannot access /dev/snd/: No such file or directory
  AplayDevices: Error: [Errno 2] No such file or directory
  Architecture: amd64
  ArecordDevices: Error: [Errno 2] No such file or directory
  CurrentDmesg:
   
  Date: Fri Oct  1 10:20:57 2010
  MachineType: Supermicro H8DI3+
  PciMultimedia:
   
  ProcCmdLine: BOOT_IMAGE=/boot/vmlinuz-2.6.32-25-server root=LABEL=WURZEL ro elevator=noop quiet splash
  ProcEnviron:
   LANG=de_DE.UTF-8
   SHELL=/bin/bash
  SourcePackage: linux
  dmi.bios.date: 12/07/2009
  dmi.bios.vendor: American Megatrends Inc.
  dmi.bios.version: 1.0b
  dmi.board.asset.tag: To Be Filled By O.E.M.
  dmi.board.name: H8DI3+
  dmi.board.vendor: Supermicro
  dmi.board.version: 1234567890
  dmi.chassis.asset.tag: To Be Filled By O.E.M.
  dmi.chassis.type: 3
  dmi.chassis.vendor: Supermicro
  dmi.chassis.version: 1234567890
  dmi.modalias: dmi:bvnAmericanMegatrendsInc.:bvr1.0b:bd12/07/2009:svnSupermicro:pnH8DI3+:pvr1234567890:rvnSupermicro:rnH8DI3+:rvr1234567890:cvnSupermicro:ct3:cvr1234567890:
  dmi.product.name: H8DI3+
  dmi.product.version: 1234567890
  dmi.sys.vendor: Supermicro

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/652812/+subscriptions