← Back to team overview

kernel-packages team mailing list archive

[Bug 1357295] Re: bcache_writebac blocks and goes silent on trusty kernel 3.13.0-24-generic #47-Ubuntu

 

This bug was fixed in the package linux - 3.13.0-36.63

---------------
linux (3.13.0-36.63) trusty; urgency=low

  [ Joseph Salisbury ]

  * Release Tracking Bug
    - LP: #1365052

  [ Feng Kan ]

  * SAUCE: (no-up) irqchip:gic: change access of gicc_ctrl register to read
    modify write.
    - LP: #1357527
  * SAUCE: (no-up) arm64: optimized copy_to_user and copy_from_user
    assembly code
    - LP: #1358949

  [ Ming Lei ]

  * SAUCE: (no-up) Drop APM X-Gene SoC Ethernet driver
    - LP: #1360140
  * [Config] Drop XGENE entries
    - LP: #1360140
  * [Config] CONFIG_NET_XGENE=m for arm64
    - LP: #1360140

  [ Stefan Bader ]

  * SAUCE: Add compat macro for skb_get_hash
    - LP: #1358162
  * SAUCE: bcache: prevent crash on changing writeback_running
    - LP: #1357295

  [ Suman Tripathi ]

  * SAUCE: (no-up) arm64: Fix the csr-mask for APM X-Gene SoC AHCI SATA PHY
    clock DTS node.
    - LP: #1359489
  * SAUCE: (no-up) ahci_xgene: Skip the PHY and clock initialization if
    already configured by the firmware.
    - LP: #1359501
  * SAUCE: (no-up) ahci_xgene: Fix the link down in first attempt for the
    APM X-Gene SoC AHCI SATA host controller driver.
    - LP: #1359507

  [ Tuan Phan ]

  * SAUCE: (no-up) pci-xgene-msi: fixed deadlock in irq_set_affinity
    - LP: #1359514

  [ Upstream Kernel Changes ]

  * iwlwifi: mvm: Add a missed beacons threshold
    - LP: #1349572
  * mac80211: reset probe_send_count also in HW_CONNECTION_MONITOR case
    - LP: #1349572
  * genirq: Add an accessor for IRQ_PER_CPU flag
    - LP: #1357527
  * arm64: perf: add support for percpu pmu interrupt
    - LP: #1357527
  * cifs: sanity check length of data to send before sending
    - LP: #1283101
  * KVM: nVMX: Pass vmexit parameters to nested_vmx_vmexit
    - LP: #1329434
  * KVM: nVMX: Rework interception of IRQs and NMIs
    - LP: #1329434
  * KVM: vmx: disable APIC virtualization in nested guests
    - LP: #1329434
  * HID: Add transport-driver functions to the USB HID interface.
    - LP: #1353021
  * ahci_xgene: Removing NCQ support from the APM X-Gene SoC AHCI SATA Host
    Controller driver.
    - LP: #1358498
  * fold d_kill() and d_free()
    - LP: #1354234
  * fold try_prune_one_dentry()
    - LP: #1354234
  * new helper: dentry_free()
    - LP: #1354234
  * expand the call of dentry_lru_del() in dentry_kill()
    - LP: #1354234
  * dentry_kill(): don't try to remove from shrink list
    - LP: #1354234
  * don't remove from shrink list in select_collect()
    - LP: #1354234
  * more graceful recovery in umount_collect()
    - LP: #1354234
  * dcache: don't need rcu in shrink_dentry_list()
    - LP: #1354234
  * lift the "already marked killed" case into shrink_dentry_list()
  * split dentry_kill()
    - LP: #1354234
  * expand dentry_kill(dentry, 0) in shrink_dentry_list()
    - LP: #1354234
  * shrink_dentry_list(): take parent's ->d_lock earlier
    - LP: #1354234
  * dealing with the rest of shrink_dentry_list() livelock
    - LP: #1354234
  * dentry_kill() doesn't need the second argument now
    - LP: #1354234
  * dcache: add missing lockdep annotation
    - LP: #1354234
  * fs: convert use of typedef ctl_table to struct ctl_table
    - LP: #1354234
  * lock_parent: don't step on stale ->d_parent of all-but-freed one
    - LP: #1354234
  * tools/testing/selftests/ptrace/peeksiginfo.c: add PAGE_SIZE definition
    - LP: #1358855
  * x86, irq, pic: Probe for legacy PIC and set legacy_pic appropriately
    - LP: #1317697
  * bnx2x: Fix kernel crash and data miscompare after EEH recovery
    - LP: #1353105
  * bnx2x: Adapter not recovery from EEH error injection
    - LP: #1353105
  * Fix: module signature vs tracepoints: add new TAINT_UNSIGNED_MODULE
    - LP: #1359670
  * bcache: fix crash on shutdown in passthrough mode
    - LP: #1357295
  * bcache: fix uninterruptible sleep in writeback thread
    - LP: #1357295
  * namespaces: Use task_lock and not rcu to protect nsproxy
    - LP: #1328088
  * MAINTAINERS: Add entry for APM X-Gene SoC ethernet driver
    - LP: #1360140
  * Documentation: dts: Add bindings for APM X-Gene SoC ethernet driver
    - LP: #1360140
  * dts: Add bindings for APM X-Gene SoC ethernet driver
    - LP: #1360140
  * drivers: net: Add APM X-Gene SoC ethernet driver support.
    - LP: #1360140
  * powerpc/mm: Add new "set" flag argument to pte/pmd update function
    - LP: #1357014
  * powerpc/thp: Add write barrier after updating the valid bit
    - LP: #1357014
  * powerpc/thp: Don't recompute vsid and ssize in loop on invalidate
    - LP: #1357014
  * powerpc/thp: Invalidate old 64K based hash page mapping before insert
    of 4k pte
    - LP: #1357014
  * powerpc/thp: Handle combo pages in invalidate
    - LP: #1357014
  * powerpc/thp: Invalidate with vpn in loop
    - LP: #1357014
  * powerpc/thp: Use ACCESS_ONCE when loading pmdp
    - LP: #1357014
  * powerpc/mm: Use read barrier when creating real_pte
    - LP: #1357014
  * powerpc/thp: Add tracepoints to track hugepage invalidate
    - LP: #1357014
  * powerpc: subpage_protect: Increase the array size to take care of 64TB
    - LP: #1357014
  * mfd: rtsx: Add set pull control macro and simplify rtl8411
    - LP: #1361086
  * mfd: rtsx: Add support for card reader rtl8402
    - LP: #1361086
  * kvm: iommu: fix the third parameter of kvm_iommu_put_pages
    (CVE-2014-3601)
    - LP: #1362443
    - CVE-2014-3601
  * isofs: Fix unbounded recursion when processing relocated directories
    - LP: #1362447, #1362448
    - CVE-2014-5472
  * net: sctp: inherit auth_capable on INIT collisions
    - LP: #1349804
    - CVE-2014-5077
  * blk-mq: fix initializing request's start time
    - LP: #1297522

  [ Vinayak Kale ]

  * SAUCE: (no-up) dt-bindings: Add Potenza PMU binding
    - LP: #1357527
  * SAUCE: (no-up) arm64: dts: Add PMU node for APM X-Gene Storm SOC
    - LP: #1357527
 -- Joseph Salisbury <joseph.salisbury@xxxxxxxxxxxxx>   Wed, 03 Sep 2014 12:13:43 -0400

** Changed in: linux (Ubuntu Trusty)
       Status: Fix Committed => Fix Released

** CVE added: http://www.cve.mitre.org/cgi-
bin/cvename.cgi?name=2014-3601

** CVE added: http://www.cve.mitre.org/cgi-
bin/cvename.cgi?name=2014-5077

** CVE added: http://www.cve.mitre.org/cgi-
bin/cvename.cgi?name=2014-5472

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1357295

Title:
  bcache_writebac blocks and goes silent on trusty kernel
  3.13.0-24-generic #47-Ubuntu

Status in “linux” package in Ubuntu:
  Fix Released
Status in “linux” source package in Trusty:
  Fix Released
Status in “linux” source package in Utopic:
  Fix Released

Bug description:
  We regularly use bcache to use SSDs as an extra block cache layer for
  performance on large working sets (archive mirrors, large databases,
  etc).  Until now we have had good success with bcache on precise LTS
  with HWE kernels up to Saucy's.

  Unfortunately when we mount a bcache-backed volume on a Trusty server,
  it immediately loses contact with bcache_writebac:

    PID USER      PR  NI    VIRT    RES    SHR S  %CPU %MEM     TIME+ COMMAND                            
    501 root      20   0       0      0      0 D   0.0  0.0   0:00.00 bcache_writebac                    

  kern.log shows it losing its mind pretty much immediately on startup:

      Aug 12 02:22:15 somehostname kernel: [ 6483.477618] md: bind<sdc>
      Aug 12 02:22:15 somehostname kernel: [ 6483.478599] md: bind<sdd>
      Aug 12 02:22:15 somehostname kernel: [ 6483.482016] md/raid0:md0: md_size is 781443072 sectors.
      Aug 12 02:22:15 somehostname kernel: [ 6483.482024] md: RAID0 configuration for md0 - 1 zone
      Aug 12 02:22:15 somehostname kernel: [ 6483.482026] md: zone0=[sdc/sdd]
      Aug 12 02:22:15 somehostname kernel: [ 6483.482033]       zone-offset=         0KB, device-offset=         0KB, size= 390721536KB
      Aug 12 02:22:15 somehostname kernel: [ 6483.482036] 
      Aug 12 02:22:15 somehostname kernel: [ 6483.482055] md0: detected capacity change from 0 to 400098852864
      Aug 12 02:22:15 somehostname kernel: [ 6483.487644]  md0: unknown partition table
      Aug 12 02:26:55 somehostname kernel: [ 6763.657249] bio: create slab <bio-2> at 2
      Aug 12 02:26:55 somehostname kernel: [ 6763.658018] bcache: run_cache_set() invalidating existing data
      Aug 12 02:26:55 somehostname kernel: [ 6763.695447] bcache: register_cache() registered cache device md0
      Aug 12 02:26:55 somehostname kernel: [ 6763.696262] bcache: register_bdev() registered backing device sdb
      Aug 12 02:26:55 somehostname kernel: [ 6763.697833] bcache: bch_cached_dev_attach() Caching sdb as bcache0 on set 13dcf71e-73ea-49f7-8cd6-fb8182c3cc17
      Aug 12 02:27:24 somehostname kernel: [ 6793.114287] EXT4-fs (bcache0): mounted filesystem with ordered data mode. Opts: (null)
      Aug 12 02:30:16 somehostname kernel: [ 6964.514425] INFO: task bcache_writebac:29721 blocked for more than 120 seconds.
      Aug 12 02:30:16 somehostname kernel: [ 6964.608861]       Not tainted 3.13.0-24-generic #47-Ubuntu
      Aug 12 02:30:16 somehostname kernel: [ 6964.679403] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
      Aug 12 02:30:16 somehostname kernel: [ 6964.779270] bcache_writebac D ffff8817dfb34440     0 29721      2 0x00000000
      Aug 12 02:30:16 somehostname kernel: [ 6964.779278]  ffff882fafe57eb8 0000000000000002 ffff882fb100afe0 ffff882fafe57fd8
      Aug 12 02:30:16 somehostname kernel: [ 6964.779286]  0000000000014440 0000000000014440 ffff882fb100afe0 ffff882fb7bcac80
      Aug 12 02:30:16 somehostname kernel: [ 6964.779293]  ffff882fb11c0000 ffffffffa03f5840 0000000000000000 0000000000000000
      Aug 12 02:30:16 somehostname kernel: [ 6964.779301] Call Trace:
      Aug 12 02:30:16 somehostname kernel: [ 6964.779335]  [<ffffffffa03f5840>] ? read_dirty+0x400/0x400 [bcache]
      Aug 12 02:30:16 somehostname kernel: [ 6964.779346]  [<ffffffff81719ee9>] schedule+0x29/0x70
      Aug 12 02:30:16 somehostname kernel: [ 6964.779356]  [<ffffffff8108b2ed>] kthread+0xad/0xf0
      Aug 12 02:30:16 somehostname kernel: [ 6964.779368]  [<ffffffff8108b240>] ? kthread_create_on_node+0x1d0/0x1d0
      Aug 12 02:30:16 somehostname kernel: [ 6964.779379]  [<ffffffff817263fc>] ret_from_fork+0x7c/0xb0
      Aug 12 02:30:16 somehostname kernel: [ 6964.779385]  [<ffffffff8108b240>] ? kthread_create_on_node+0x1d0/0x1d0
      Aug 12 02:32:16 somehostname kernel: [ 7084.828482] INFO: task bcache_writebac:29721 blocked for more than 120 seconds.
      Aug 12 02:32:16 somehostname kernel: [ 7084.922987]       Not tainted 3.13.0-24-generic #47-Ubuntu
      Aug 12 02:32:16 somehostname kernel: [ 7084.993239] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
      Aug 12 02:32:16 somehostname kernel: [ 7085.093858] bcache_writebac D ffff8817dfb34440     0 29721      2 0x00000000
      Aug 12 02:32:16 somehostname kernel: [ 7085.093866]  ffff882fafe57eb8 0000000000000002 ffff882fb100afe0 ffff882fafe57fd8
      Aug 12 02:32:16 somehostname kernel: [ 7085.093874]  0000000000014440 0000000000014440 ffff882fb100afe0 ffff882fb7bcac80
      Aug 12 02:32:16 somehostname kernel: [ 7085.093881]  ffff882fb11c0000 ffffffffa03f5840 0000000000000000 0000000000000000
      Aug 12 02:32:16 somehostname kernel: [ 7085.093897] Call Trace:
      Aug 12 02:32:16 somehostname kernel: [ 7085.093918]  [<ffffffffa03f5840>] ? read_dirty+0x400/0x400 [bcache]
      Aug 12 02:32:16 somehostname kernel: [ 7085.093928]  [<ffffffff81719ee9>] schedule+0x29/0x70
      Aug 12 02:32:16 somehostname kernel: [ 7085.093944]  [<ffffffff8108b2ed>] kthread+0xad/0xf0
      Aug 12 02:32:16 somehostname kernel: [ 7085.093950]  [<ffffffff8108b240>] ? kthread_create_on_node+0x1d0/0x1d0
      Aug 12 02:32:16 somehostname kernel: [ 7085.093959]  [<ffffffff817263fc>] ret_from_fork+0x7c/0xb0
      Aug 12 02:32:16 somehostname kernel: [ 7085.093964]  [<ffffffff8108b240>] ? kthread_create_on_node+0x1d0/0x1d0

  ProblemType: Bug
  DistroRelease: Ubuntu 14.04
  Package: linux-image-3.13.0-33-generic 3.13.0-33.58
  ProcVersionSignature: Ubuntu 3.13.0-33.58-generic 3.13.11.4
  Uname: Linux 3.13.0-33-generic x86_64
  ApportVersion: 2.14.1-0ubuntu3.3
  Architecture: amd64
  AudioDevicesInUse:
   USER        PID ACCESS COMMAND
   /dev/snd/controlC1:  nick       2424 F.... pulseaudio
   /dev/snd/controlC0:  nick       2424 F.... pulseaudio
  CurrentDesktop: Unity
  Date: Fri Aug 15 11:08:19 2014
  HibernationDevice: RESUME=UUID=dae1fd50-2dc0-48b2-bd93-5823478772b6
  InstallationDate: Installed on 2014-05-29 (77 days ago)
  InstallationMedia: Ubuntu 14.04 LTS "Trusty Tahr" - Release amd64 (20140417)
  MachineType: LENOVO 20ALCTO1WW
  ProcFB: 0 inteldrmfb
  ProcKernelCmdLine: BOOT_IMAGE=/vmlinuz-3.13.0-33-generic.efi.signed root=/dev/mapper/ubuntu--vg-root ro quiet splash vt.handoff=7
  RelatedPackageVersions:
   linux-restricted-modules-3.13.0-33-generic N/A
   linux-backports-modules-3.13.0-33-generic  N/A
   linux-firmware                             1.127.5
  SourcePackage: linux
  UpgradeStatus: No upgrade log present (probably fresh install)
  dmi.bios.date: 02/06/2014
  dmi.bios.vendor: LENOVO
  dmi.bios.version: GIET69WW (2.19 )
  dmi.board.asset.tag: Not Available
  dmi.board.name: 20ALCTO1WW
  dmi.board.vendor: LENOVO
  dmi.board.version: 0B98405 STD
  dmi.chassis.asset.tag: No Asset Information
  dmi.chassis.type: 10
  dmi.chassis.vendor: LENOVO
  dmi.chassis.version: Not Available
  dmi.modalias: dmi:bvnLENOVO:bvrGIET69WW(2.19):bd02/06/2014:svnLENOVO:pn20ALCTO1WW:pvrThinkPadX240:rvnLENOVO:rn20ALCTO1WW:rvr0B98405STD:cvnLENOVO:ct10:cvrNotAvailable:
  dmi.product.name: 20ALCTO1WW
  dmi.product.version: ThinkPad X240
  dmi.sys.vendor: LENOVO

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1357295/+subscriptions


References