← Back to team overview

kernel-packages team mailing list archive

[Bug 1239800] Re: Soft lockup when running bonnie++ only at 1600 mt/s

 

This bug was fixed in the package linux - 3.11.0-15.23

---------------
linux (3.11.0-15.23) saucy; urgency=low

  [Brad Figg]

  * Release Tracking Bug
    - LP: #1259259

  [ Tim Gardner ]

  * [Config] Build-in ohci-pci
    - LP: #1244176

linux (3.11.0-15.22) saucy; urgency=low

  [Brad Figg]

  * Release Tracking Bug
    - LP: #1257092

  [ Andy Whitcroft ]

  * [Config] CONFIG_DEBUG_BUGVERBOSE=y
    - LP: #1252353

  [ Benjamin Tissoires ]

  * SAUCE: (no-up) HID: appleir: force input to be set
    - LP: #1244505

  [ John Johansen ]

  * SAUCE: (no-up) apparmor: Fix tasks not subject to, reloaded policy
    - LP: #1236455

  [ Kamal Mostafa ]

  * SAUCE: (no-up) drm/i915: i915.disable_pch_pwm overrides PCH_PWM_ENABLE
    quirk
    - LP: #1163720

  [ Manoj Iyer ]

  * SAUCE: Enable earlyprintk via the PL011.
    - LP: #1248233

  [ Paolo Pisati ]

  * [Config] armhf: RTC_DRV_PL031=y
    - LP: #1252242
  * [Config] armhf: CPU_FREQ=y && ARM_HIGHBANK_CPUFREQ=y
    - LP: #1249397

  [ Rob Herring ]

  * [Config] armhf: PSTORE_RAM=y and PSTORE_CONSOLE=y
    - LP: #1248492
  * SAUCE: net: calxedaxgmac: add mac address learning
    - LP: #1248233

  [ Tim Gardner ]

  * [Debian] Re-sign modules after debug objcopy
    - LP: #1253155

  [ Upstream Kernel Changes ]

  * Revert "rt2x00pci: Use PCI MSIs whenever possible"
    - LP: #1257037
  * Revert "epoll: use freezable blocking call"
    - LP: #1257037
  * Revert "select: use freezable blocking call"
    - LP: #1257037
  * Revert "ima: policy for RAMFS"
    - LP: #1257037
  * ARM: tlb: don't perform inner-shareable invalidation for local TLB ops
    - LP: #1239800
  * ARM: 7855/1: Add check for Cortex-A15 errata 798181 ECO
    - LP: #1239800
  * mfd: rtsx: Modify rts5249_optimize_phy
    - LP: #1255297
  * usb: musb: start musb on the udc side, too
    - LP: #1257037
  * usb-storage: add quirk for mandatory READ_CAPACITY_16
    - LP: #1257037
  * USB: support new huawei devices in option.c
    - LP: #1257037
  * USB: quirks.c: add one device that cannot deal with suspension
    - LP: #1257037
  * USB: quirks: add touchscreen that is dazzeled by remote wakeup
    - LP: #1257037
  * USB: serial: ftdi_sio: add id for Z3X Box device
    - LP: #1257037
  * xhci: Don't enable/disable RWE on bus suspend/resume.
    - LP: #1257037
  * cifs: Fix inability to write files >2GB to SMB2/3 shares
    - LP: #1257037
  * x86: Update UV3 hub revision ID
    - LP: #1257037
  * cpufreq: s3c64xx: Rename index to driver_data
    - LP: #1257037
  * cpufreq / intel_pstate: Fix max_perf_pct on resume
    - LP: #1257037
  * bcache: Fixed incorrect order of arguments to bio_alloc_bioset()
    - LP: #1257037
  * HID: wiimote: add LEGO-wiimote VID
    - LP: #1257037
  * cgroup: fix to break the while loop in cgroup_attach_task() correctly
    - LP: #1257037
  * mac80211: correctly close cancelled scans
    - LP: #1257037
  * mac80211: drop spoofed packets in ad-hoc mode
    - LP: #1257037
  * mac80211: use sta_info_get_bss() for nl80211 tx and client probing
    - LP: #1257037
  * mac80211: update sta->last_rx on acked tx frames
    - LP: #1257037
  * mac80211: fix crash if bitrate calculation goes wrong
    - LP: #1257037
  * ath9k: fix tx queue scheduling after channel changes
    - LP: #1257037
  * cfg80211: use the correct macro to check for active monitor support
    - LP: #1257037
  * cfg80211: fix warning when using WEXT for IBSS
    - LP: #1257037
  * mwifiex: fix SDIO interrupt lost issue
    - LP: #1257037
  * rtlwifi: rtl8192cu: Fix error in pointer arithmetic
    - LP: #1257037
  * iwlwifi: mvm: call ieee80211_scan_completed when needed
    - LP: #1257037
  * iwlwifi: pcie: add SKUs for 6000, 6005 and 6235 series
    - LP: #1257037
  * jfs: fix error path in ialloc
    - LP: #1257037
  * can: at91-can: fix device to driver data mapping for platform devices
    - LP: #1257037
  * can: flexcan: fix mx28 detection by rearanging OF match table
    - LP: #1257037
  * can: flexcan: flexcan_chip_start: fix regression, mark one MB for TX
    and abort pending TX
    - LP: #1257037
  * SCSI: BusLogic: Fix an oops when intializing multimaster adapter
    - LP: #1257037
  * SCSI: sd: call blk_pm_runtime_init before add_disk
    - LP: #1257037
  * ecryptfs: Fix memory leakage in keystore.c
    - LP: #1257037
  * raid5: set bio bi_vcnt 0 for discard request
    - LP: #1257037
  * raid5: avoid finding "discard" stripe
    - LP: #1257037
  * libata: make ata_eh_qc_retry() bump scmd->allowed on bogus failures
    - LP: #1257037
  * md: avoid deadlock when md_set_badblocks.
    - LP: #1257037
  * md: Fix skipping recovery for read-only arrays.
    - LP: #1257037
  * target: Fix assignment of LUN in tracepoints
    - LP: #1257037
  * target/pscsi: fix return value check
    - LP: #1257037
  * vhost/scsi: Fix incorrect usage of get_user_pages_fast write parameter
    - LP: #1257037
  * clockevents: Sanitize ticks to nsec conversion
    - LP: #1257037
  * parisc: Do not crash 64bit SMP kernels on machines with >= 4GB RAM
    - LP: #1257037
  * scripts/kallsyms: filter symbols not in kernel address space
    - LP: #1257037
  * ARC: Incorrect mm reference used in vmalloc fault handler
    - LP: #1257037
  * ALSA: hda - Add missing initial vmaster hook at build_controls callback
    - LP: #1257037
  * ALSA: hda - Fix unbalanced runtime PM refcount after S3/S4
    - LP: #1257037
  * ALSA: hda - Add a fixup for ASUS N76VZ
    - LP: #1257037
  * ALSA: fix oops in snd_pcm_info() caused by ASoC DPCM
    - LP: #1257037
  * ASoC: wm_hubs: Add missing break in hp_supply_event()
    - LP: #1257037
  * ASoC: dapm: Fix source list debugfs outputs
    - LP: #1257037
  * staging: ozwpan: prevent overflow in oz_cdev_write()
    - LP: #1257037
  * Staging: bcm: info leak in ioctl
    - LP: #1257037
  * Staging: sb105x: info leak in mp_get_count()
    - LP: #1257037
  * staging: wlags49_h2: buffer overflow setting station name
    - LP: #1257037
  * uml: check length in exitcode_proc_write()
    - LP: #1257037
  * xtensa: don't use alternate signal stack on threads
    - LP: #1257037
  * mm: make generic_access_phys available for modules
    - LP: #1257037
  * uio: provide vm access to UIO_MEM_PHYS maps
    - LP: #1257037
  * Fix a few incorrectly checked [io_]remap_pfn_range() calls
    - LP: #1257037
  * lib/scatterlist.c: don't flush_kernel_dcache_page on slab page
    - LP: #1257037
  * aacraid: missing capable() check in compat ioctl
    - LP: #1257037
  * clk: fixup argument order when setting VCO parameters
    - LP: #1257037
  * clk: nomadik: set all timers to use 2.4 MHz TIMCLK
    - LP: #1257037
  * mm: numa: Do not account for a hinting fault if we raced
    - LP: #1257037
  * mm: Wait for THP migrations to complete during NUMA hinting faults
    - LP: #1257037
  * mm: Prevent parallel splits during THP migration
    - LP: #1257037
  * mm: numa: Sanitize task_numa_fault() callsites
    - LP: #1257037
  * mm: Close races between THP migration and PMD numa clearing
    - LP: #1257037
  * mm: Account for a THP NUMA hinting update as one PTE update
    - LP: #1257037
  * mm: /proc/pid/pagemap: inspect _PAGE_SOFT_DIRTY only on present pages
    - LP: #1257037
  * mm/pagewalk.c: fix walk_page_range() access of wrong PTEs
    - LP: #1257037
  * drm/vmwgfx: Don't put resources with invalid id's on lru list
    - LP: #1257037
  * drm/vmwgfx: Don't kill clients on VT switch
    - LP: #1257037
  * drm/i915: split aux_clock_divider logic in a separated function for
    reuse.
    - LP: #1257037
  * drm/i915: Retry DP aux_ch communications with a different clock after
    failure
    - LP: #1257037
  * drm: Prevent overwriting from userspace underallocating core ioctl
    structs
    - LP: #1257037
  * drm: Pad drm_mode_get_connector to 64-bit boundary
    - LP: #1257037
  * drm/radeon/atom: workaround vbios bug in transmitter table on rs780
    - LP: #1257037
  * drm/radeon: make missing smc ucode non-fatal (r7xx-SI)
    - LP: #1257037
  * drm/i915: Add HSW CRT output readout support
    - LP: #1257037
  * drm/i915: Add support for pipe_bpp readout
    - LP: #1257037
  * drm/i915: No LVDS hardware on Intel D410PT and D425KT
    - LP: #1257037
  * drm/i915: Fix the PPT fdi lane bifurcate state handling on ivb
    - LP: #1257037
  * mutex: Avoid gcc version dependent __builtin_constant_p() usage
    - LP: #1257037
  * seq_file: always update file->f_pos in seq_lseek()
    - LP: #1257037
  * NTB: Add Error Handling in ntb_device_setup
    - LP: #1257037
  * NTB: Correct Number of Scratch Pad Registers
    - LP: #1257037
  * NTB: Correct USD/DSD Identification
    - LP: #1257037
  * NTB: Correct debugfs to work with more than 1 NTB Device
    - LP: #1257037
  * Linux 3.11.8
    - LP: #1257037
  * net/mlx4_core: Fix call to __mlx4_unregister_mac
    - LP: #1257037
  * net: sctp: do not trigger BUG_ON in sctp_cmd_delete_tcb
    - LP: #1257037
  * net: flow_dissector: fail on evil iph->ihl
    - LP: #1257037
  * virtio-net: correctly handle cpu hotplug notifier during resuming
    - LP: #1257037
  * xen-netback: use jiffies_64 value to calculate credit timeout
    - LP: #1257037
  * cxgb3: Fix length calculation in write_ofld_wr() on 32-bit
    architectures
    - LP: #1257037
  * tcp: gso: fix truesize tracking
    - LP: #1257037
  * ipv6: ip6_dst_check needs to check for expired dst_entries
    - LP: #1257037
  * ipv6: reset dst.expires value when clearing expire flag
    - LP: #1257037
  * xen-netback: Handle backend state transitions in a more robust way
    - LP: #1257037
  * xen-netback: transition to CLOSED when removing a VIF
    - LP: #1257037
  * Thermal: x86_pkg_temp: change spin lock
    - LP: #1257037
  * hyperv-fb: add pci stub
    - LP: #1257037
  * USB: add new zte 3g-dongle's pid to option.c
    - LP: #1257037
  * ALSA: hda - hdmi: Fix reported channel map on common default layouts
    - LP: #1257037
  * tracing: Fix potential out-of-bounds in trace_get_user()
    - LP: #1257037
  * drm/i915/dp: workaround BIOS eDP bpp clamping issue
    - LP: #1257037
  * perf: Fix perf ring buffer memory ordering
    - LP: #1257037
  * iwlwifi: pcie: add new SKUs for 7000 & 3160 NIC series
    - LP: #1257037
  * misc: atmel_pwm: add deferred-probing support
    - LP: #1257037
  * backlight: atmel-pwm-bl: fix deferred probe from __init
    - LP: #1257037
  * usb: fix cleanup after failure in hub_configure()
    - LP: #1257037
  * usb: fail on usb_hub_create_port_device() errors
    - LP: #1257037
  * usbcore: set lpm_capable field for LPM capable root hubs
    - LP: #1257037
  * media: sh_vou: almost forever loop in sh_vou_try_fmt_vid_out()
    - LP: #1257037
  * Linux 3.11.9
    - LP: #1257037
  * ACPICA: DeRefOf operator: Update to fully resolve FieldUnit and
    BufferField refs.
    - LP: #1257037
  * libertas: potential oops in debugfs
    - LP: #1257037
  * aacraid: prevent invalid pointer dereference
    - LP: #1257037
  * ACPICA: Return error if DerefOf resolves to a null package element.
    - LP: #1257037
  * ACPICA: Fix for a Store->ArgX when ArgX contains a reference to a
    field.
    - LP: #1257037
  * USB: mos7840: fix tiocmget error handling
    - LP: #1257037
  * can: kvaser_usb: fix usb endpoints detection
    - LP: #1257037
  * Btrfs: relocate csums properly with prealloc extents
    - LP: #1257037
  * crypto: ansi_cprng - Fix off by one error in non-block size request
    - LP: #1257037
  * crypto: s390 - Fix aes-cbc IV corruption
    - LP: #1257037
  * can: c_can: Fix RX message handling, handle lost message before EOB
    - LP: #1257037
  * alx: Reset phy speed after resume
    - LP: #1257037
  * ipc,shm: correct error return value in shmctl (SHM_UNLOCK)
    - LP: #1257037
  * ipc,shm: fix shm_file deletion races
    - LP: #1257037
  * drm/nvc0-/gr: fix a number of missing explicit array terminators...
    - LP: #1257037
  * thinkpad_acpi: Fix build error when CONFIG_SND_MAX_CARDS > 32
    - LP: #1257037
  * SUNRPC: don't map EKEYEXPIRED to EACCES in call_refreshresult
    - LP: #1257037
  * sched, idle: Fix the idle polling state logic
    - LP: #1257037
  * PCI: Allow PCIe Capability link-related register access for switches
    - LP: #1257037
  * PCI: Remove PCIe Capability version checks
    - LP: #1257037
  * PCI: Support PCIe Capability Slot registers only for ports with slots
    - LP: #1257037
  * perf/ftrace: Fix paranoid level for enabling function tracer
    - LP: #1257037
  * ACPI / EC: Ensure lock is acquired before accessing ec struct members
    - LP: #1257037
  * ACPI / video: Quirk initial backlight level 0
    - LP: #1257037
  * ACPI / hotplug: Fix handle_root_bridge_removal()
    - LP: #1257037
  * ACPI / hotplug: Do not execute "insert in progress" _OST
    - LP: #1257037
  * Staging: zram: Fix access of NULL pointer
    - LP: #1257037
  * staging: comedi: avoid memleak for subdevice private
    - LP: #1257037
  * Drivers: hv: vmbus: Fix a bug in channel rescind code
    - LP: #1257037
  * rt2x00: fix a crash bug in the HT descriptor handling fix
    - LP: #1257037
  * rt2x00: check if device is still available on rt2x00mac_flush()
    - LP: #1257037
  * rt2x00: rt2800lib: fix VGC adjustment for RT5592
    - LP: #1257037
  * rt2x00: fix HT TX descriptor settings regression
    - LP: #1257037
  * Bluetooth: revert: "Bluetooth: Add missing reset_resume dev_pm_ops"
    - LP: #1257037
  * exec/ptrace: fix get_dumpable() incorrect tests
    - LP: #1257037
    - CVE-2013-2929
  * Linux 3.11.10
    - LP: #1257037
 -- Brad Figg <brad.figg@xxxxxxxxxxxxx>   Mon, 09 Dec 2013 09:41:31 -0800

** Changed in: linux (Ubuntu)
       Status: Incomplete => Fix Released

** CVE added: http://www.cve.mitre.org/cgi-
bin/cvename.cgi?name=2013-2929

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1239800

Title:
  Soft lockup when running bonnie++ only at 1600 mt/s

Status in “linux” package in Ubuntu:
  Fix Released

Bug description:
  SRU Justification:

  Impact: running a test like bonnie++ makes the system instable and
  prone to hangs.

  Fix: apply the attached patches and recompile a kernel.

  Test case: leave bonnie running in a loop for 24hrs.

  --

  When bonnie++ was run in a loop, the system exhibits a hang behavior with
  "rcu_sched: self-detected stall on CPU"
  The time to error can be inconsistent.  One time it took 7 hours and the next time more than 2 days.

  Commands to reproduce the failure:
  $ sudo apt-get install bonnie++
  $ mkdir bonnie
  $ while true; do bonnie++ -d bonnie; done &>>bonnie0.log &

  Stack trace:
  [237019.072290] INFO: rcu_sched self-detected stall on CPU { 1} (t=19305216 jiffies g=580389 c=580388 q=84)
  [237019.080901] CPU: 1 PID: 44 Comm: kswapd0 Tainted: GF 3.11.0-6-generic-lpae #12-Ubuntu
  [237019.088879] [<c002bc00>] (unwind_backtrace+0x0/0x138) from [<c0026f1c>] (show_stack+0x10/0x14)
  [237019.096700] [<c0026f1c>] (show_stack+0x10/0x14) from [<c05cbe50>] (dump_stack+0x74/0x90)
  [237019.104051] [<c05cbe50>] (dump_stack+0x74/0x90) from [<c00bf37c>] (rcu_check_callbacks+0x31c/0x798)
  [237019.112262] [<c00bf37c>] (rcu_check_callbacks+0x31c/0x798) from [<c00492a0>] (update_process_times+0x38/0x64)
  [237019.121254] [<c00492a0>] (update_process_times+0x38/0x64) from [<c008cdbc>] (tick_sched_handle+0x54/0x60)
  [237019.129933] [<c008cdbc>] (tick_sched_handle+0x54/0x60) from [<c008d00c>] (tick_sched_timer+0x44/0x74)
  [237019.138300] [<c008d00c>] (tick_sched_timer+0x44/0x74) from [<c005db50>] (__run_hrtimer+0x74/0x1d4)
  [237019.146433] [<c005db50>] (__run_hrtimer+0x74/0x1d4) from [<c005e6f8>] (hrtimer_interrupt+0x10c/0x2c0)
  [237019.154800] [<c005e6f8>] (hrtimer_interrupt+0x10c/0x2c0) from [<c0492e44>] (arch_timer_handler_phys+0x28/0x30)
  [237019.163871] [<c0492e44>] (arch_timer_handler_phys+0x28/0x30) from [<c00b8c2c>] (handle_percpu_devid_irq+0x6c/0x104)
  [237019.173332] [<c00b8c2c>] (handle_percpu_devid_irq+0x6c/0x104) from [<c00b54ec>] (generic_handle_irq+0x20/0x30)
  [237019.182402] [<c00b54ec>] (generic_handle_irq+0x20/0x30) from [<c0023ff4>] (handle_IRQ+0x38/0x94)
  [237019.190378] [<c0023ff4>] (handle_IRQ+0x38/0x94) from [<c0008508>] (gic_handle_irq+0x28/0x5c)
  [237019.198041] [<c0008508>] (gic_handle_irq+0x28/0x5c) from [<c05d1c00>] (__irq_svc+0x40/0x50)
  [237019.205624] Exception stack(0xee2c1c18 to 0xee2c1c60)
  [237019.210238] 1c00: 00000004 00000004
  [237019.217666] 1c20: 00000008 00000001 ee2c1c8c ca208700 ca208700 0996b000 ca208708 00000001
  [237019.225093] 1c40: 00000002 edb31300 00000003 ee2c1c60 c02f54fc c00923c8 200f0013 ffffffff
  [237019.232523] [<c05d1c00>] (__irq_svc+0x40/0x50) from [<c00923c8>] (generic_exec_single+0x6c/0x94)
  [237019.240500] [<c00923c8>] (generic_exec_single+0x6c/0x94) from [<c00924f4>] (smp_call_function_single+0x104/0x198)
  [237019.249805] [<c00924f4>] (smp_call_function_single+0x104/0x198) from [<c0029920>] (broadcast_tlb_mm_a15_erratum+0x7c/0x84)
  [237019.259812] [<c0029920>] (broadcast_tlb_mm_a15_erratum+0x7c/0x84) from [<c0029adc>] (flush_tlb_page+0x74/0xa8)
  [237019.268882] [<c0029adc>] (flush_tlb_page+0x74/0xa8) from [<c011fc8c>] (ptep_clear_flush_young+0x6c/0xd0)
  [237019.277484] [<c011fc8c>] (ptep_clear_flush_young+0x6c/0xd0) from [<c011a60c>] (page_referenced_one+0x64/0x1fc)
  [237019.286554] [<c011a60c>] (page_referenced_one+0x64/0x1fc) from [<c011c034>] (page_referenced+0xf4/0x2e4)
  [237019.295155] [<c011c034>] (page_referenced+0xf4/0x2e4) from [<c00fc410>] (shrink_active_list+0x1f0/0x35c)
  [237019.303756] [<c00fc410>] (shrink_active_list+0x1f0/0x35c) from [<c00fdadc>] (shrink_lruvec+0x32c/0x598)
  [237019.312279] [<c00fdadc>] (shrink_lruvec+0x32c/0x598) from [<c00fddb0>] (shrink_zone+0x68/0x180)
  [237019.320176] [<c00fddb0>] (shrink_zone+0x68/0x180) from [<c00fe430>] (kswapd+0x568/0x9d4)
  [237019.327527] [<c00fe430>] (kswapd+0x568/0x9d4) from [<c005aae0>] (kthread+0xa4/0xb0)
  [237019.334487] [<c005aae0>] (kthread+0xa4/0xb0) from [<c0023198>] (ret_from_fork+0x14/0x3c)

  Setup details:
  Quad-core A15 server nodes on Calxeda Midway hardware.
  The failure has been seen two times with DDR setting of DDR3@1600mt/s

  cat /proc/version_signature
  Ubuntu 3.11.0-12.18-generic-lpae 3.11.3
  The issue was first seen on Ubuntu 3.11.0-6.12-generic-lpae

  cat /etc/issue
  Ubuntu 13.04 \n \l

  Additional debug information attached
  ---
  Architecture: armhf
  DistroRelease: Ubuntu 13.04
  MarkForUpload: True
  Package: linux (not installed)
  ProcEnviron:
   LANGUAGE=en_US:
   TERM=vt102
   PATH=(custom, no user)
   LANG=en_US
   SHELL=/bin/bash
  Uname: Linux 3.11.0-12-generic-lpae armv7l
  UserGroups: adm cdrom dip lpadmin plugdev sambashare sudo

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1239800/+subscriptions


References