← Back to team overview

kernel-packages team mailing list archive

[Bug 1004313] Re: watchdog detected hard lockup on cpu 1

 

I have a similar issue on Ubuntu 15.10. While running make runtest for
opencl-caffe (https://github.com/amd/OpenCL-caffe)

$ uname -a
Linux thesun 4.2.0-22-generic #27-Ubuntu SMP Thu Dec 17 22:57:08 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux


Dec 26 01:36:38 thesun kernel: [  926.843143] WARNING: CPU: 1 PID: 2792 at /build/linux-cRemOf/linux-4.2.0/kernel/watchdog.c:311 watchdog_overflow_callback+0x79/0xa0()
Dec 26 01:36:38 thesun kernel: [  926.843144] Watchdog detected hard LOCKUP on cpu 1
Dec 26 01:36:38 thesun kernel: [  926.843144] Modules linked in: rfcomm bnep fglrx(POE) uas intel_rapl usb_storage iosf_mbi x86_pkg_temp_thermal intel_powerclamp arc4 ath9k ath9k_common ath9k_hw hid_generic joydev ath input_leds mac80211 coretemp snd_hda_codec_realtek kvm_intel snd_hda_codec_hdmi snd_hda_codec_generic eeepc_wmi asus_wmi sparse_keymap kvm snd_hda_intel crct10dif_pclmul snd_hda_codec cfg80211 snd_hda_core crc32_pclmul snd_hwdep snd_pcm ath3k btusb snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device btrtl btbcm btintel snd_timer snd aesni_intel bluetooth aes_x86_64 lrw amd_iommu_v2 gf128mul soundcore glue_helper ablk_helper mei_me mei lpc_ich cryptd shpchp serio_raw tpm_infineon mac_hid parport_pc ppdev lp parport autofs4 hid_microsoft usbhid hid mxm_wmi psmouse e1000e ahci libahci ptp pps_core wmi video
Dec 26 01:36:38 thesun kernel: [  926.843169] CPU: 1 PID: 2792 Comm: test.testbin Tainted: P        W  OE   4.2.0-22-generic #27-Ubuntu
Dec 26 01:36:38 thesun kernel: [  926.843170] Hardware name: ASUS All Series/Z87-PRO, BIOS 2103 08/18/2014
Dec 26 01:36:38 thesun kernel: [  926.843171]  0000000000000000 000000000e4dc345 ffff88082ec45aa0 ffffffff817e94c9
Dec 26 01:36:38 thesun kernel: [  926.843172]  0000000000000000 ffff88082ec45af8 ffff88082ec45ae0 ffffffff8107b3d6
Dec 26 01:36:38 thesun kernel: [  926.843173]  0000000000000000 ffff88080afb8000 0000000000000000 ffff88082ec45c00
Dec 26 01:36:38 thesun kernel: [  926.843174] Call Trace:
Dec 26 01:36:38 thesun kernel: [  926.843175]  <NMI>  [<ffffffff817e94c9>] dump_stack+0x45/0x57
Dec 26 01:36:38 thesun kernel: [  926.843180]  [<ffffffff8107b3d6>] warn_slowpath_common+0x86/0xc0
Dec 26 01:36:38 thesun kernel: [  926.843181]  [<ffffffff8107b465>] warn_slowpath_fmt+0x55/0x70
Dec 26 01:36:38 thesun kernel: [  926.843183]  [<ffffffff811329d9>] watchdog_overflow_callback+0x79/0xa0
Dec 26 01:36:38 thesun kernel: [  926.843185]  [<ffffffff81177cc0>] __perf_event_overflow+0x90/0x1c0
Dec 26 01:36:38 thesun kernel: [  926.843186]  [<ffffffff811788c4>] perf_event_overflow+0x14/0x20
Dec 26 01:36:38 thesun kernel: [  926.843188]  [<ffffffff81034521>] intel_pmu_handle_irq+0x1e1/0x460
Dec 26 01:36:38 thesun kernel: [  926.843190]  [<ffffffff8102acf6>] perf_event_nmi_handler+0x26/0x40
Dec 26 01:36:38 thesun kernel: [  926.843192]  [<ffffffff810185f3>] nmi_handle+0x83/0x120
Dec 26 01:36:38 thesun kernel: [  926.843193]  [<ffffffff81018b62>] default_do_nmi+0x42/0x100
Dec 26 01:36:38 thesun kernel: [  926.843194]  [<ffffffff81018d0a>] do_nmi+0xea/0x140
Dec 26 01:36:38 thesun kernel: [  926.843195]  [<ffffffff817f2591>] end_repeat_nmi+0x1a/0x1e
Dec 26 01:36:38 thesun kernel: [  926.843227]  [<ffffffffc06c1395>] ? firegl_trace+0x65/0x1e0 [fglrx]
Dec 26 01:36:38 thesun kernel: [  926.843248]  [<ffffffffc06c13a2>] ? firegl_trace+0x72/0x1e0 [fglrx]
Dec 26 01:36:38 thesun kernel: [  926.843269]  [<ffffffffc06c13a2>] ? firegl_trace+0x72/0x1e0 [fglrx]
Dec 26 01:36:38 thesun kernel: [  926.843290]  [<ffffffffc06c13a2>] ? firegl_trace+0x72/0x1e0 [fglrx]
Dec 26 01:36:38 thesun kernel: [  926.843290]  <<EOE>>  [<ffffffffc06c13a2>] ? firegl_trace+0x72/0x1e0 [fglrx]
Dec 26 01:36:38 thesun kernel: [  926.843331]  [<ffffffffc06c13a2>] ? firegl_trace+0x72/0x1e0 [fglrx]
Dec 26 01:36:38 thesun kernel: [  926.843333]  [<ffffffff811dec6d>] ? __kmalloc+0x1ad/0x250
Dec 26 01:36:38 thesun kernel: [  926.843349]  [<ffffffffc0687193>] ? KCL_MEM_SmallBufferAlloc+0x13/0x20 [fglrx]
Dec 26 01:36:38 thesun kernel: [  926.843351]  [<ffffffff810c377e>] ? down+0x2e/0x50
Dec 26 01:36:38 thesun kernel: [  926.843372]  [<ffffffffc06c9f09>] ? firegl_cmmqs_CWDDE_32+0x199/0x480 [fglrx]
Dec 26 01:36:38 thesun kernel: [  926.843393]  [<ffffffffc06c897e>] ? firegl_cmmqs_CWDDE32+0x8e/0x140 [fglrx]
Dec 26 01:36:38 thesun kernel: [  926.843395]  [<ffffffff810841d7>] ? capable+0x17/0x20
Dec 26 01:36:38 thesun kernel: [  926.843416]  [<ffffffffc06c88f0>] ? firegl_cmmqs_disabledriver+0x120/0x120 [fglrx]
Dec 26 01:36:38 thesun kernel: [  926.843434]  [<ffffffffc0696714>] ? firegl_ioctl+0x1f4/0x260 [fglrx]
Dec 26 01:36:38 thesun kernel: [  926.843436]  [<ffffffffc03064a6>] ? helper_rfc4106_encrypt+0x1b6/0x2f0 [aesni_intel]
Dec 26 01:36:38 thesun kernel: [  926.843451]  [<ffffffffc06841fe>] ? ip_firegl_unlocked_ioctl+0xe/0x20 [fglrx]
Dec 26 01:36:38 thesun kernel: [  926.843453]  [<ffffffff81210aa5>] ? do_vfs_ioctl+0x295/0x480
Dec 26 01:36:38 thesun kernel: [  926.843454]  [<ffffffff81210d09>] ? SyS_ioctl+0x79/0x90
Dec 26 01:36:38 thesun kernel: [  926.843456]  [<ffffffff817f02b2>] ? entry_SYSCALL_64_fastpath+0x16/0x75
Dec 26 01:36:38 thesun kernel: [  926.843457] ---[ end trace 5dc0feee496b4886 ]---
Dec 26 01:36:46 thesun kernel: [  934.914340] ------------[ cut here ]------------

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1004313

Title:
  watchdog detected hard lockup on cpu 1

Status in linux package in Ubuntu:
  Incomplete
Status in openSUSE:
  Won't Fix

Bug description:
  On the latest kernel (3.1 and up?) waking up from sleep may fail with
  the message: "watchdog detected hard lockup on cpu 1"

  Ubuntu 12.04 x64

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1004313/+subscriptions