← Back to team overview

kernel-packages team mailing list archive

[Bug 1530405] Re: NMI watchdog: BUG: soft lockup - CPU#0 stuck for 22s! [kerneloops:814]

 

this problem was apparently fixed for me, after I install
4.5.0-040500-generic ( from http://kernel.ubuntu.com/~kernel-
ppa/mainline/v4.5-wily/ ), but after 2016-05-28 ubuntu packages update,
i unable to start Ubuntu 16.04 with any of the installed kernels:

4.5.0-040500-generic ( from http://kernel.ubuntu.com/~kernel-ppa/mainline/v4.5-wily/ )
4.4.0-22-generic
4.4.0-21-generic

I can confirm that this problem also occurs with x86_64

Until the moment, I find two things:

I can only start the ubuntu 16.04 if:

1 - Disable Multi Core support on Dell BIOS,

or

2 - add "acpi=off" on grub start line on system start

I think it's a problem with the processor (bug) or a bug with the
identify/access the ACPI features

I really want to find a definitive solution to this problem.

If you need more information or need me to run a program, script, or
compile the kernel to test (the last time I did it was in Ubuntu 8.04).

please tell me what needs to be run and how, I'll be happy to help.

I've attached the kernel log que contains the problem traceback

this is lscpu from the system started after disable Multi Core support
on BIOS:

fabio@luuzfcb:~$ lscpu
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Byte Order: Little Endian
CPU(s): 2
On-line CPU(s) list: 0,1
Thread(s) per core: 2
Core(s) per socket: 1
Socket(s): 1
NUMA node(s): 1
Vendor ID: GenuineIntel
CPU family: 6
Model: 61
Model name: Intel(R) Core(TM) i7-5500U CPU @ 2.40GHz
Stepping: 4
CPU MHz: 799.968
CPU max MHz: 3000.0000
CPU min MHz: 500.0000
BogoMIPS: 4788.86
Virtualization: VT-x
L1d cache: 32K
L1i cache: 32K
L2 cache: 256K
L3 cache: 4096K
NUMA node0 CPU(s): 0,1
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc aperfmperf eagerfpu pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch epb intel_pt tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid rdseed adx smap xsaveopt dtherm ida arat pln pts


Relevant traceback from kern.log:

May 28 13:56:30 luuzfcb kernel: [ 9535.840881] NMI watchdog: BUG: soft lockup - CPU#2 stuck for 22s! [kworker/2:3:6541]
May 28 13:56:30 luuzfcb kernel: [ 9535.840885] Modules linked in: drbg ansi_cprng ctr ccm rfcomm nvram msr xt_conntrack ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 xt_addrtype iptable_filter ip_tables x_tables nf_nat nf_conntrack br_netfilter bridge stp llc dm_thin_pool dm_persistent_data dm_bio_prison dm_bufio libcrc32c bbswitch(OE) bnep nls_iso8859_1 nvidia_uvm(POE) nvidia_modeset(POE) nvidia(POE) uvcvideo btusb btrtl btbcm btintel videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_core bluetooth videodev media arc4 iwlmvm snd_soc_rt5640 dell_wmi sparse_keymap mxm_wmi intel_rapl x86_pkg_temp_thermal intel_powerclamp dell_led snd_hda_codec_hdmi snd_soc_rl6231 snd_soc_ssm4567 coretemp kvm_intel snd_soc_core mac80211 snd_hda_codec_realtek snd_hda_codec_generic snd_hda_intel snd_hda_codec snd_compress ac97_bus snd_pcm_dmaengine snd_hda_core kvm iwlwifi snd_hwdep snd_pcm dell_laptop dcdbas dell_smm_hwmon snd_seq_midi snd_seq_midi_event irqbypass crct10dif_pclmul crc32_pclmul snd_rawmidi ghash_clmulni_intel snd_seq cfg80211 aesni_intel aes_x86_64 lrw gf128mul mei_me acpi_als glue_helper kfifo_buf snd_seq_device ablk_helper snd_timer joydev input_leds cryptd mei industrialio snd serio_raw lpc_ich snd_soc_sst_acpi snd_soc_sst_match soundcore elan_i2c spi_pxa2xx_platform mac_hid wmi dw_dmac shpchp dw_dmac_core dell_rbtn i2c_designware_platform i2c_designware_core 8250_dw acpi_pad parport_pc ppdev lp parport autofs4 hid_generic usbhid i915 i2c_algo_bit drm_kms_helper psmouse syscopyarea sysfillrect sysimgblt fb_sys_fops ahci drm libahci r8169 mii video fjes i2c_hid hid sdhci_acpi sdhci
May 28 13:56:30 luuzfcb kernel: [ 9535.840944] CPU: 2 PID: 6541 Comm: kworker/2:3 Tainted: P        W  OEL  4.5.0-040500-generic #201603140130
May 28 13:56:30 luuzfcb kernel: [ 9535.840945] Hardware name: Dell Inc. Latitude 3450/0J6K1N, BIOS A10 01/13/2016
May 28 13:56:30 luuzfcb kernel: [ 9535.840950] Workqueue: events bpf_prog_free_deferred
May 28 13:56:30 luuzfcb kernel: [ 9535.840952] task: ffff880221e03900 ti: ffff8801f7eb4000 task.ti: ffff8801f7eb4000
May 28 13:56:30 luuzfcb kernel: [ 9535.840953] RIP: 0010:[<ffffffff81104125>]  [<ffffffff81104125>] smp_call_function_many+0x1f5/0x250
May 28 13:56:30 luuzfcb kernel: [ 9535.840956] RSP: 0018:ffff8801f7eb7bb0  EFLAGS: 00000202
May 28 13:56:30 luuzfcb kernel: [ 9535.840957] RAX: 0000000000000003 RBX: 0000000000000100 RCX: 0000000000000001
May 28 13:56:30 luuzfcb kernel: [ 9535.840958] RDX: ffff88022f49ab98 RSI: 0000000000000100 RDI: ffff88022f517b08
May 28 13:56:30 luuzfcb kernel: [ 9535.840959] RBP: ffff8801f7eb7be8 R08: 0000000000000000 R09: 000000000000000b
May 28 13:56:30 luuzfcb kernel: [ 9535.840960] R10: 0000000000000004 R11: ffff88022f517b08 R12: ffff88022f517b08
May 28 13:56:30 luuzfcb kernel: [ 9535.840960] R13: ffff88022f517b00 R14: ffffffff81071b50 R15: ffff8801f7eb7c20
May 28 13:56:30 luuzfcb kernel: [ 9535.840962] FS:  0000000000000000(0000) GS:ffff88022f500000(0000) knlGS:0000000000000000
May 28 13:56:30 luuzfcb kernel: [ 9535.840963] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
May 28 13:56:30 luuzfcb kernel: [ 9535.840963] CR2: 00007fd61ccfb1f0 CR3: 0000000002e0b000 CR4: 00000000003406e0
May 28 13:56:30 luuzfcb kernel: [ 9535.840964] Stack:
May 28 13:56:30 luuzfcb kernel: [ 9535.840965]  0000000000017ac0 0100000000000001 ffff8801f7eb7cd8 ffffffff81071b50
May 28 13:56:30 luuzfcb kernel: [ 9535.840967]  ffff8801f7eb7c20 ffff88022f590e68 ffff88022f590e38 ffff8801f7eb7c10
May 28 13:56:30 luuzfcb kernel: [ 9535.840968]  ffffffff8110425d ffff8801f7eb7cd8 ffff8801f7eb7c70 ffff8801f7eb7c30
May 28 13:56:30 luuzfcb kernel: [ 9535.840970] Call Trace:
May 28 13:56:30 luuzfcb kernel: [ 9535.840974]  [<ffffffff81071b50>] ? rbt_memtype_copy_nth_element+0xa0/0xa0
May 28 13:56:30 luuzfcb kernel: [ 9535.840975]  [<ffffffff8110425d>] on_each_cpu+0x2d/0x60
May 28 13:56:30 luuzfcb kernel: [ 9535.840977]  [<ffffffff81072349>] flush_tlb_kernel_range+0x79/0x80
May 28 13:56:30 luuzfcb kernel: [ 9535.840981]  [<ffffffff811d077b>] __purge_vmap_area_lazy+0x31b/0x380
May 28 13:56:30 luuzfcb kernel: [ 9535.840983]  [<ffffffff811d08f7>] vm_unmap_aliases+0x117/0x140
May 28 13:56:30 luuzfcb kernel: [ 9535.840985]  [<ffffffff8106d7de>] change_page_attr_set_clr+0xee/0x4f0
May 28 13:56:30 luuzfcb kernel: [ 9535.840987]  [<ffffffff8106e8bf>] set_memory_rw+0x2f/0x40
May 28 13:56:30 luuzfcb kernel: [ 9535.840988]  [<ffffffff8107c8af>] bpf_jit_free+0x3f/0x4c
May 28 13:56:30 luuzfcb kernel: [ 9535.840989]  [<ffffffff81170ed2>] bpf_prog_free_deferred+0x12/0x20
May 28 13:56:30 luuzfcb kernel: [ 9535.840992]  [<ffffffff81099f45>] process_one_work+0x165/0x480
May 28 13:56:30 luuzfcb kernel: [ 9535.840993]  [<ffffffff8109a2ab>] worker_thread+0x4b/0x500
May 28 13:56:30 luuzfcb kernel: [ 9535.840995]  [<ffffffff8109a260>] ? process_one_work+0x480/0x480
May 28 13:56:30 luuzfcb kernel: [ 9535.840996]  [<ffffffff8109a260>] ? process_one_work+0x480/0x480
May 28 13:56:30 luuzfcb kernel: [ 9535.840998]  [<ffffffff810a04e8>] kthread+0xd8/0xf0
May 28 13:56:30 luuzfcb kernel: [ 9535.840999]  [<ffffffff810a0410>] ? kthread_create_on_node+0x1a0/0x1a0
May 28 13:56:30 luuzfcb kernel: [ 9535.841002]  [<ffffffff8182398f>] ret_from_fork+0x3f/0x70
May 28 13:56:30 luuzfcb kernel: [ 9535.841003]  [<ffffffff810a0410>] ? kthread_create_on_node+0x1a0/0x1a0
May 28 13:56:30 luuzfcb kernel: [ 9535.841004] Code: d2 e8 00 0d 2f 00 3b 05 1e 6f e3 00 89 c1 0f 8d 99 fe ff ff 48 98 49 8b 55 00 48 03 14 c5 80 8f f3 81 8b 42 18 a8 01 74 ca f3 90 <8b> 42 18 a8 01 75 f7 eb bf 0f b6 4d d0 4c 89 fa 4c 89 f6 44 89 


** Attachment added: "kern.log.1"
   https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1530405/+attachment/4680428/+files/kern.log.1

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1530405

Title:
  NMI watchdog: BUG: soft lockup - CPU#0 stuck for 22s! [kerneloops:814]

Status in linux package in Ubuntu:
  Triaged

Bug description:
  I'm using Ubuntu Xenial 16.04 and my computer (ASUS M32BF) will
  randomly freeze up, sometimes before the login screen, sometimes while
  I'm in the middle of using a program. This sometimes happens on the
  Wily 15.10 live cd as well, and on both kernel 4.3.0-2, and kernel
  4.2.0-22.

  Important part of log:

  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112129] NMI watchdog: BUG: soft lockup - CPU#0 stuck for 22s! [kerneloops:814]
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112134] Modules linked in: rfcomm bnep nls_iso8859_1 kvm_amd kvm eeepc_wmi asus_wmi crct10dif_pclmul sparse_keymap crc32_pclmul aesni_intel aes_x86_64 arc4 lrw gf128mul rtl8821ae glue_helper snd_hda_codec_realtek ablk_helper snd_hda_codec_generic snd_hda_codec_hdmi snd_hda_intel btcoexist snd_hda_codec rtl_pci snd_hda_core joydev input_leds snd_hwdep rtlwifi snd_pcm fam15h_power cryptd snd_seq_midi serio_raw snd_seq_midi_event snd_rawmidi mac80211 snd_seq snd_seq_device snd_timer cfg80211 btusb btrtl btbcm btintel bluetooth snd soundcore edac_mce_amd k10temp edac_core i2c_piix4 shpchp mac_hid parport_pc ppdev lp parport autofs4 hid_logitech_hidpp uas usb_storage hid_logitech_dj usbhid hid amdkfd amd_iommu_v2 radeon i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops r8169 psmouse mii drm ahci libahci wmi fjes video
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112183] CPU: 0 PID: 814 Comm: kerneloops Not tainted 4.3.0-2-generic #11-Ubuntu
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112184] Hardware name: ASUSTeK COMPUTER INC. K30BF_M32BF_A_F_K31BF_6/K30BF_M32BF_A_F_K31BF_6, BIOS 0501 07/09/2015
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112186] task: ffff88031146d400 ti: ffff88030e838000 task.ti: ffff88030e838000
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112188] RIP: 0010:[<ffffffff810819d6>]  [<ffffffff810819d6>] __do_softirq+0x76/0x250
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112194] RSP: 0018:ffff88031fc03f30  EFLAGS: 00000202
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112196] RAX: ffff88030e83c000 RBX: 0000000000000000 RCX: 0000000040400040
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112197] RDX: 0000000000000000 RSI: 000000000000613e RDI: 0000000000000380
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112198] RBP: ffff88031fc03f80 R08: 00000029f8fa1411 R09: ffff88031fc169f0
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112199] R10: 0000000000000020 R11: 0000000000000004 R12: ffff88031fc169c0
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112201] R13: ffff88030df8e200 R14: 0000000000000000 R15: 0000000000000001
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112202] FS:  00007f6c266ac880(0000) GS:ffff88031fc00000(0000) knlGS:0000000000000000
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112203] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112205] CR2: 00007ffef000bff8 CR3: 000000030f368000 CR4: 00000000000406f0
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112206] Stack:
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112207]  404000401fc03f78 ffff88030e83c000 00000000ffff0121 ffff88030000000a
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112209]  000000021fc0d640 0000000000000000 ffff88031fc169c0 ffff88030df8e200
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112211]  0000000000000000 0000000000000001 ffff88031fc03f90 ffffffff81081d23
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112213] Call Trace:
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112215]  <IRQ> 
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112219]  [<ffffffff81081d23>] irq_exit+0xa3/0xb0
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112222]  [<ffffffff817fda02>] smp_apic_timer_interrupt+0x42/0x50
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112225]  [<ffffffff817fb862>] apic_timer_interrupt+0x82/0x90
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112226]  <EOI> 
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112230]  [<ffffffff810a5457>] ? finish_task_switch+0x67/0x1c0
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112232]  [<ffffffff817f645c>] __schedule+0x36c/0x980
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112234]  [<ffffffff817f6aa3>] schedule+0x33/0x80
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112236]  [<ffffffff817f9e4f>] do_nanosleep+0x6f/0xf0
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112239]  [<ffffffff810ea59c>] hrtimer_nanosleep+0xdc/0x1f0
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112241]  [<ffffffff810e9500>] ? __hrtimer_init+0x90/0x90
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112243]  [<ffffffff817f9e3a>] ? do_nanosleep+0x5a/0xf0
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112245]  [<ffffffff810ea72a>] SyS_nanosleep+0x7a/0x90
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112247]  [<ffffffff817faaf2>] entry_SYSCALL_64_fastpath+0x16/0x71
  Dec 31 19:13:12 COMPUTERNAME kernel: [   64.112248] Code: 45 d4 89 4d b4 65 48 8b 04 25 c4 3e 01 00 c7 45 c8 0a 00 00 00 48 89 45 b8 65 c7 05 31 24 f9 7e 00 00 00 00 fb 66 0f 1f 44 00 00 <b8> ff ff ff ff 49 c7 c4 c0 b0 e0 81 0f bc 45 d4 83 c0 01 89 45 

  This soft lockup happens either in kerneloops or swapper/0.

  This might have something to do with networking, because the soft
  lockups appear happen immediately before or after some network-related
  stuff. nm-applet says NetworkManager is not running, but "service
  network-manager status" says it is. "service network-manager stop"
  does not work, and I need to "kill -9" the pids for the processes.
  After starting the service after killing it, it the nm-applet's "Edit
  Connection" works for a few moments, then it won't delete any
  connections, and when closed, it won't re-open. At then end "kill -9"
  won't even work anymore (the processes get parented to init, but do
  not die). Usually, however, I never even see the login screen.

  I've been able to boot into the Wily livecd by using Windows 10 ->
  Shift-Restart -> UEFI Settings -> then booting my USB with the livecd
  (often soft lockups without going through windows).

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1530405/+subscriptions


References