← Back to team overview

kernel-packages team mailing list archive

[Bug 1403459] Re: kernel bug when hotplugging a CPU

 

Launchpad has imported 8 comments from the remote bug at
https://bugzilla.novell.com/show_bug.cgi?id=908794.

If you reply to an imported comment from within Launchpad, your comment
will be sent to the remote bug automatically. Read more about
Launchpad's inter-bugtracker facilities at
https://help.launchpad.net/InterBugTracking.

------------------------------------------------------------------------
On 2014-12-08T09:58:29+00:00 Benjamin Drung wrote:

When I hot-plug a CPU to an openSUSE VM running on KVM, I get following
kernel oops:

# uname -a
Linux opensuse 3.16.6-2-default #1 SMP Mon Oct 20 13:47:22 UTC 2014 (feb42ea) x86_64 x86_64 x86_64 GNU/Linux
# dmesg
[   54.927758] CPU2 has been hot-added
[   54.928610] smpboot: Booting Node 0 Processor 2 APIC 0x2
[    0.004000] kvm-clock: cpu 2, msr 2:403de081, secondary cpu clock
[   54.940001] TSC synchronization [CPU#1 -> CPU#2]:
[   54.940001] Measured 195274448514 cycles TSC warp between CPUs, turning off TSC clock.
[   54.940001] tsc: Marking TSC unstable due to check_tsc_sync_source failed
[   54.960222] KVM setup async PF for cpu 2
[   54.960265] kvm-stealtime: cpu 2, msr 24008d240
[   54.960342] microcode: CPU2 sig=0x306c1, pf=0x1, revision=0x1
[   55.012464] Will online and init hotplugged CPU: 2
[   55.012574] kobject '(null)' (ffff88024009a3d8): tried to add an uninitialized object, something is seriously wrong.
[   55.012615] CPU: 1 PID: 892 Comm: systemd-udevd Not tainted 3.16.6-2-default #1
[   55.012649] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.7.5-20140531_171129-lamiak 04/01/2014
[   55.012688]  ffff8802347cbd98 ffffffff815ca482 ffff88024009a3d8 ffffffff8130e153
[   55.012729]  ffffffff00000020 ffff8802347cbda8 ffff8802347cbd50 00000001334da7a8
[   55.012766]  000000000000b0a0 0000000001ca6930 0000000000000002 ffff8802334da298
[   55.012804] Call Trace:
[   55.012845]  [<ffffffff81005096>] dump_trace+0x86/0x330
[   55.012881]  [<ffffffff810053d4>] show_stack_log_lvl+0x94/0x170
[   55.012919]  [<ffffffff81006a31>] show_stack+0x21/0x50
[   55.012955]  [<ffffffff815ca482>] dump_stack+0x41/0x51
[   55.012990]  [<ffffffff8130e153>] kobject_add+0xa3/0xb0
[   55.013027]  [<ffffffff812ed4a1>] blk_mq_register_hctx+0x91/0xb0
[   55.013062]  [<ffffffff812edd5e>] blk_mq_sysfs_register+0x3e/0x50
[   55.013095]  [<ffffffff812eb9c2>] blk_mq_queue_reinit_notify+0x72/0xb0
[   55.013129]  [<ffffffff810806bc>] notifier_call_chain+0x4c/0x70
[   55.013162]  [<ffffffff8105beef>] cpu_notify+0x1f/0x40
[   55.013195]  [<ffffffff8105c0d7>] _cpu_up+0x157/0x170
[   55.013226]  [<ffffffff8105c169>] cpu_up+0x79/0xa0
[   55.013260]  [<ffffffff815bf555>] cpu_subsys_online+0x35/0x80
[   55.013294]  [<ffffffff8140c64d>] device_online+0x5d/0xa0
[   55.013328]  [<ffffffff8140c705>] online_store+0x75/0x80
[   55.013360]  [<ffffffff8122127a>] kernfs_fop_write+0xda/0x150
[   55.013394]  [<ffffffff811b1aa2>] vfs_write+0xb2/0x1f0
[   55.013426]  [<ffffffff811b25e2>] SyS_write+0x42/0xa0
[   55.013462]  [<ffffffff815d0c2d>] system_call_fastpath+0x1a/0x1f
[   55.013517]  [<00007f9fbf3804e0>] 0x7f9fbf3804df

I can reproduce it on a Ubuntu 14.10 host with stock KVM and on Debian
host with a custom KVM. The oops sometimes causes the virtual machine to
freeze.

Reply at:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1403459/comments/0

------------------------------------------------------------------------
On 2014-12-08T12:43:55+00:00 Tiwai-r wrote:

Could you check whether the issue is reproduced on 3.17 and 3.18 kernels
available at OBS Kernel:stable and Kernel:HEAD repos?

Reply at:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1403459/comments/1

------------------------------------------------------------------------
On 2014-12-08T13:03:42+00:00 Benjamin Drung wrote:

(In reply to Takashi Iwai from comment #1)
> Could you check whether the issue is reproduced on 3.17 and 3.18 kernels
> available at OBS Kernel:stable and Kernel:HEAD repos?

Same result with linux 3.17 from OBS Kernel:stable:

# uname -a
Linux opensuse 3.17.4-4.g8622a2e-default #1 SMP Sat Dec 6 20:43:41 UTC 2014 (8622a2e) x86_64 x86_64 x86_64 GNU/Linux
# dmesg
[  117.297802] CPU1 has been hot-added
[  117.298543] SMP alternatives: switching to SMP code
[  117.306886] x86: Booting SMP configuration:
[  117.306922] smpboot: Booting Node 0 Processor 1 APIC 0x1
[    0.008000] kvm-clock: cpu 1, msr 2:403de041, secondary cpu clock
[  117.320007] TSC synchronization [CPU#0 -> CPU#1]:
[  117.320007] Measured 407089711478 cycles TSC warp between CPUs, turning off TSC clock.
[  117.320007] tsc: Marking TSC unstable due to check_tsc_sync_source failed
[  117.338582] KVM setup async PF for cpu 1
[  117.338626] kvm-stealtime: cpu 1, msr 24004d240
[  117.338727] kobject '(null)' (ffff88024005a558): tried to add an uninitialized object, something is seriously wrong.
[  117.338728] CPU: 0 PID: 841 Comm: systemd-udevd Not tainted 3.17.4-4.g8622a2e-default #1
[  117.338729] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.7.5-20140531_171129-lamiak 04/01/2014
[  117.338730]  ffff8800bbaf3d98 ffffffff815dc842 ffff88024005a558 ffffffff813181c3
[  117.338731]  ffffffff00000020 ffff8800bbaf3da8 ffff8800bbaf3d50 0000000133a94f28
[  117.338732]  000000000000b0a0 0000000001e779d0 0000000000000001 ffff880233a94a18
[  117.338732] Call Trace:
[  117.338740]  [<ffffffff81005246>] dump_trace+0x86/0x330
[  117.338742]  [<ffffffff81005584>] show_stack_log_lvl+0x94/0x170
[  117.338744]  [<ffffffff81006be1>] show_stack+0x21/0x50
[  117.338746]  [<ffffffff815dc842>] dump_stack+0x41/0x51
[  117.338749]  [<ffffffff813181c3>] kobject_add+0xa3/0xb0
[  117.338753]  [<ffffffff812f74a1>] blk_mq_register_hctx+0x91/0xb0
[  117.338755]  [<ffffffff812f7d5e>] blk_mq_sysfs_register+0x3e/0x50
[  117.338756]  [<ffffffff812f5962>] blk_mq_queue_reinit_notify+0x72/0xb0
[  117.338759]  [<ffffffff8107985c>] notifier_call_chain+0x4c/0x70
[  117.338761]  [<ffffffff8105d6cf>] cpu_notify+0x1f/0x40
[  117.338763]  [<ffffffff8105d8b7>] _cpu_up+0x157/0x170
[  117.338765]  [<ffffffff8105d949>] cpu_up+0x79/0xa0
[  117.338768]  [<ffffffff815d1625>] cpu_subsys_online+0x35/0x80
[  117.338776]  [<ffffffff814150ed>] device_online+0x5d/0xa0
[  117.338779]  [<ffffffff814151a5>] online_store+0x75/0x80
[  117.338782]  [<ffffffff81227c4a>] kernfs_fop_write+0xda/0x150
[  117.338785]  [<ffffffff811b7d62>] vfs_write+0xb2/0x1f0
[  117.338787]  [<ffffffff811b88a2>] SyS_write+0x42/0xa0
[  117.338788]  [<ffffffff815e2f6d>] system_call_fastpath+0x1a/0x1f
[  117.338800]  [<00007f5728fbd4e0>] 0x7f5728fbd4e0
[  117.338826] microcode: CPU1 sig=0x306c1, pf=0x1, revision=0x1
[  117.393791] Will online and init hotplugged CPU: 1

Reply at:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1403459/comments/2

------------------------------------------------------------------------
On 2014-12-08T13:09:34+00:00 Benjamin Drung wrote:

Same result with linux 3.18 from OBS Kernel:HEAD:

# uname -a
Linux opensuse 3.18.0-rc7-2.g088d59b-default #1 SMP Fri Dec 5 00:03:54 UTC 2014 (088d59b) x86_64 x86_64 x86_64 GNU/Linux
# dmesg
[  150.252464] CPU2 has been hot-added
[  150.253350] smpboot: Booting Node 0 Processor 2 APIC 0x2
[    0.004000] kvm-clock: cpu 2, msr 2:403de081, secondary cpu clock
[  150.268003] TSC synchronization [CPU#1 -> CPU#2]:
[  150.268003] Measured 518729842258 cycles TSC warp between CPUs, turning off TSC clock.
[  150.268003] tsc: Marking TSC unstable due to check_tsc_sync_source failed
[  150.285117] KVM setup async PF for cpu 2
[  150.285201] kvm-stealtime: cpu 2, msr 24008c280
[  150.296118] kobject '(null)' (ffffe8ffffc8b5d8): tried to add an uninitialized object, something is seriously wrong.
[  150.296197] CPU: 1 PID: 1386 Comm: systemd-udevd Not tainted 3.18.0-rc7-2.g088d59b-default #1
[  150.296243] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.7.5-20140531_171129-lamiak 04/01/2014
[  150.296285]  0000000000000000 0000000000000002 ffffffff81605f07 ffffe8ffffc8b5d8
[  150.296332]  ffffffff8132c7a0 ffff88023341d370 0000000000000020 ffff8800bb05bd58
[  150.296382]  ffff8800bb05bd08 000000000000a0a0 000000003f441940 0000000000000007
[  150.296427] Call Trace:
[  150.296479]  [<ffffffff81005306>] dump_trace+0x86/0x330
[  150.296541]  [<ffffffff81005644>] show_stack_log_lvl+0x94/0x170
[  150.296596]  [<ffffffff81006d21>] show_stack+0x21/0x50
[  150.296673]  [<ffffffff81605f07>] dump_stack+0x41/0x51
[  150.296721]  [<ffffffff8132c7a0>] kobject_add+0xa0/0xb0
[  150.296767]  [<ffffffff8130aee1>] blk_mq_register_hctx+0x91/0xb0
[  150.296809]  [<ffffffff8130b82e>] blk_mq_sysfs_register+0x3e/0x60
[  150.296857]  [<ffffffff81309298>] blk_mq_queue_reinit_notify+0xf8/0x190
[  150.296900]  [<ffffffff8107cfdc>] notifier_call_chain+0x4c/0x70
[  150.296947]  [<ffffffff8105fd23>] cpu_notify+0x23/0x50
[  150.296988]  [<ffffffff81060037>] _cpu_up+0x157/0x170
[  150.297033]  [<ffffffff810600d9>] cpu_up+0x89/0xb0
[  150.297079]  [<ffffffff815fa5b5>] cpu_subsys_online+0x35/0x80
[  150.297127]  [<ffffffff814323cd>] device_online+0x5d/0xa0
[  150.297185]  [<ffffffff81432485>] online_store+0x75/0x80
[  150.297238]  [<ffffffff81236a5a>] kernfs_fop_write+0xda/0x150
[  150.297309]  [<ffffffff811c5532>] vfs_write+0xb2/0x1f0
[  150.297367]  [<ffffffff811c5f42>] SyS_write+0x42/0xb0
[  150.297442]  [<ffffffff8160c4ed>] system_call_fastpath+0x16/0x1b
[  150.297493]  [<00007f0132fb24e0>] 0x7f0132fb24e0
[  150.297583] microcode: CPU2 sig=0x306c1, pf=0x1, revision=0x1
[  150.297727] Will online and init hotplugged CPU: 2

Reply at:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1403459/comments/3

------------------------------------------------------------------------
On 2014-12-08T13:11:21+00:00 Tiwai-r wrote:

Thanks.  So the bug still remains in the upstream,  so we need to report
it.

Reply at:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1403459/comments/4

------------------------------------------------------------------------
On 2014-12-10T14:18:25+00:00 Tiwai-r wrote:

OK, I could reproduce this easily, too, and spotted the culprit.
The fix patch is below.  I'm going to send the fix to upstream.

Reply at:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1403459/comments/5

------------------------------------------------------------------------
On 2014-12-10T14:18:49+00:00 Tiwai-r wrote:

Created attachment 616585
Fix patch

Reply at:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1403459/comments/6

------------------------------------------------------------------------
On 2014-12-14T10:27:03+00:00 Tiwai-r wrote:

The patch was merged to Linus tree, and backported to openSUSE 13.2 and
stable branches now.  SLE12 doesn't need this.

Reply at:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1403459/comments/7


** Changed in: linux (openSUSE)
       Status: Unknown => Fix Released

** Changed in: linux (openSUSE)
   Importance: Unknown => Medium

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1403459

Title:
   kernel bug when hotplugging a CPU

Status in linux package in Ubuntu:
  Incomplete
Status in linux package in Debian:
  Unknown
Status in linux package in openSUSE:
  Fix Released

Bug description:
  When I hot-plug a CPU to an Ubuntu 14.10 VM running on KVM, I get
  following kernel bug. Ubuntu 14.04 is not affected by this bug.

  # uname -a
  Linux ubuntu 3.16.0-28-generic #38-Ubuntu SMP Fri Dec 12 17:37:40 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
  # dmesg
  [  109.469094] CPU1 has been hot-added
  [  109.477005] SMP alternatives: switching to SMP code
  [  109.490583] x86: Booting SMP configuration:
  [  109.490586] smpboot: Booting Node 0 Processor 1 APIC 0x1
  [  109.476344] kvm-clock: cpu 1, msr 0:3ffe6041, secondary cpu clock
  [  109.586704] kvm: enabling virtualization on CPU1
  [  109.596330] KVM setup async PF for cpu 1
  [  109.596330] kvm-stealtime: cpu 1, msr 3de2d100
  [  109.596645] microcode: CPU1: patch_level=0x01000065
  [  109.596910] Will online and init hotplugged CPU: 1
  [  109.598469] kobject '(null)' (ffff88003de3b5d8): tried to add an uninitialized object, something is seriously wrong.
  [  109.599255] CPU: 0 PID: 1424 Comm: systemd-udevd Not tainted 3.16.0-28-generic #38-Ubuntu
  [  109.599257] Hardware name: Bochs Bochs, BIOS Bochs 01/01/2011
  [  109.599259]  ffff880036465250 ffff88003af77c88 ffffffff81781eaa ffff88003de3b5d8
  [  109.599263]  ffff88003af77cf8 ffffffff8139e896 ffff880000000020 ffff88003af77d08
  [  109.599266]  ffff88003af77cb0 ffff88003af77cc8 ffffffff8137967b 0000000000000001
  [  109.599269] Call Trace:
  [  109.599276]  [<ffffffff81781eaa>] dump_stack+0x45/0x56
  [  109.599280]  [<ffffffff8139e896>] kobject_add+0xa6/0xb0
  [  109.599283]  [<ffffffff8137967b>] ? get_first_sibling+0x2b/0x40
  [  109.599287]  [<ffffffff8139d08d>] ? ida_get_new_above+0x1dd/0x210
  [  109.599291]  [<ffffffff81378b66>] blk_mq_register_hctx+0xa6/0xc0
  [  109.599293]  [<ffffffff813794c1>] blk_mq_sysfs_register+0x41/0x60
  [  109.599297]  [<ffffffff81376fc2>] blk_mq_queue_reinit_notify+0x72/0xb0
  [  109.599300]  [<ffffffff8109972d>] notifier_call_chain+0x4d/0x70
  [  109.599303]  [<ffffffff8109983e>] __raw_notifier_call_chain+0xe/0x10
  [  109.599306]  [<ffffffff81070133>] cpu_notify+0x23/0x50
  [  109.599309]  [<ffffffff81070348>] _cpu_up+0x168/0x180
  [  109.599312]  [<ffffffff810703ea>] cpu_up+0x8a/0xb0
  [  109.599315]  [<ffffffff81778ee0>] cpu_subsys_online+0x40/0xa0
  [  109.599319]  [<ffffffff814cf225>] device_online+0x65/0x90
  [  109.599322]  [<ffffffff814cf2cd>] online_store+0x7d/0x90
  [  109.599325]  [<ffffffff814cc658>] dev_attr_store+0x18/0x30
  [  109.599328]  [<ffffffff8125950a>] sysfs_kf_write+0x3a/0x50
  [  109.599331]  [<ffffffff812586ae>] kernfs_fop_write+0xee/0x180
  [  109.599335]  [<ffffffff811e10b7>] vfs_write+0xb7/0x1f0
  [  109.599338]  [<ffffffff811e1c66>] SyS_write+0x46/0xb0
  [  109.599342]  [<ffffffff81789eed>] system_call_fastpath+0x1a/0x1f

  I initially reported this bug in openSUSE and it was fixed in the
  Linus tree:
  https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/commit/?id=06a41a9

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1403459/+subscriptions


References