yahoo-eng-team team mailing list archive
-
yahoo-eng-team team
-
Mailing list archive
-
Message #87616
[Bug 1950310] [NEW] guest kernel panic in live migration test
Public bug reported:
There are various kernel panics visible in the guest in the nova-live-
migration job. It is so far mostly visible on stable/victoria .
Example run:
https://zuul.opendev.org/t/openstack/build/67c89daf17e3475cb1d632f87beeb60d/logs
Nova stack trace:
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall [-] Dynamic interval
looping call
'oslo_service.loopingcall.RetryDecorator.__call__.<locals>._func'
failed: nova.exception.DeviceDetachFailed: Device detach failed for vdb:
Unable to detach the device from the live config.
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall Traceback (most recent
call last):
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/oslo_service/loopingcall.py",
line 150, in _run_loop
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall result =
func(*self.args, **self.kw)
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/oslo_service/loopingcall.py",
line 428, in _func
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall return
self._sleep_time
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/oslo_utils/excutils.py", line
220, in __exit__
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall self.force_reraise()
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/oslo_utils/excutils.py", line
196, in force_reraise
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall
six.reraise(self.type_, self.value, self.tb)
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/six.py", line 703, in reraise
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall raise value
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/oslo_service/loopingcall.py",
line 407, in _func
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall result = f(*args,
**kwargs)
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/opt/stack/nova/nova/virt/libvirt/guest.py", line 483, in
_do_wait_and_retry_detach
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall raise
exception.DeviceDetachFailed(
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall
nova.exception.DeviceDetachFailed: Device detach failed for vdb: Unable
to detach the device from the live config.
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: WARNING nova.virt.block_device [None
req-9e67da96-3f0f-4e77-8bc3-0682b443b86c tempest-
LiveAutoBlockMigrationV225Test-2108607333 tempest-
LiveAutoBlockMigrationV225Test-2108607333-project] [instance:
06c70e38-8f0e-43a9-b783-cda99fa6b5e0] Guest refused to detach volume
8806a6e3-b560-4f72-8f1b-8cef2c257fa1: nova.exception.DeviceDetachFailed:
Device detach failed for vdb: Unable to detach the device from the live
config.
https://zuul.opendev.org/t/openstack/build/67c89daf17e3475cb1d632f87beeb60d/log/compute1/logs/screen-
n-cpu.txt#10396-10491
Guest kernel panic:
2021-11-08 16:41:35,132 99547 DEBUG [tempest.common.waiters] Console output for 06c70e38-8f0e-43a9-b783-cda99fa6b5e0
body=
[ 15.293919] kernel tried to execute NX-protected page - exploit attempt? (uid: 0)
[ 15.298512] BUG: unable to handle page fault for address: ffff91bdc256c400
[ 15.299353] #PF: supervisor instruction fetch in kernel mode
[ 15.299943] #PF: error_code(0x0011) - permissions violation
[ 15.300902] PGD 5e01067 P4D 5e01067 PUD 5e02067 PMD 80000000024001e3
[ 15.302056] Oops: 0011 [#1] SMP NOPTI
[ 15.302770] CPU: 0 PID: 9 Comm: ksoftirqd/0 Not tainted 5.3.0-26-generic #28~18.04.1-Ubuntu
[ 15.303549] Hardware name: OpenStack Foundation OpenStack Nova, BIOS 1.13.0-1ubuntu1.1 04/01/2014
[ 15.305100] RIP: 0010:0xffff91bdc256c400
[ 15.305758] Code: 6f 72 69 74 79 20 66 6f 72 20 25 64 20 28 25 73 29 00 25 64 20 28 25 73 29 20 6f 6c 64 20 70 72 69 6f 72 69 74 79 20 25 64 2c <20> 6e 65 77 20 70 72 69 6f 72 69 74 79 20 25 64 0a 00 70 72 6f 63
[ 15.307322] RSP: 0018:ffffa59240053de8 EFLAGS: 00000292
[ 15.307835] RAX: ffff91bdc256c400 RBX: ffff91bdc762b4c0 RCX: ffff91bdc3239900
[ 15.308457] RDX: 0000000000000400 RSI: ffffa59240053df8 RDI: ffff91bdc67e6d88
[ 15.309109] RBP: ffffa59240053e48 R08: 0000000000000000 R09: 0000000000000001
[ 15.309725] R10: ffffa59240053bc0 R11: 000000000430d5a0 R12: ffff91bdc762b510
[ 15.310344] R13: ffffa59240053df8 R14: 000000000000000a R15: 0000000000000202
[ 15.311176] FS: 0000000000000000(0000) GS:ffff91bdc7600000(0000) knlGS:0000000000000000
[ 15.311892] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 15.312403] CR2: ffff91bdc256c400 CR3: 0000000002f6c000 CR4: 00000000000006f0
[ 15.313336] Call Trace:
[ 15.314947] ? rcu_core+0x2b5/0x450
[ 15.315635] rcu_core_si+0xe/0x10
[ 15.316138] __do_softirq+0xe4/0x2da
[ 15.316573] run_ksoftirqd+0x2b/0x40
[ 15.316952] smpboot_thread_fn+0xfc/0x170
[ 15.317326] kthread+0x121/0x140
[ 15.317648] ? sort_range+0x30/0x30
[ 15.317980] ? kthread_park+0xb0/0xb0
[ 15.318337] ret_from_fork+0x35/0x40
[ 15.318819] Modules linked in: ip_tables x_tables nls_utf8 nls_iso8859_1 nls_ascii isofs hid_generic usbhid hid virtio_rng virtio_gpu drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops ttm drm virtio_scsi virtio_net net_failover failover virtio_input virtio_blk qemu_fw_cfg 9pnet_virtio 9pnet pcnet32 8139cp mii ne2k_pci 8390 e1000
[ 15.322811] CR2: ffff91bdc256c400
[ 15.324234] ---[ end trace 73d738baa971ca73 ]---
[ 15.324797] RIP: 0010:0xffff91bdc256c400
[ 15.325176] Code: 6f 72 69 74 79 20 66 6f 72 20 25 64 20 28 25 73 29 00 25 64 20 28 25 73 29 20 6f 6c 64 20 70 72 69 6f 72 69 74 79 20 25 64 2c <20> 6e 65 77 20 70 72 69 6f 72 69 74 79 20 25 64 0a 00 70 72 6f 63
[ 15.326679] RSP: 0018:ffffa59240053de8 EFLAGS: 00000292
[ 15.327142] RAX: ffff91bdc256c400 RBX: ffff91bdc762b4c0 RCX: ffff91bdc3239900
[ 15.327742] RDX: 0000000000000400 RSI: ffffa59240053df8 RDI: ffff91bdc67e6d88
[ 15.328342] RBP: ffffa59240053e48 R08: 0000000000000000 R09: 0000000000000001
[ 15.328964] R10: ffffa59240053bc0 R11: 000000000430d5a0 R12: ffff91bdc762b510
[ 15.329563] R13: ffffa59240053df8 R14: 000000000000000a R15: 0000000000000202
[ 15.330167] FS: 0000000000000000(0000) GS:ffff91bdc7600000(0000) knlGS:0000000000000000
[ 15.330854] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 15.331348] CR2: ffff91bdc256c400 CR3: 0000000002f6c000 CR4: 00000000000006f0
[ 15.332107] Kernel panic - not syncing: Fatal exception in interrupt
[ 15.333470] Kernel Offset: 0x34800000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
[ 15.334628] ---[ end Kernel panic - not syncing: Fatal exception in interrupt ]---
Additional hits: https://paste.opendev.org/show/810868/
** Affects: nova
Importance: Undecided
Status: New
** Affects: nova/victoria
Importance: Undecided
Status: New
** Tags: gate-failure
** Also affects: nova/victoria
Importance: Undecided
Status: New
** Description changed:
- There is various kernel panics are visible in the guest in the nova-
- live-migration job. It was so far visible on stable/victoria only.
+ There are various kernel panics visible in the guest in the nova-live-
+ migration job. It was so far visible on stable/victoria only.
Example run:
https://zuul.opendev.org/t/openstack/build/67c89daf17e3475cb1d632f87beeb60d/logs
Nova stack trace:
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall [-] Dynamic interval
looping call
'oslo_service.loopingcall.RetryDecorator.__call__.<locals>._func'
failed: nova.exception.DeviceDetachFailed: Device detach failed for vdb:
Unable to detach the device from the live config.
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall Traceback (most recent
call last):
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/oslo_service/loopingcall.py",
line 150, in _run_loop
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall result =
func(*self.args, **self.kw)
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/oslo_service/loopingcall.py",
line 428, in _func
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall return
self._sleep_time
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/oslo_utils/excutils.py", line
220, in __exit__
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall self.force_reraise()
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/oslo_utils/excutils.py", line
196, in force_reraise
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall
six.reraise(self.type_, self.value, self.tb)
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/six.py", line 703, in reraise
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall raise value
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/oslo_service/loopingcall.py",
line 407, in _func
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall result = f(*args,
**kwargs)
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/opt/stack/nova/nova/virt/libvirt/guest.py", line 483, in
_do_wait_and_retry_detach
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall raise
exception.DeviceDetachFailed(
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall
nova.exception.DeviceDetachFailed: Device detach failed for vdb: Unable
to detach the device from the live config.
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: WARNING nova.virt.block_device [None
req-9e67da96-3f0f-4e77-8bc3-0682b443b86c tempest-
LiveAutoBlockMigrationV225Test-2108607333 tempest-
LiveAutoBlockMigrationV225Test-2108607333-project] [instance:
06c70e38-8f0e-43a9-b783-cda99fa6b5e0] Guest refused to detach volume
8806a6e3-b560-4f72-8f1b-8cef2c257fa1: nova.exception.DeviceDetachFailed:
Device detach failed for vdb: Unable to detach the device from the live
config.
-
- https://zuul.opendev.org/t/openstack/build/67c89daf17e3475cb1d632f87beeb60d/log/compute1/logs/screen-n-cpu.txt#10396-10491
+ https://zuul.opendev.org/t/openstack/build/67c89daf17e3475cb1d632f87beeb60d/log/compute1/logs/screen-
+ n-cpu.txt#10396-10491
Guest kernel panic:
2021-11-08 16:41:35,132 99547 DEBUG [tempest.common.waiters] Console output for 06c70e38-8f0e-43a9-b783-cda99fa6b5e0
body=
[ 15.293919] kernel tried to execute NX-protected page - exploit attempt? (uid: 0)
[ 15.298512] BUG: unable to handle page fault for address: ffff91bdc256c400
[ 15.299353] #PF: supervisor instruction fetch in kernel mode
[ 15.299943] #PF: error_code(0x0011) - permissions violation
- [ 15.300902] PGD 5e01067 P4D 5e01067 PUD 5e02067 PMD 80000000024001e3
+ [ 15.300902] PGD 5e01067 P4D 5e01067 PUD 5e02067 PMD 80000000024001e3
[ 15.302056] Oops: 0011 [#1] SMP NOPTI
[ 15.302770] CPU: 0 PID: 9 Comm: ksoftirqd/0 Not tainted 5.3.0-26-generic #28~18.04.1-Ubuntu
[ 15.303549] Hardware name: OpenStack Foundation OpenStack Nova, BIOS 1.13.0-1ubuntu1.1 04/01/2014
[ 15.305100] RIP: 0010:0xffff91bdc256c400
[ 15.305758] Code: 6f 72 69 74 79 20 66 6f 72 20 25 64 20 28 25 73 29 00 25 64 20 28 25 73 29 20 6f 6c 64 20 70 72 69 6f 72 69 74 79 20 25 64 2c <20> 6e 65 77 20 70 72 69 6f 72 69 74 79 20 25 64 0a 00 70 72 6f 63
[ 15.307322] RSP: 0018:ffffa59240053de8 EFLAGS: 00000292
[ 15.307835] RAX: ffff91bdc256c400 RBX: ffff91bdc762b4c0 RCX: ffff91bdc3239900
[ 15.308457] RDX: 0000000000000400 RSI: ffffa59240053df8 RDI: ffff91bdc67e6d88
[ 15.309109] RBP: ffffa59240053e48 R08: 0000000000000000 R09: 0000000000000001
[ 15.309725] R10: ffffa59240053bc0 R11: 000000000430d5a0 R12: ffff91bdc762b510
[ 15.310344] R13: ffffa59240053df8 R14: 000000000000000a R15: 0000000000000202
[ 15.311176] FS: 0000000000000000(0000) GS:ffff91bdc7600000(0000) knlGS:0000000000000000
[ 15.311892] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 15.312403] CR2: ffff91bdc256c400 CR3: 0000000002f6c000 CR4: 00000000000006f0
[ 15.313336] Call Trace:
[ 15.314947] ? rcu_core+0x2b5/0x450
[ 15.315635] rcu_core_si+0xe/0x10
[ 15.316138] __do_softirq+0xe4/0x2da
[ 15.316573] run_ksoftirqd+0x2b/0x40
[ 15.316952] smpboot_thread_fn+0xfc/0x170
[ 15.317326] kthread+0x121/0x140
[ 15.317648] ? sort_range+0x30/0x30
[ 15.317980] ? kthread_park+0xb0/0xb0
[ 15.318337] ret_from_fork+0x35/0x40
[ 15.318819] Modules linked in: ip_tables x_tables nls_utf8 nls_iso8859_1 nls_ascii isofs hid_generic usbhid hid virtio_rng virtio_gpu drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops ttm drm virtio_scsi virtio_net net_failover failover virtio_input virtio_blk qemu_fw_cfg 9pnet_virtio 9pnet pcnet32 8139cp mii ne2k_pci 8390 e1000
[ 15.322811] CR2: ffff91bdc256c400
[ 15.324234] ---[ end trace 73d738baa971ca73 ]---
[ 15.324797] RIP: 0010:0xffff91bdc256c400
[ 15.325176] Code: 6f 72 69 74 79 20 66 6f 72 20 25 64 20 28 25 73 29 00 25 64 20 28 25 73 29 20 6f 6c 64 20 70 72 69 6f 72 69 74 79 20 25 64 2c <20> 6e 65 77 20 70 72 69 6f 72 69 74 79 20 25 64 0a 00 70 72 6f 63
[ 15.326679] RSP: 0018:ffffa59240053de8 EFLAGS: 00000292
[ 15.327142] RAX: ffff91bdc256c400 RBX: ffff91bdc762b4c0 RCX: ffff91bdc3239900
[ 15.327742] RDX: 0000000000000400 RSI: ffffa59240053df8 RDI: ffff91bdc67e6d88
[ 15.328342] RBP: ffffa59240053e48 R08: 0000000000000000 R09: 0000000000000001
[ 15.328964] R10: ffffa59240053bc0 R11: 000000000430d5a0 R12: ffff91bdc762b510
[ 15.329563] R13: ffffa59240053df8 R14: 000000000000000a R15: 0000000000000202
[ 15.330167] FS: 0000000000000000(0000) GS:ffff91bdc7600000(0000) knlGS:0000000000000000
[ 15.330854] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 15.331348] CR2: ffff91bdc256c400 CR3: 0000000002f6c000 CR4: 00000000000006f0
[ 15.332107] Kernel panic - not syncing: Fatal exception in interrupt
[ 15.333470] Kernel Offset: 0x34800000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
[ 15.334628] ---[ end Kernel panic - not syncing: Fatal exception in interrupt ]---
Additional hits: https://paste.opendev.org/show/810868/
** Description changed:
There are various kernel panics visible in the guest in the nova-live-
- migration job. It was so far visible on stable/victoria only.
+ migration job. It is so far mostly visible on stable/victoria .
Example run:
https://zuul.opendev.org/t/openstack/build/67c89daf17e3475cb1d632f87beeb60d/logs
Nova stack trace:
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall [-] Dynamic interval
looping call
'oslo_service.loopingcall.RetryDecorator.__call__.<locals>._func'
failed: nova.exception.DeviceDetachFailed: Device detach failed for vdb:
Unable to detach the device from the live config.
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall Traceback (most recent
call last):
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/oslo_service/loopingcall.py",
line 150, in _run_loop
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall result =
func(*self.args, **self.kw)
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/oslo_service/loopingcall.py",
line 428, in _func
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall return
self._sleep_time
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/oslo_utils/excutils.py", line
220, in __exit__
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall self.force_reraise()
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/oslo_utils/excutils.py", line
196, in force_reraise
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall
six.reraise(self.type_, self.value, self.tb)
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/six.py", line 703, in reraise
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall raise value
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/oslo_service/loopingcall.py",
line 407, in _func
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall result = f(*args,
**kwargs)
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/opt/stack/nova/nova/virt/libvirt/guest.py", line 483, in
_do_wait_and_retry_detach
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall raise
exception.DeviceDetachFailed(
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall
nova.exception.DeviceDetachFailed: Device detach failed for vdb: Unable
to detach the device from the live config.
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: WARNING nova.virt.block_device [None
req-9e67da96-3f0f-4e77-8bc3-0682b443b86c tempest-
LiveAutoBlockMigrationV225Test-2108607333 tempest-
LiveAutoBlockMigrationV225Test-2108607333-project] [instance:
06c70e38-8f0e-43a9-b783-cda99fa6b5e0] Guest refused to detach volume
8806a6e3-b560-4f72-8f1b-8cef2c257fa1: nova.exception.DeviceDetachFailed:
Device detach failed for vdb: Unable to detach the device from the live
config.
https://zuul.opendev.org/t/openstack/build/67c89daf17e3475cb1d632f87beeb60d/log/compute1/logs/screen-
n-cpu.txt#10396-10491
Guest kernel panic:
2021-11-08 16:41:35,132 99547 DEBUG [tempest.common.waiters] Console output for 06c70e38-8f0e-43a9-b783-cda99fa6b5e0
body=
[ 15.293919] kernel tried to execute NX-protected page - exploit attempt? (uid: 0)
[ 15.298512] BUG: unable to handle page fault for address: ffff91bdc256c400
[ 15.299353] #PF: supervisor instruction fetch in kernel mode
[ 15.299943] #PF: error_code(0x0011) - permissions violation
[ 15.300902] PGD 5e01067 P4D 5e01067 PUD 5e02067 PMD 80000000024001e3
[ 15.302056] Oops: 0011 [#1] SMP NOPTI
[ 15.302770] CPU: 0 PID: 9 Comm: ksoftirqd/0 Not tainted 5.3.0-26-generic #28~18.04.1-Ubuntu
[ 15.303549] Hardware name: OpenStack Foundation OpenStack Nova, BIOS 1.13.0-1ubuntu1.1 04/01/2014
[ 15.305100] RIP: 0010:0xffff91bdc256c400
[ 15.305758] Code: 6f 72 69 74 79 20 66 6f 72 20 25 64 20 28 25 73 29 00 25 64 20 28 25 73 29 20 6f 6c 64 20 70 72 69 6f 72 69 74 79 20 25 64 2c <20> 6e 65 77 20 70 72 69 6f 72 69 74 79 20 25 64 0a 00 70 72 6f 63
[ 15.307322] RSP: 0018:ffffa59240053de8 EFLAGS: 00000292
[ 15.307835] RAX: ffff91bdc256c400 RBX: ffff91bdc762b4c0 RCX: ffff91bdc3239900
[ 15.308457] RDX: 0000000000000400 RSI: ffffa59240053df8 RDI: ffff91bdc67e6d88
[ 15.309109] RBP: ffffa59240053e48 R08: 0000000000000000 R09: 0000000000000001
[ 15.309725] R10: ffffa59240053bc0 R11: 000000000430d5a0 R12: ffff91bdc762b510
[ 15.310344] R13: ffffa59240053df8 R14: 000000000000000a R15: 0000000000000202
[ 15.311176] FS: 0000000000000000(0000) GS:ffff91bdc7600000(0000) knlGS:0000000000000000
[ 15.311892] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 15.312403] CR2: ffff91bdc256c400 CR3: 0000000002f6c000 CR4: 00000000000006f0
[ 15.313336] Call Trace:
[ 15.314947] ? rcu_core+0x2b5/0x450
[ 15.315635] rcu_core_si+0xe/0x10
[ 15.316138] __do_softirq+0xe4/0x2da
[ 15.316573] run_ksoftirqd+0x2b/0x40
[ 15.316952] smpboot_thread_fn+0xfc/0x170
[ 15.317326] kthread+0x121/0x140
[ 15.317648] ? sort_range+0x30/0x30
[ 15.317980] ? kthread_park+0xb0/0xb0
[ 15.318337] ret_from_fork+0x35/0x40
[ 15.318819] Modules linked in: ip_tables x_tables nls_utf8 nls_iso8859_1 nls_ascii isofs hid_generic usbhid hid virtio_rng virtio_gpu drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops ttm drm virtio_scsi virtio_net net_failover failover virtio_input virtio_blk qemu_fw_cfg 9pnet_virtio 9pnet pcnet32 8139cp mii ne2k_pci 8390 e1000
[ 15.322811] CR2: ffff91bdc256c400
[ 15.324234] ---[ end trace 73d738baa971ca73 ]---
[ 15.324797] RIP: 0010:0xffff91bdc256c400
[ 15.325176] Code: 6f 72 69 74 79 20 66 6f 72 20 25 64 20 28 25 73 29 00 25 64 20 28 25 73 29 20 6f 6c 64 20 70 72 69 6f 72 69 74 79 20 25 64 2c <20> 6e 65 77 20 70 72 69 6f 72 69 74 79 20 25 64 0a 00 70 72 6f 63
[ 15.326679] RSP: 0018:ffffa59240053de8 EFLAGS: 00000292
[ 15.327142] RAX: ffff91bdc256c400 RBX: ffff91bdc762b4c0 RCX: ffff91bdc3239900
[ 15.327742] RDX: 0000000000000400 RSI: ffffa59240053df8 RDI: ffff91bdc67e6d88
[ 15.328342] RBP: ffffa59240053e48 R08: 0000000000000000 R09: 0000000000000001
[ 15.328964] R10: ffffa59240053bc0 R11: 000000000430d5a0 R12: ffff91bdc762b510
[ 15.329563] R13: ffffa59240053df8 R14: 000000000000000a R15: 0000000000000202
[ 15.330167] FS: 0000000000000000(0000) GS:ffff91bdc7600000(0000) knlGS:0000000000000000
[ 15.330854] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 15.331348] CR2: ffff91bdc256c400 CR3: 0000000002f6c000 CR4: 00000000000006f0
[ 15.332107] Kernel panic - not syncing: Fatal exception in interrupt
[ 15.333470] Kernel Offset: 0x34800000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
[ 15.334628] ---[ end Kernel panic - not syncing: Fatal exception in interrupt ]---
Additional hits: https://paste.opendev.org/show/810868/
** Tags added: gate-failure
--
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to OpenStack Compute (nova).
https://bugs.launchpad.net/bugs/1950310
Title:
guest kernel panic in live migration test
Status in OpenStack Compute (nova):
New
Status in OpenStack Compute (nova) victoria series:
New
Bug description:
There are various kernel panics visible in the guest in the nova-live-
migration job. It is so far mostly visible on stable/victoria .
Example run:
https://zuul.opendev.org/t/openstack/build/67c89daf17e3475cb1d632f87beeb60d/logs
Nova stack trace:
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall [-] Dynamic interval
looping call
'oslo_service.loopingcall.RetryDecorator.__call__.<locals>._func'
failed: nova.exception.DeviceDetachFailed: Device detach failed for
vdb: Unable to detach the device from the live config.
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall Traceback (most recent
call last):
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/oslo_service/loopingcall.py",
line 150, in _run_loop
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall result =
func(*self.args, **self.kw)
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/oslo_service/loopingcall.py",
line 428, in _func
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall return
self._sleep_time
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/oslo_utils/excutils.py", line
220, in __exit__
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall
self.force_reraise()
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/oslo_utils/excutils.py", line
196, in force_reraise
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall
six.reraise(self.type_, self.value, self.tb)
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/six.py", line 703, in reraise
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall raise value
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/usr/local/lib/python3.8/dist-packages/oslo_service/loopingcall.py",
line 407, in _func
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall result = f(*args,
**kwargs)
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall File
"/opt/stack/nova/nova/virt/libvirt/guest.py", line 483, in
_do_wait_and_retry_detach
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall raise
exception.DeviceDetachFailed(
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall
nova.exception.DeviceDetachFailed: Device detach failed for vdb:
Unable to detach the device from the live config.
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: ERROR oslo.service.loopingcall
Nov 08 16:43:33.983139 ubuntu-focal-rax-iad-0027262323 nova-
compute[52781]: WARNING nova.virt.block_device [None
req-9e67da96-3f0f-4e77-8bc3-0682b443b86c tempest-
LiveAutoBlockMigrationV225Test-2108607333 tempest-
LiveAutoBlockMigrationV225Test-2108607333-project] [instance:
06c70e38-8f0e-43a9-b783-cda99fa6b5e0] Guest refused to detach volume
8806a6e3-b560-4f72-8f1b-8cef2c257fa1:
nova.exception.DeviceDetachFailed: Device detach failed for vdb:
Unable to detach the device from the live config.
https://zuul.opendev.org/t/openstack/build/67c89daf17e3475cb1d632f87beeb60d/log/compute1/logs/screen-
n-cpu.txt#10396-10491
Guest kernel panic:
2021-11-08 16:41:35,132 99547 DEBUG [tempest.common.waiters] Console output for 06c70e38-8f0e-43a9-b783-cda99fa6b5e0
body=
[ 15.293919] kernel tried to execute NX-protected page - exploit attempt? (uid: 0)
[ 15.298512] BUG: unable to handle page fault for address: ffff91bdc256c400
[ 15.299353] #PF: supervisor instruction fetch in kernel mode
[ 15.299943] #PF: error_code(0x0011) - permissions violation
[ 15.300902] PGD 5e01067 P4D 5e01067 PUD 5e02067 PMD 80000000024001e3
[ 15.302056] Oops: 0011 [#1] SMP NOPTI
[ 15.302770] CPU: 0 PID: 9 Comm: ksoftirqd/0 Not tainted 5.3.0-26-generic #28~18.04.1-Ubuntu
[ 15.303549] Hardware name: OpenStack Foundation OpenStack Nova, BIOS 1.13.0-1ubuntu1.1 04/01/2014
[ 15.305100] RIP: 0010:0xffff91bdc256c400
[ 15.305758] Code: 6f 72 69 74 79 20 66 6f 72 20 25 64 20 28 25 73 29 00 25 64 20 28 25 73 29 20 6f 6c 64 20 70 72 69 6f 72 69 74 79 20 25 64 2c <20> 6e 65 77 20 70 72 69 6f 72 69 74 79 20 25 64 0a 00 70 72 6f 63
[ 15.307322] RSP: 0018:ffffa59240053de8 EFLAGS: 00000292
[ 15.307835] RAX: ffff91bdc256c400 RBX: ffff91bdc762b4c0 RCX: ffff91bdc3239900
[ 15.308457] RDX: 0000000000000400 RSI: ffffa59240053df8 RDI: ffff91bdc67e6d88
[ 15.309109] RBP: ffffa59240053e48 R08: 0000000000000000 R09: 0000000000000001
[ 15.309725] R10: ffffa59240053bc0 R11: 000000000430d5a0 R12: ffff91bdc762b510
[ 15.310344] R13: ffffa59240053df8 R14: 000000000000000a R15: 0000000000000202
[ 15.311176] FS: 0000000000000000(0000) GS:ffff91bdc7600000(0000) knlGS:0000000000000000
[ 15.311892] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 15.312403] CR2: ffff91bdc256c400 CR3: 0000000002f6c000 CR4: 00000000000006f0
[ 15.313336] Call Trace:
[ 15.314947] ? rcu_core+0x2b5/0x450
[ 15.315635] rcu_core_si+0xe/0x10
[ 15.316138] __do_softirq+0xe4/0x2da
[ 15.316573] run_ksoftirqd+0x2b/0x40
[ 15.316952] smpboot_thread_fn+0xfc/0x170
[ 15.317326] kthread+0x121/0x140
[ 15.317648] ? sort_range+0x30/0x30
[ 15.317980] ? kthread_park+0xb0/0xb0
[ 15.318337] ret_from_fork+0x35/0x40
[ 15.318819] Modules linked in: ip_tables x_tables nls_utf8 nls_iso8859_1 nls_ascii isofs hid_generic usbhid hid virtio_rng virtio_gpu drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops ttm drm virtio_scsi virtio_net net_failover failover virtio_input virtio_blk qemu_fw_cfg 9pnet_virtio 9pnet pcnet32 8139cp mii ne2k_pci 8390 e1000
[ 15.322811] CR2: ffff91bdc256c400
[ 15.324234] ---[ end trace 73d738baa971ca73 ]---
[ 15.324797] RIP: 0010:0xffff91bdc256c400
[ 15.325176] Code: 6f 72 69 74 79 20 66 6f 72 20 25 64 20 28 25 73 29 00 25 64 20 28 25 73 29 20 6f 6c 64 20 70 72 69 6f 72 69 74 79 20 25 64 2c <20> 6e 65 77 20 70 72 69 6f 72 69 74 79 20 25 64 0a 00 70 72 6f 63
[ 15.326679] RSP: 0018:ffffa59240053de8 EFLAGS: 00000292
[ 15.327142] RAX: ffff91bdc256c400 RBX: ffff91bdc762b4c0 RCX: ffff91bdc3239900
[ 15.327742] RDX: 0000000000000400 RSI: ffffa59240053df8 RDI: ffff91bdc67e6d88
[ 15.328342] RBP: ffffa59240053e48 R08: 0000000000000000 R09: 0000000000000001
[ 15.328964] R10: ffffa59240053bc0 R11: 000000000430d5a0 R12: ffff91bdc762b510
[ 15.329563] R13: ffffa59240053df8 R14: 000000000000000a R15: 0000000000000202
[ 15.330167] FS: 0000000000000000(0000) GS:ffff91bdc7600000(0000) knlGS:0000000000000000
[ 15.330854] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 15.331348] CR2: ffff91bdc256c400 CR3: 0000000002f6c000 CR4: 00000000000006f0
[ 15.332107] Kernel panic - not syncing: Fatal exception in interrupt
[ 15.333470] Kernel Offset: 0x34800000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
[ 15.334628] ---[ end Kernel panic - not syncing: Fatal exception in interrupt ]---
Additional hits: https://paste.opendev.org/show/810868/
To manage notifications about this bug go to:
https://bugs.launchpad.net/nova/+bug/1950310/+subscriptions
Follow ups