yahoo-eng-team team mailing list archive
-
yahoo-eng-team team
-
Mailing list archive
-
Message #93173
[Bug 1970642] Re: evacuation failure causes POST_FAILURE in the nova-live-migration job
Reviewed: https://review.opendev.org/c/openstack/nova/+/901815
Committed: https://opendev.org/openstack/nova/commit/1f8ce3cc70ceaf8edd25b1ae2646e525cf2b3973
Submitter: "Zuul (22348)"
Branch: master
commit 1f8ce3cc70ceaf8edd25b1ae2646e525cf2b3973
Author: Rajesh Tailor <ratailor@xxxxxxxxxx>
Date: Fri Nov 24 13:25:25 2023 +0530
Add retry on attachment_update
In nova-live-migration job, evacuation failures are causing
POST_FAILURE.
As per discussion on bug, it looks like 500 is coming from cinder
which is cause of this failure.
Similar to attachment_delete method, this change adds a retry
mechanism in cinder API calls attachment_update method.
Closes-Bug: #1970642
Change-Id: I1da3c8481f7e7a1e8776cf03f5c4cf117b7fabaa
** Changed in: nova
Status: In Progress => Fix Released
--
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to OpenStack Compute (nova).
https://bugs.launchpad.net/bugs/1970642
Title:
evacuation failure causes POST_FAILURE in the nova-live-migration job
Status in OpenStack Compute (nova):
Fix Released
Bug description:
The nova-live-migration job has a post task to test evacuation. This
test fails intermittently causing the job to fail with POST_FAILURE
status.
Evacuation fails due to libvirt connection error.
job log:
2022-04-27 14:24:41.092806 | controller | + /opt/stack/nova/roles/run-
evacuate-hook/files/test_evacuate.sh:evacuate_and_wait_for_active:34 :
echo 'Timed out waiting for server evacuate-bfv-test to go to ACTIVE
status'
2022-04-27 14:24:41.092962 | controller | Timed out waiting for server
evacuate-bfv-test to go to ACTIVE status
nova-compute log:
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server [None
req-a012194e-8201-43fa-ae56-a9194a5fd5ad demo admin] Exception during
message handling: nova.exception.HypervisorUnavailable: Connection to
the hypervisor is broken on host
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server Traceback (most recent
call last):
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/opt/stack/nova/nova/virt/libvirt/host.py", line 586, in
get_connection
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server conn =
self._get_connection()
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/opt/stack/nova/nova/virt/libvirt/host.py", line 566, in
_get_connection
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server
self._queue_conn_event_handler(
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/usr/local/lib/python3.8/dist-packages/oslo_utils/excutils.py", line
227, in __exit__
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server
self.force_reraise()
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/usr/local/lib/python3.8/dist-packages/oslo_utils/excutils.py", line
200, in force_reraise
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server raise self.value
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/opt/stack/nova/nova/virt/libvirt/host.py", line 558, in
_get_connection
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server self._wrapped_conn
= self._get_new_connection()
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/opt/stack/nova/nova/virt/libvirt/host.py", line 502, in
_get_new_connection
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server wrapped_conn =
self._connect(self._uri, self._read_only)
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/opt/stack/nova/nova/virt/libvirt/host.py", line 353, in _connect
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server return
self._libvirt_proxy.openAuth(uri, auth, flags)
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/usr/local/lib/python3.8/dist-packages/eventlet/tpool.py", line 193,
in doit
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server result =
proxy_call(self._autowrap, f, *args, **kwargs)
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/usr/local/lib/python3.8/dist-packages/eventlet/tpool.py", line 151,
in proxy_call
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server rv = execute(f,
*args, **kwargs)
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/usr/local/lib/python3.8/dist-packages/eventlet/tpool.py", line 132,
in execute
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server six.reraise(c, e,
tb)
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/usr/local/lib/python3.8/dist-packages/six.py", line 719, in reraise
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server raise value
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/usr/local/lib/python3.8/dist-packages/eventlet/tpool.py", line 86,
in tworker
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server rv = meth(*args,
**kwargs)
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/usr/lib/python3/dist-packages/libvirt.py", line 104, in openAuth
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server if ret is
None:raise libvirtError('virConnectOpenAuth() failed')
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server libvirt.libvirtError:
Failed to connect socket to '/var/run/libvirt/libvirt-sock':
Connection refused
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server During handling of the
above exception, another exception occurred:
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server Traceback (most recent
call last):
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/usr/local/lib/python3.8/dist-packages/oslo_messaging/rpc/server.py",
line 165, in _process_incoming
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server res =
self.dispatcher.dispatch(message)
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/usr/local/lib/python3.8/dist-
packages/oslo_messaging/rpc/dispatcher.py", line 309, in dispatch
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server return
self._do_dispatch(endpoint, method, ctxt, args)
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/usr/local/lib/python3.8/dist-
packages/oslo_messaging/rpc/dispatcher.py", line 229, in _do_dispatch
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server result =
func(ctxt, **new_args)
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/usr/local/lib/python3.8/dist-packages/oslo_messaging/rpc/server.py",
line 241, in inner
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server return func(*args,
**kwargs)
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/opt/stack/nova/nova/exception_wrapper.py", line 71, in wrapped
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server
_emit_versioned_exception_notification(
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/usr/local/lib/python3.8/dist-packages/oslo_utils/excutils.py", line
227, in __exit__
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server
self.force_reraise()
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/usr/local/lib/python3.8/dist-packages/oslo_utils/excutils.py", line
200, in force_reraise
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server raise self.value
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/opt/stack/nova/nova/exception_wrapper.py", line 63, in wrapped
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server return f(self,
context, *args, **kw)
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/opt/stack/nova/nova/compute/manager.py", line 184, in
decorated_function
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server
LOG.warning("Failed to revert task state for instance. "
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/usr/local/lib/python3.8/dist-packages/oslo_utils/excutils.py", line
227, in __exit__
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server
self.force_reraise()
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/usr/local/lib/python3.8/dist-packages/oslo_utils/excutils.py", line
200, in force_reraise
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server raise self.value
Apr 27 14:13:41.050639 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/opt/stack/nova/nova/compute/manager.py", line 155, in
decorated_function
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server return
function(self, context, *args, **kwargs)
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/opt/stack/nova/nova/compute/utils.py", line 1439, in
decorated_function
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server return
function(self, context, *args, **kwargs)
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/opt/stack/nova/nova/compute/manager.py", line 212, in
decorated_function
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server
compute_utils.add_instance_fault_from_exc(context,
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/usr/local/lib/python3.8/dist-packages/oslo_utils/excutils.py", line
227, in __exit__
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server
self.force_reraise()
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/usr/local/lib/python3.8/dist-packages/oslo_utils/excutils.py", line
200, in force_reraise
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server raise self.value
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/opt/stack/nova/nova/compute/manager.py", line 201, in
decorated_function
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server return
function(self, context, *args, **kwargs)
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/opt/stack/nova/nova/compute/manager.py", line 3548, in
rebuild_instance
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server
self._do_rebuild_instance_with_claim(
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/opt/stack/nova/nova/compute/manager.py", line 3625, in
_do_rebuild_instance_with_claim
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server claim_context =
rebuild_claim(
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/usr/local/lib/python3.8/dist-
packages/oslo_concurrency/lockutils.py", line 391, in inner
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server return f(*args,
**kwargs)
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/opt/stack/nova/nova/compute/resource_tracker.py", line 204, in
rebuild_claim
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server return
self._move_claim(
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/opt/stack/nova/nova/compute/resource_tracker.py", line 280, in
_move_claim
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server if
self.disabled(nodename):
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/opt/stack/nova/nova/compute/resource_tracker.py", line 666, in
disabled
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server not
self.driver.node_is_available(nodename))
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/opt/stack/nova/nova/virt/driver.py", line 1564, in node_is_available
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server if nodename in
self.get_available_nodes():
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/opt/stack/nova/nova/virt/libvirt/driver.py", line 11037, in
get_available_nodes
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server return
[self._host.get_hostname()]
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/opt/stack/nova/nova/virt/libvirt/host.py", line 1051, in
get_hostname
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server hostname =
self.get_connection().getHostname()
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server File
"/opt/stack/nova/nova/virt/libvirt/host.py", line 596, in
get_connection
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server raise
exception.HypervisorUnavailable()
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server
nova.exception.HypervisorUnavailable: Connection to the hypervisor is
broken on host
Apr 27 14:13:41.061728 ubuntu-focal-iweb-mtl01-0029462367 nova-
compute[97475]: ERROR oslo_messaging.rpc.server
Example failure:
https://zuul.opendev.org/t/openstack/build/6c792e4ee72542cbaf386710e57a4b01/log/controller/logs/screen-
n-cpu.txt#12329
I see multiple matching failures from the last two weeks:
https://paste.opendev.org/show/bbB6cuBQBGSUPyAxPXva/
To manage notifications about this bug go to:
https://bugs.launchpad.net/nova/+bug/1970642/+subscriptions
References