yahoo-eng-team team mailing list archive
-
yahoo-eng-team team
-
Mailing list archive
-
Message #81992
[Bug 1868234] [NEW] nova-live-migration evacuation fails if volumes created on subnode c-vol backend
Public bug reported:
Description
===========
I8af2ad741ca08c3d88efb9aa817c4d1470491a23 has started to correctly fence
the subnode during evacuation testing. However it missed that we deploy
c-vol and g-api on these nodes. As a result during BFV evacuation
testing we will fail if the volume has been created on the subnode
c-vol.
https://zuul.opendev.org/t/openstack/build/c78d3ab4e6a748b4a53c6ff6dc273106/log/logs/screen-n-cpu.txt#7060
Mar 19 19:43:26.844295 ubuntu-bionic-rax-ord-0015339373 nova-
compute[9838]: ERROR nova.compute.manager [None req-
512a96c8-8b32-49c7-8d29-7ff300ed4482 demo admin] [instance:
702ff125-d947-4a28-853b-82dcd58b990e] Setting instance vm_state to
ERROR: ClientException: The server has either erred or is incapable of
performing the requested operation. (HTTP 500)
https://zuul.opendev.org/t/openstack/build/c78d3ab4e6a748b4a53c6ff6dc273106/log/logs/screen-c-api.txt#1936
Mar 19 19:43:26.262818 ubuntu-bionic-rax-ord-0015339373 devstack@c-api.service[27200]: ERROR cinder.api.middleware.fault [req-512a96c8-8b32-49c7-8d29-7ff300ed4482 req-826f7c01-3c02-4d9e-9046-8a15d7fa9b61 demo admin] Caught error: <class 'oslo_messaging.exceptions.MessagingTimeout'> Timed out waiting for a reply to message ID 23fabce9b79441198fbe4fe71c0ac7ab: MessagingTimeout: Timed out waiting for a reply to message ID 23fabce9b79441198fbe4fe71c0ac7ab
Mar 19 19:43:26.262818 ubuntu-bionic-rax-ord-0015339373 devstack@c-api.service[27200]: ERROR
Ultimately we shouldn't run these services on the computes but for now
we should limit the services we stop on the subnode to n-cpu and q-agt.
Steps to reproduce
==================
Run nova-live-migration, if volumes are created on the subnode evacuation testing will fail.
Expected result
===============
nova-live-migration passes.
Actual result
=============
nova-live-migration fails.
Environment
===========
1. Exact version of OpenStack you are running. See the following
list for all releases: http://docs.openstack.org/releases/
Master or stabe/train with I8af2ad741ca08c3d88efb9aa817c4d1470491a23
applied.
2. Which hypervisor did you use?
(For example: Libvirt + KVM, Libvirt + XEN, Hyper-V, PowerKVM, ...)
What's the version of that?
Libvirt + KVM
2. Which storage type did you use?
(For example: Ceph, LVM, GPFS, ...)
What's the version of that?
N/A
3. Which networking type did you use?
(For example: nova-network, Neutron with OpenVSwitch, ...)
N/A
** Affects: nova
Importance: High
Assignee: Lee Yarwood (lyarwood)
Status: In Progress
** Affects: nova/stein
Importance: Undecided
Status: New
** Affects: nova/train
Importance: Undecided
Status: New
** Affects: nova/ussuri
Importance: High
Assignee: Lee Yarwood (lyarwood)
Status: In Progress
** Tags: evacuate live-migration volumes
--
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to OpenStack Compute (nova).
https://bugs.launchpad.net/bugs/1868234
Title:
nova-live-migration evacuation fails if volumes created on subnode
c-vol backend
Status in OpenStack Compute (nova):
In Progress
Status in OpenStack Compute (nova) stein series:
New
Status in OpenStack Compute (nova) train series:
New
Status in OpenStack Compute (nova) ussuri series:
In Progress
Bug description:
Description
===========
I8af2ad741ca08c3d88efb9aa817c4d1470491a23 has started to correctly
fence the subnode during evacuation testing. However it missed that we
deploy c-vol and g-api on these nodes. As a result during BFV
evacuation testing we will fail if the volume has been created on the
subnode c-vol.
https://zuul.opendev.org/t/openstack/build/c78d3ab4e6a748b4a53c6ff6dc273106/log/logs/screen-n-cpu.txt#7060
Mar 19 19:43:26.844295 ubuntu-bionic-rax-ord-0015339373 nova-
compute[9838]: ERROR nova.compute.manager [None req-
512a96c8-8b32-49c7-8d29-7ff300ed4482 demo admin] [instance:
702ff125-d947-4a28-853b-82dcd58b990e] Setting instance vm_state to
ERROR: ClientException: The server has either erred or is incapable of
performing the requested operation. (HTTP 500)
https://zuul.opendev.org/t/openstack/build/c78d3ab4e6a748b4a53c6ff6dc273106/log/logs/screen-c-api.txt#1936
Mar 19 19:43:26.262818 ubuntu-bionic-rax-ord-0015339373 devstack@c-api.service[27200]: ERROR cinder.api.middleware.fault [req-512a96c8-8b32-49c7-8d29-7ff300ed4482 req-826f7c01-3c02-4d9e-9046-8a15d7fa9b61 demo admin] Caught error: <class 'oslo_messaging.exceptions.MessagingTimeout'> Timed out waiting for a reply to message ID 23fabce9b79441198fbe4fe71c0ac7ab: MessagingTimeout: Timed out waiting for a reply to message ID 23fabce9b79441198fbe4fe71c0ac7ab
Mar 19 19:43:26.262818 ubuntu-bionic-rax-ord-0015339373 devstack@c-api.service[27200]: ERROR
Ultimately we shouldn't run these services on the computes but for now
we should limit the services we stop on the subnode to n-cpu and
q-agt.
Steps to reproduce
==================
Run nova-live-migration, if volumes are created on the subnode evacuation testing will fail.
Expected result
===============
nova-live-migration passes.
Actual result
=============
nova-live-migration fails.
Environment
===========
1. Exact version of OpenStack you are running. See the following
list for all releases: http://docs.openstack.org/releases/
Master or stabe/train with
I8af2ad741ca08c3d88efb9aa817c4d1470491a23 applied.
2. Which hypervisor did you use?
(For example: Libvirt + KVM, Libvirt + XEN, Hyper-V, PowerKVM, ...)
What's the version of that?
Libvirt + KVM
2. Which storage type did you use?
(For example: Ceph, LVM, GPFS, ...)
What's the version of that?
N/A
3. Which networking type did you use?
(For example: nova-network, Neutron with OpenVSwitch, ...)
N/A
To manage notifications about this bug go to:
https://bugs.launchpad.net/nova/+bug/1868234/+subscriptions
Follow ups