yahoo-eng-team team mailing list archive
Message #58342
[Bug 1622854] Re: pci: double pci migration is putting vm in ERROR
Reviewed: https://review.openstack.org/369180
Committed: https://git.openstack.org/cgit/openstack/nova/commit/?id=6689c96cdfaa148037dc5df2baa058b09a075ec7
Submitter: Jenkins
Branch: master
commit 6689c96cdfaa148037dc5df2baa058b09a075ec7
Author: Moshe Levi <moshele@xxxxxxxxxxxx>
Date: Tue Sep 13 09:30:59 2016 +0300
pci: remove pci device from claims and allocations when freeing it
In drop_move_claim we free a PCI device when we need to drop a
specific device from the src/dest node. This is done by calling
the pci manager's free_device. The current code only updates the
device status to available in the database; it does not remove the
device from the pci manager's claims and allocations lists. This
patch adds that removal as well.
Closes-Bug: #1622854
Change-Id: If1cd6f3a635759cd55d116a34ca164630c61e085
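The change described in the commit message can be pictured with a small, self-contained model. This is only a sketch under assumptions: ToyDevice, ToyPciTracker and _forget are hypothetical names made up for illustration, and the code is not taken from nova's pci manager. It shows a free_device that resets the status and also drops the device from both bookkeeping tables.

```python
class ToyDevice:
    """Hypothetical stand-in for a PCI device record (not nova's PciDevice)."""

    def __init__(self, address):
        self.address = address
        self.status = "available"


class ToyPciTracker:
    """Hypothetical tracker with per-instance claims and allocations."""

    def __init__(self):
        self.claims = {}        # instance uuid -> list of claimed devices
        self.allocations = {}   # instance uuid -> list of allocated devices

    def claim(self, dev, instance_uuid):
        dev.status = "claimed"
        self.claims.setdefault(instance_uuid, []).append(dev)

    def allocate(self, dev, instance_uuid):
        # A claimed device becomes allocated and moves between the tables.
        self._forget(self.claims, dev, instance_uuid)
        dev.status = "allocated"
        self.allocations.setdefault(instance_uuid, []).append(dev)

    def free_device(self, dev, instance_uuid):
        # Reset the status AND remove the device from both tables,
        # which is the behaviour the commit message says was missing.
        dev.status = "available"
        self._forget(self.claims, dev, instance_uuid)
        self._forget(self.allocations, dev, instance_uuid)

    @staticmethod
    def _forget(table, dev, instance_uuid):
        devs = table.get(instance_uuid, [])
        if dev in devs:
            devs.remove(dev)
        if not devs:
            # Drop the empty entry so later sweeps do not see the instance.
            table.pop(instance_uuid, None)


tracker = ToyPciTracker()
dev = ToyDevice("0000:03:00.5")
tracker.claim(dev, "instance-1")
tracker.allocate(dev, "instance-1")
tracker.free_device(dev, "instance-1")
```

After free_device returns, the toy tracker holds no reference to the device, so a later cleanup pass has nothing stale to free.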
** Changed in: nova
Status: In Progress => Fix Released
--
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to OpenStack Compute (nova).
https://bugs.launchpad.net/bugs/1622854
Title:
pci: double pci migration is putting vm in ERROR
Status in OpenStack Compute (nova):
Fix Released
Bug description:
nova master
devstack multinode with 2 compute nodes
1. boot a vm with a direct port
2. nova migrate 128a2ba4-fb6e-49f4-a6e0-45cde1c60215
3. nova resize-confirm 128a2ba4-fb6e-49f4-a6e0-45cde1c60215
4. nova migrate 128a2ba4-fb6e-49f4-a6e0-45cde1c60215
5. nova resize-confirm 128a2ba4-fb6e-49f4-a6e0-45cde1c60215
The second migration failed with this error:
2016-09-12 13:12:45.750 8388 DEBUG oslo_concurrency.lockutils [req-a4a0126a-215a-489a-b043-ad38d3b5e28d - -] Lock "compute_resources" released by "nova.compute.resource_tracker._update_available_resource" :: held 0.143s inner /usr/lib/python2.7/site-packages/oslo_concurrency/lockutils.py:282
2016-09-12 13:12:45.750 8388 ERROR nova.compute.manager [req-a4a0126a-215a-489a-b043-ad38d3b5e28d - -] Error updating resources for node r-dcs224.mtr.labs.mlnx.
2016-09-12 13:12:45.750 8388 ERROR nova.compute.manager Traceback (most recent call last):
2016-09-12 13:12:45.750 8388 ERROR nova.compute.manager File "/.autodirect/mtrswgwork/moshele/openstack/nova/nova/compute/manager.py", line 6408, in update_available_resource_for_node
2016-09-12 13:12:45.750 8388 ERROR nova.compute.manager rt.update_available_resource(context)
2016-09-12 13:12:45.750 8388 ERROR nova.compute.manager File "/.autodirect/mtrswgwork/moshele/openstack/nova/nova/compute/resource_tracker.py", line 526, in update_available_resource
2016-09-12 13:12:45.750 8388 ERROR nova.compute.manager self._update_available_resource(context, resources)
2016-09-12 13:12:45.750 8388 ERROR nova.compute.manager File "/usr/lib/python2.7/site-packages/oslo_concurrency/lockutils.py", line 271, in inner
2016-09-12 13:12:45.750 8388 ERROR nova.compute.manager return f(*args, **kwargs)
2016-09-12 13:12:45.750 8388 ERROR nova.compute.manager File "/.autodirect/mtrswgwork/moshele/openstack/nova/nova/compute/resource_tracker.py", line 580, in _update_available_resource
2016-09-12 13:12:45.750 8388 ERROR nova.compute.manager self.pci_tracker.clean_usage(instances, migrations, orphans)
2016-09-12 13:12:45.750 8388 ERROR nova.compute.manager File "/.autodirect/mtrswgwork/moshele/openstack/nova/nova/pci/manager.py", line 326, in clean_usage
2016-09-12 13:12:45.750 8388 ERROR nova.compute.manager self._free_device(dev)
2016-09-12 13:12:45.750 8388 ERROR nova.compute.manager File "/.autodirect/mtrswgwork/moshele/openstack/nova/nova/pci/manager.py", line 270, in _free_device
2016-09-12 13:12:45.750 8388 ERROR nova.compute.manager freed_devs = dev.free(instance)
2016-09-12 13:12:45.750 8388 ERROR nova.compute.manager File "/.autodirect/mtrswgwork/moshele/openstack/nova/nova/objects/pci_device.py", line 397, in free
2016-09-12 13:12:45.750 8388 ERROR nova.compute.manager hopestatus=ok_statuses)
2016-09-12 13:12:45.750 8388 ERROR nova.compute.manager PciDeviceInvalidStatus: PCI device 3:0000:03:00.5 is available instead of ('allocated', 'claimed')
2016-09-12 13:12:45.750 8388 ERROR nova.compute.manager
2016-09-12 13:12:46.220 8388 DEBUG oslo_service.periodic_task [req-a4a0126a-215a-489a-b043-ad38d3b5e28d - -] Running periodic task ComputeManager._sync_scheduler_instance_info run_periodic_tasks /usr/lib/python2.7/site-packages/oslo_service/periodic_task.py:215
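One plausible reading of the traceback above is that the device had already been marked available while a stale entry for it remained in the tracker's claims, so the periodic cleanup tried to free it a second time. The sketch below reproduces that failure mode in isolation. All names (ToyInvalidStatus, ToyDevice, clean_stale_claims, reproduce) are hypothetical and the code is a toy model, not nova's implementation.

```python
class ToyInvalidStatus(Exception):
    """Hypothetical stand-in for PciDeviceInvalidStatus."""


class ToyDevice:
    def __init__(self, address):
        self.address = address
        self.status = "claimed"

    def free(self):
        # Freeing is only valid from one of these states.
        ok_statuses = ("allocated", "claimed")
        if self.status not in ok_statuses:
            raise ToyInvalidStatus(
                "PCI device %s is %s instead of %s"
                % (self.address, self.status, ok_statuses))
        self.status = "available"


def clean_stale_claims(claims):
    # Mimics a periodic cleanup that frees every device still listed.
    for devs in claims.values():
        for dev in devs:
            dev.free()


def reproduce():
    dev = ToyDevice("0000:03:00.5")
    claims = {"instance-1": [dev]}
    # Status-only free: the device is marked available, but its entry
    # is left behind in the claims table.
    dev.status = "available"
    try:
        clean_stale_claims(claims)
    except ToyInvalidStatus as exc:
        return str(exc)
    return None


print(reproduce())
```

In this model the cleanup pass raises because the stale claim points at a device that is no longer claimed or allocated, which matches the shape of the error message in the log.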
To manage notifications about this bug go to:
https://bugs.launchpad.net/nova/+bug/1622854/+subscriptions