yahoo-eng-team team mailing list archive
-
yahoo-eng-team team
-
Mailing list archive
-
Message #86298
[Bug 1929446] Re: OVS polling loop created by ovsdbapp and os-vif starving n-cpu threads
Another possible hit in
https://bugs.launchpad.net/nova/+bug/1863889/comments/3 ?
** Summary changed:
- check_can_live_migrate_source taking > 60 seconds in CI
+ OVS polling loop created by ovsdbapp and os-vif starving n-cpu threads
** Also affects: os-vif
Importance: Undecided
Status: New
--
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to OpenStack Compute (nova).
https://bugs.launchpad.net/bugs/1929446
Title:
OVS polling loop created by ovsdbapp and os-vif starving n-cpu threads
Status in OpenStack Compute (nova):
Triaged
Status in os-vif:
New
Bug description:
I've been seeing lots of failures caused by timeouts in
test_volume_backed_live_migration during the live-migration and
multinode grenade jobs, for example:
https://zuul.opendev.org/t/openstack/build/bb6fd21b5d8c471a89f4f6598aa84e5d/logs
During check_can_live_migrate_source I'm seeing the following gap in
the logs that I can't explain:
12225 May 24 10:23:02.637600 ubuntu-focal-inap-mtl01-0024794054 nova-compute[107012]: DEBUG nova.virt.libvirt.driver [None req-b5288b85-d642-426f-a525-c64724fe4091 tempest-LiveMigrationTest-312230369 tempest-LiveMigrationTest-312230369-project-admin] [instance: 91a0e0ca-e6a8-43ab-8e68-a10a77ad615b] Check if temp file /opt/stack/data/nova/instances/tmp5lcmhuri exists to indicate shared storage is being used for migration. Exists? False {{(pid=107012) _check_shared_storage_test_file /opt/stack/nova/nova/virt/libvirt/driver.py:9367}}
[..]
12282 May 24 10:24:22.385187 ubuntu-focal-inap-mtl01-0024794054 nova-compute[107012]: DEBUG nova.virt.libvirt.driver [None req-b5288b85-d642-426f-a525-c64724fe4091 tempest-LiveMigrationTest-312230369 tempest-LiveMigrationTest-312230369-project-admin] skipping disk /dev/sdb (vda) as it is a volume {{(pid=107012) _get_instance_disk_info_from_config /opt/stack/nova/nova/virt/libvirt/driver.py:10458}}
^ this leads to both the HTTP request to live migrate (that's still a
synchronous call at this point [1]) *and* the RPC call from the dest
to the source both timing out.
[1] https://docs.openstack.org/nova/latest/reference/live-
migration.html
To manage notifications about this bug go to:
https://bugs.launchpad.net/nova/+bug/1929446/+subscriptions
References