yahoo-eng-team team mailing list archive

[Bug 1582558] Re: Live-migration exception handling need improvement

 

I am a bit skeptical about ignoring the exception in both of the cases you described. For example, if the call to Cinder fails during the rollback, we risk leaving open connections between the host and the Cinder server, which is not good.
The same thing could happen during _post_live_migration.
I agree that it would be good not to stop those two actions for errors that Nova can't control, but ignoring those errors doesn't seem like the right approach.
We probably need a new design that implements the rollback and post_live_migration tasks in a more asynchronous way, with the ability to check the results and take corrective action on potential errors. I reckon that is a complicated task which will probably require some help from other services (Cinder, for example). I see this more as a new spec than as a bug; I'd suggest coming up with a more structured proposal and submitting a spec so that we can have a discussion involving a broader audience.
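
To make the idea a bit more concrete, here is a minimal sketch of what "continue the cleanup, but keep control of the results" could look like. It is not Nova code: run_cleanup_steps, CleanupReport and the step names are hypothetical, and a real design would still have to decide how the recorded failures get retried or reported.

```python
import logging
from dataclasses import dataclass, field
from typing import Callable, Iterable, List, Tuple

LOG = logging.getLogger(__name__)


@dataclass
class CleanupReport:
    """Outcome of a best-effort cleanup run (hypothetical helper type)."""
    succeeded: List[str] = field(default_factory=list)
    failed: List[Tuple[str, Exception]] = field(default_factory=list)

    @property
    def ok(self) -> bool:
        # True only when every step completed without raising.
        return not self.failed


def run_cleanup_steps(
    steps: Iterable[Tuple[str, Callable[[], None]]],
) -> CleanupReport:
    """Run every cleanup step, even if an earlier one raises.

    Failures are logged and recorded rather than silently dropped, so the
    caller can later retry them or flag the instance for operator attention.
    """
    report = CleanupReport()
    for name, step in steps:
        try:
            step()
        except Exception as exc:  # deliberately broad: cleanup must go on
            LOG.warning("Cleanup step %s failed: %s", name, exc)
            report.failed.append((name, exc))
        else:
            report.succeeded.append(name)
    return report
```

A caller would pass in a list of (name, callable) pairs, for example one per volume connection to remove, and then inspect report.failed instead of either aborting on the first error or swallowing it.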

I am going to mark this bug as Invalid, as we do not have a real bug here.
If you are not happy with this decision, please join the IRC meeting of the Nova live migration sub-team:
https://wiki.openstack.org/wiki/Meetings/NovaLiveMigration



** Changed in: nova
       Status: New => Invalid

-- 
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to OpenStack Compute (nova).
https://bugs.launchpad.net/bugs/1582558

Title:
  Live-migration exception handling need improvement

Status in OpenStack Compute (nova):
  Invalid

Bug description:
  Description:
  1. In /nova/compute/manager.py, in the function _rollback_live_migration, we should catch exceptions from
  'remove_volume_connection' and let the rollback process continue to clean up other resources. Because
  'remove_volume_connection' calls Cinder, the probability of an exception is relatively high.
  2. In /nova/compute/manager.py, in the function _post_live_migration, we should catch all exceptions raised while
  cleaning up resources on the source host. Because the VM has already been migrated to the destination host, we should try to make sure the VM keeps running normally.

To manage notifications about this bug go to:
https://bugs.launchpad.net/nova/+bug/1582558/+subscriptions

