← Back to team overview

openstack team mailing list archive

Re: detecting errors when determining libvirt vm power state

 

Quoting Serge Hallyn (serge.hallyn@xxxxxxxxxxxxx):
> Hi,
> 
> I'm looking at what first manifested as a bug when launching multiple
> lxc containers simultaneously, i.e. 'euca-run-instances -n 4', as
> reported at https://bugs.launchpad.net/ubuntu/+source/nova/+bug/842845.
> 
> The problem appears to be that nova uses self.driver.get_info().  Libvirt
> can raise excpetions on this for several reasons - the vm could be bad or
> not exist, or it could be in a transient state i.e. cgroups are not set
> up yet.
> 
> What is the right way to handle this?  Should the drivers categorize
> their exceptions into either 'broken' or 'transient' ones, so that
> nova can detect former and bail, and retry on the latter?

Now that I've sent that, I guess it seems pretty clear that the
lxc getinfo helper should understand that -ENOENT from getcgroup
means it's not yet ready, and set the values to 0 as it does if
the domain is not running.

-serge


References