openstack team mailing list archive

Re: strange problem when reboot nova-compute node: domain not found: no domain with matching name

 

Razique,

 

Thanks.

 

Yes, at first I used a shared NFS export to store all the instances’ data, so
the configuration follows the “live migration” instructions from
doc.openstack.org: libvirtd.conf and libvirt-bin were modified.

To work around permissions on the shared directory, I modified
/etc/libvirt/qemu.conf as follows so that qemu runs as root and avoids the
"permission denied" problem when an instance is created. Right now the VM
images are stored on the compute node's local hard disk, not on the NFS node,
but I did not change the config back:

# The user ID for QEMU processes run by the system instance.
user = "root"

# The group ID for QEMU processes run by the system instance.
group = "root"

# Whether libvirt should dynamically change file ownership
# to match the configured user/group above. Defaults to 1.
# Set to 0 to disable file ownership changes.
#dynamic_ownership = 1
dynamic_ownership = 0

 

Right now, here is the ownership information for /var/lib/nova and instances.
As you can see, the entries under /var/lib/nova are owned by nova, but the
entries under instances are owned by root, and the files in /etc/libvirt/qemu
are owned by root. Is there anything wrong?

 

root@CM215:/var/lib/nova# ls -l
total 136
drwxr-xr-x 2 nova nova   4096 Aug 29 07:39 buckets
drwxr-xr-x 6 nova nova   4096 Sep 18 16:11 CA
drwxr-xr-x 2 nova nova   4096 Aug 29 07:39 images
drwxr-xr-x 5 nova nova   4096 Sep 18 17:22 instances
drwxr-xr-x 2 nova nova   4096 Aug 29 07:39 keys
drwxr-xr-x 2 nova nova   4096 Sep 18 17:09 networks
-rw------- 1 nova nova 104448 Sep 18 16:12 nova.sqlite
drwxr-xr-x 2 nova nova   4096 Aug 29 07:39 tmp

root@CM215:/var/lib/nova# cd instances
root@CM215:/var/lib/nova/instances# ls -l
total 12
drwxr-xr-x 2 root root 4096 Sep 18 17:22 _base
drwxr-xr-x 2 root root 4096 Sep 18 17:21 instance-00000019
drwxr-xr-x 2 root root 4096 Sep 18 17:22 instance-0000001a
root@CM215:/var/lib/nova/instances#

root@CM215:/var/lib/nova/instances# cd /etc/libvirt/qemu
root@CM215:/etc/libvirt/qemu# ls -l
total 12
-rw------- 1 root root 2650 Sep 18 17:34 instance-00000019.xml
-rw------- 1 root root 2650 Sep 18 17:34 instance-0000001a.xml
drwxr-xr-x 3 root root 4096 Sep 18 16:11 networks
root@CM215:/etc/libvirt/qemu#
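
The ownership question above can also be checked mechanically. What follows is a minimal Python sketch, assuming the service account is named nova as in the listing; the helper name entries_not_owned_by is hypothetical and is not part of nova or libvirt.

```python
import pwd
from pathlib import Path


def entries_not_owned_by(directory, expected_user):
    """Return (name, owner) pairs for entries whose owner is not expected_user.

    Only the direct children of `directory` are inspected (no recursion),
    which mirrors a plain `ls -l` of that directory.
    """
    mismatched = []
    for entry in sorted(Path(directory).iterdir()):
        uid = entry.lstat().st_uid
        try:
            owner = pwd.getpwuid(uid).pw_name
        except KeyError:
            # The uid has no passwd entry; fall back to the numeric id.
            owner = str(uid)
        if owner != expected_user:
            mismatched.append((entry.name, owner))
    return mismatched
```

Against the listing above, a call such as entries_not_owned_by("/var/lib/nova/instances", "nova") would be expected to report _base, instance-00000019 and instance-0000001a, each owned by root.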

 

regards,

 

Romi

 

 

From: Razique Mahroua [mailto:razique.mahroua@xxxxxxxxx] 
Sent: Tuesday, September 18, 2012 8:18 PM
To: romizhang1968
Cc: 'Ritesh Nanda'; openstack@xxxxxxxxxxxxxxxxxxx
Subject: Re: [Openstack] strange problem when reboot nova-compute node:
domain not found: no domain with matching name

 

Does that help?

http://plosquare.blogspot.fr/2010/01/lvm2-snapshots-device-mapper-create.html

 

Nuage & Co - Razique Mahroua 

razique.mahroua@xxxxxxxxx




 

On 18 Sep 2012, at 14:16, Razique Mahroua <razique.mahroua@xxxxxxxxx> wrote:





Interesting...

Do you have an NFS share between several nodes?

Does the nova user have write permission on all the directories within
/var/lib/nova/instances?

It may be a permissions issue, but I'm not so sure.

 


On 18 Sep 2012, at 11:26, romizhang1968 <romizhang1968@xxxxxxx> wrote:





Additional information:

When I boot a new instance from a CentOS image, the system shows:

 

libvir: QEMU error : Domain not found: no domain with matching name
'instance-00000019'

.......

-4a73-b147-0c4f02191d2c] Ignoring error injecting data into image
9f595093-d52f-4b0f-8ff8-19ed98fc09f4 (Unexpected error while running
command.

Command: sudo kpartx -d /dev/nbd15

Exit code: 1

Stdout: ''

Stderr: 'device-mapper: remove ioctl failed: Device or resource busy\n')

2012-09-18 17:21:56 INFO nova.virt.libvirt.connection [-] [instance:
25b2e627-f37b-4a73-b147-0c4f02191d2c] Instance spawned successfully.


At 2012-09-17 21:18:27,"Razique Mahroua" <razique.mahroua@xxxxxxxxx> wrote:



Hey Romi,

It looks like the image is not fully spawned, meaning it doesn't get added to
virsh. During the spawning, watch /var/log/nova/nova-compute.log and look for
errors.

Can you send the file?

 


On 17 Sep 2012, at 14:56, romi zhang <romizhang1968@xxxxxxx> wrote:





Razique,

 

Additional information:

My other test system has not been through the full set of test jobs. I ran
into this problem before on my small system. If I use an Ubuntu image
downloaded from the network, the problem does not occur. If I create an
instance from a CentOS or Windows image, the problem does occur: while the
CentOS or Windows instance is being built, nova-compute always reports
"libvirt qemu error, domain not found". The instance eventually runs
normally, but if I then reboot the nova-compute host and start nova-compute,
it stops with: CRITICAL error: libvirt qemu error, domain not found,…..

 

Romi

 

From: romi zhang [mailto:romizhang1968@163.com]
Sent: Monday, September 17, 2012 8:48 PM
To: 'Razique Mahroua'
Cc: 'Ritesh Nanda'; 'openstack@xxxxxxxxxxxxxxxxxxx'
Subject: RE: [Openstack] strange problem when reboot nova-compute node:
domain not found: no domain with matching name

 

Razique,

 

I appreciate your attention.

 

# ls /etc/libvirt/qemu/networks/autostart

default.xml

 

As I remember, when I first completed the system installation, rebooting a
compute node caused no problem. After I created CentOS and Windows instances
(during boot the system showed "libvirt qemu error: domain not found") and
rebooted one of the compute nodes, nova-compute stopped and showed:
CRITICAL error, Libvirt: QEMU error: Domain not found: no domain with
matching name ‘instance-0000000a’. Then the other nodes had the same problem.

On my side, it is systematic behavior.

But on my other test system, this problem has not happened.

 

Regards,

 

Romi

 

From: Razique Mahroua [mailto:razique.mahroua@gmail.com]
Sent: Sunday, September 16, 2012 5:04 PM
To: romi zhang
Cc: 'Ritesh Nanda'; openstack@xxxxxxxxxxxxxxxxxxx
Subject: Re: [Openstack] strange problem when reboot nova-compute node:
domain not found: no domain with matching name

 

Hey,

Could you remove the flag:

--resume_guests_state_on_host_boot=true

and restart the nova-compute service?
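
To confirm whether the flag is currently set before removing it, something like the following could be used. It is a rough sketch that assumes nova.conf is in the flagfile style, one --name=value per line, as the flag above is written; flag_value is a hypothetical helper, not a nova API.

```python
def flag_value(flagfile_text, flag_name):
    """Return the value of `--flag_name=value` in flagfile text, or None if absent.

    Blank lines and lines starting with '#' are skipped. A bare `--flag_name`
    with no '=' yields an empty string. If the flag is repeated, the last
    occurrence is returned (a choice made for this sketch only).
    """
    value = None
    for raw_line in flagfile_text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or not line.startswith("--"):
            continue
        name, sep, rest = line[2:].partition("=")
        if name == flag_name:
            value = rest if sep else ""
    return value
```

For example, flag_value(open("/etc/nova/nova.conf").read(), "resume_guests_state_on_host_boot") returns None once the line has been removed; the path /etc/nova/nova.conf is an assumption here.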

 

Can you run

$ ls /etc/libvirt/qemu/autostart

Is that behaviour systematic? Does it happen every time you restart the
nova-compute service, or do you also need to restart your compute node?

 

 


On 15 Sep 2012, at 12:00, romi zhang <romizhang1968@xxxxxxx> wrote:

 

Razique,

 

Thank you very much!

 

Here is the paste:

http://paste.openstack.org/show/20938/

This config worked fine on the first test system.

The current problem occurs on my second test system, where I previously
created a raw-format image and converted it to qcow2.

I am going to recreate the image directly in qcow2 format; that is in
progress now,……

Is there anything I should take care of with the image? It looks related to
the glance image, I guess.

 

Romi

 

From: Razique Mahroua [mailto:razique.mahroua@gmail.com]
Sent: Saturday, September 15, 2012 5:43 PM
To: romi zhang
Cc: 'Ritesh Nanda'; openstack@xxxxxxxxxxxxxxxxxxx
Subject: Re: [Openstack] strange problem when reboot nova-compute node:
domain not found: no domain with matching name

 

Hey, I just wanted to see if the domain was still declared within the virsh
database.

If needed, you can also go to your instances directory and run
$ virsh define $instance-name/domain.xml

Can you post your nova.conf, please?

 


On 15 Sep 2012, at 08:26, romi zhang <romizhang1968@xxxxxxx> wrote:






Hi Razique,

 

Is there anything that needs a deeper look?

The virsh list output looks fine; it matches the instances I created through
nova boot.

 

Romi

 

From: Razique Mahroua [mailto:razique.mahroua@gmail.com]
Sent: Friday, September 14, 2012 4:10 AM
To: Ritesh Nanda
Cc: romi zhang; openstack@xxxxxxxxxxxxxxxxxxx
Subject: Re: [Openstack] strange problem when reboot nova-compute node:
domain not found: no domain with matching name

 

Hi Romi,

What does $ virsh list --all give you?


On 13 Sep 2012, at 21:11, Ritesh Nanda <riteshnanda09@xxxxxxxxx> wrote:







Hello romi,

The image got into a stuck state. The only solution is to hack your database
for this particular instance and mark it as deleted, then restart the nova-*
services; it should start working.

On Thu, Sep 13, 2012 at 9:45 PM, romi zhang <romizhang1968@xxxxxxx> wrote:

Hi,

 

I’ve installed Essex. When I reboot one of the nova-compute nodes and then
start the nova-compute service again, the system reports:

Libvirt: QEMU error: Domain not found: no domain with matching name
‘instance-0000000a’

 

Then I found:  

#ls /var/lib/nova/instances

_base instance-0000000a instance-0000000b instance-0000000c

#ls /etc/libvirt/qemu

instance-0000000b.xml instance-0000000c.xml Network

 

So:

/var/lib/nova/instances/instance-0000000a# virsh define libvirt.xml

Domain instance-0000000a defined from libvirt.xml

And:

#ls /etc/libvirt/qemu

instance-0000000a.xml instance-0000000b.xml instance-0000000c.xml network

 

Then I restarted it with: nova-compute &, and nova-compute started normally.

I do not understand why, after rebooting the compute node, the first
instance’s XML file is usually deleted and nova-compute terminates.
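
The comparison between the two directories above can be sketched in Python. This is only an illustration, assuming each instance directory is named instance-... and that libvirt keeps one <name>.xml per defined domain in /etc/libvirt/qemu, as the listings show; the helper name undefined_instances is hypothetical.

```python
from pathlib import Path


def undefined_instances(instances_dir, libvirt_qemu_dir):
    """Return instance directory names that have no <name>.xml in the libvirt directory."""
    # Domain names libvirt has a persistent definition for, e.g. "instance-0000000b".
    defined = {p.stem for p in Path(libvirt_qemu_dir).glob("*.xml")}
    return sorted(
        p.name
        for p in Path(instances_dir).iterdir()
        # Skip _base and anything else that is not an instance directory.
        if p.is_dir() and p.name.startswith("instance-") and p.name not in defined
    )
```

With the listings above it would return instance-0000000a, the domain that then has to be re-registered with virsh define libvirt.xml from its instance directory.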

 

By the way, when I create a new instance, the first line in the log is also:
Libvirt: QEMU error: Domain not found: no domain with matching name
‘instance-0000000a’

Has anyone met such a problem, and does anyone know why?

 

Regards,

 

romi


_______________________________________________
Mailing list: https://launchpad.net/~openstack
Post to     : openstack@xxxxxxxxxxxxxxxxxxx
Unsubscribe : https://launchpad.net/~openstack
More help   : https://help.launchpad.net/ListHelp




--

 With Regards 

 Ritesh Nanda

 

 

