← Back to team overview

openstack team mailing list archive

Re: can not start VM instance with specific image

 

I would try this again. 

1. Delete all instances on the host.
2. Clean out the _base directory.
3. Restart nova-compute
4. Try to run the instance again.

If that doesn't work, I would suspect a bad sector on your hard drive that is getting reused.

Vish
 
On Apr 5, 2012, at 2:43 AM, yuanke wei wrote:

> hi all,
> 
> prob1:
> I deployed openstack on a 1controller+Ncompute basis. All the compute node seems work well and windows 2008 server image can be spawned successfully, except on one compute node, I encountered the following errors, can someone tell me what might be the problem??
> after deleting all the files under "_base" dir, the error still there.
> if needed, further info can be provided
> 
> 2012-04-05 09:30:03,874 DEBUG nova.rpc [-] Making asynchronous cast on network... from (pid=3068) cast /var/lib/nova/nova/rpc/impl_kombu.py:756
> 2012-04-05 09:30:03,932 ERROR nova.rpc [-] Exception during message handling
> (nova.rpc): TRACE: Traceback (most recent call last):
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/rpc/impl_kombu.py", line 620, in _process_data
> (nova.rpc): TRACE:     rval = node_func(context=ctxt, **node_args)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/exception.py", line 100, in wrapped
> (nova.rpc): TRACE:     return f(*args, **kw)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/compute/manager.py", line 485, in run_instance
> (nova.rpc): TRACE:     self._run_instance(context, instance_id, **kwargs)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/compute/manager.py", line 481, in _run_instance
> (nova.rpc): TRACE:     _cleanup()
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/compute/manager.py", line 406, in _cleanup
> (nova.rpc): TRACE:     _deallocate_network()
> (nova.rpc): TRACE:   File "/usr/lib/python2.6/contextlib.py", line 23, in __exit__
> (nova.rpc): TRACE:     self.gen.next()
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/compute/manager.py", line 459, in _run_instance
> (nova.rpc): TRACE:     network_info, block_device_info)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/exception.py", line 100, in wrapped
> (nova.rpc): TRACE:     return f(*args, **kw)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/virt/libvirt/connection.py", line 629, in spawn
> (nova.rpc): TRACE:     block_device_info=block_device_info)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/virt/libvirt/connection.py", line 896, in _create_image
> (nova.rpc): TRACE:     size=size)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/virt/libvirt/connection.py", line 788, in _cache_image
> (nova.rpc): TRACE:     call_if_not_exists(base, fn, *args, **kwargs)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/utils.py", line 687, in inner
> (nova.rpc): TRACE:     retval = f(*args, **kwargs)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/virt/libvirt/connection.py", line 786, in call_if_not_exists
> (nova.rpc): TRACE:     fn(target=base, *args, **kwargs)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/virt/libvirt/connection.py", line 800, in _fetch_image
> (nova.rpc): TRACE:     images.fetch_to_raw(context, image_id, target, user_id, project_id)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/virt/images.py", line 88, in fetch_to_raw
> (nova.rpc): TRACE:     path_tmp, staged)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/utils.py", line 190, in execute
> (nova.rpc): TRACE:     cmd=' '.join(cmd))
> (nova.rpc): TRACE: ProcessExecutionError: Unexpected error while running command.
> (nova.rpc): TRACE: Command: qemu-img convert -O raw /var/lib/nova/instances/_base/bc33ea4e26e5e1af1408321416956113a4658763.part /var/lib/nova/instances/_base/bc33ea4e26e5e1af1408321416956113a4658763.converted
> (nova.rpc): TRACE: Exit code: 1
> (nova.rpc): TRACE: Stdout: ''
> (nova.rpc): TRACE: Stderr: 'qemu-img: error while reading\n'
> (nova.rpc): TRACE:
> 
> prob2:
> since the problem may be on the fact that the cached image on the compute node is broken, so how to force the compute node to abandon its local cached images and get from the remote image server???
> simply deleting all the files under "_base" dir seems don't work, I see no data transfer between the compute node and the image server.
> 
> thks in advance!
> 
> 
> 
> 
> 
> -----------------------------------------------------------------
> 韦远科 
> wei yuanke(wei)
> gtalk: weiyuanke123@xxxxxxxxx
> msn: weiyuanke123@xxxxxxxxxxx
> 
> _______________________________________________
> Mailing list: https://launchpad.net/~openstack
> Post to     : openstack@xxxxxxxxxxxxxxxxxxx
> Unsubscribe : https://launchpad.net/~openstack
> More help   : https://help.launchpad.net/ListHelp


Follow ups

References