I would try this again:

1. Delete all instances on the host.
2. Clean out the _base directory.
3. Restart nova-compute.
4. Try to run the instance again.
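
Step 2 could be scripted roughly as follows. This is only a sketch: it assumes the cache lives at /var/lib/nova/instances/_base (the path shown in the quoted traceback) and that every instance on the host has already been deleted. clear_image_cache is a hypothetical helper name, not part of nova.

```python
import os
import shutil

# Path taken from the traceback in this thread; adjust it for your deployment.
BASE_DIR = "/var/lib/nova/instances/_base"


def clear_image_cache(base_dir=BASE_DIR):
    """Remove every entry under base_dir and return the names removed."""
    removed = []
    for name in sorted(os.listdir(base_dir)):
        path = os.path.join(base_dir, name)
        if os.path.isdir(path) and not os.path.islink(path):
            # Real subdirectory: remove it together with its contents.
            shutil.rmtree(path)
        else:
            # Regular file or symlink (e.g. leftover .part / .converted files).
            os.remove(path)
        removed.append(name)
    return removed
```

Call clear_image_cache() as a user with write access to that directory, then restart nova-compute using whatever service manager your distribution provides before booting the instance again.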

If that doesn't work, I would suspect a bad sector on your hard drive that is 
getting reused.
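
One rough way to check for that is to read the suspect file from start to finish and see whether the kernel reports an I/O error. The sketch below assumes Python 3; first_read_error is a hypothetical helper, not anything nova provides. A clean pass is not proof the disk is healthy, since reads may be served from the page cache, so treat it as a quick hint only.

```python
def first_read_error(path, chunk_size=1024 * 1024):
    """Read path sequentially.

    Returns None if the whole file could be read, otherwise a tuple
    (offset, exception) where offset is the number of bytes read
    successfully before the error occurred.
    """
    offset = 0
    try:
        with open(path, "rb") as f:
            while True:
                chunk = f.read(chunk_size)
                if not chunk:
                    return None  # reached end of file without errors
                offset += len(chunk)
    except OSError as exc:
        return (offset, exc)
```

For example, you could point it at the .part file under _base named in the qemu-img command right after a failed spawn, if the file is still there.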

Vish
 
On Apr 5, 2012, at 2:43 AM, yuanke wei wrote:

> hi all,
> 
> prob1:
> I deployed OpenStack with one controller and N compute nodes. All the compute 
> nodes seem to work well, and a Windows 2008 Server image can be spawned 
> successfully, except on one compute node, where I encountered the following 
> errors. Can someone tell me what might be the problem?
> After deleting all the files under the "_base" dir, the error is still there.
> If needed, I can provide further info.
> 
> 2012-04-05 09:30:03,874 DEBUG nova.rpc [-] Making asynchronous cast on network... from (pid=3068) cast /var/lib/nova/nova/rpc/impl_kombu.py:756
> 2012-04-05 09:30:03,932 ERROR nova.rpc [-] Exception during message handling
> (nova.rpc): TRACE: Traceback (most recent call last):
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/rpc/impl_kombu.py", line 620, in _process_data
> (nova.rpc): TRACE:     rval = node_func(context=ctxt, **node_args)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/exception.py", line 100, in wrapped
> (nova.rpc): TRACE:     return f(*args, **kw)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/compute/manager.py", line 485, in run_instance
> (nova.rpc): TRACE:     self._run_instance(context, instance_id, **kwargs)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/compute/manager.py", line 481, in _run_instance
> (nova.rpc): TRACE:     _cleanup()
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/compute/manager.py", line 406, in _cleanup
> (nova.rpc): TRACE:     _deallocate_network()
> (nova.rpc): TRACE:   File "/usr/lib/python2.6/contextlib.py", line 23, in __exit__
> (nova.rpc): TRACE:     self.gen.next()
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/compute/manager.py", line 459, in _run_instance
> (nova.rpc): TRACE:     network_info, block_device_info)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/exception.py", line 100, in wrapped
> (nova.rpc): TRACE:     return f(*args, **kw)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/virt/libvirt/connection.py", line 629, in spawn
> (nova.rpc): TRACE:     block_device_info=block_device_info)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/virt/libvirt/connection.py", line 896, in _create_image
> (nova.rpc): TRACE:     size=size)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/virt/libvirt/connection.py", line 788, in _cache_image
> (nova.rpc): TRACE:     call_if_not_exists(base, fn, *args, **kwargs)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/utils.py", line 687, in inner
> (nova.rpc): TRACE:     retval = f(*args, **kwargs)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/virt/libvirt/connection.py", line 786, in call_if_not_exists
> (nova.rpc): TRACE:     fn(target=base, *args, **kwargs)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/virt/libvirt/connection.py", line 800, in _fetch_image
> (nova.rpc): TRACE:     images.fetch_to_raw(context, image_id, target, user_id, project_id)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/virt/images.py", line 88, in fetch_to_raw
> (nova.rpc): TRACE:     path_tmp, staged)
> (nova.rpc): TRACE:   File "/var/lib/nova/nova/utils.py", line 190, in execute
> (nova.rpc): TRACE:     cmd=' '.join(cmd))
> (nova.rpc): TRACE: ProcessExecutionError: Unexpected error while running command.
> (nova.rpc): TRACE: Command: qemu-img convert -O raw /var/lib/nova/instances/_base/bc33ea4e26e5e1af1408321416956113a4658763.part /var/lib/nova/instances/_base/bc33ea4e26e5e1af1408321416956113a4658763.converted
> (nova.rpc): TRACE: Exit code: 1
> (nova.rpc): TRACE: Stdout: ''
> (nova.rpc): TRACE: Stderr: 'qemu-img: error while reading\n'
> (nova.rpc): TRACE:
> 
> prob2:
> Since the problem may be that the cached image on the compute node is 
> broken, how can I force the compute node to abandon its locally cached 
> images and fetch them from the remote image server?
> Simply deleting all the files under the "_base" dir doesn't seem to work; I 
> see no data transfer between the compute node and the image server.
> 
> Thanks in advance!
> 
> 
> 
> 
> 
> -----------------------------------------------------------------
> 韦远科 
> wei yuanke(wei)
> gtalk: weiyuanke...@gmail.com
> msn: weiyuanke...@hotmail.com
> 
> _______________________________________________
> Mailing list: https://launchpad.net/~openstack
> Post to     : openstack@lists.launchpad.net
> Unsubscribe : https://launchpad.net/~openstack
> More help   : https://help.launchpad.net/ListHelp
