[Openstack] Ok, how do I debug this problem?

2013-01-21 Thread Christopher Smith

So we have this four node Nova cluster running Essex.  It has worked fine, but 
we've added new network interfaces.  Openstack has not yet been configured to 
use them, but changes have been made that allows the system to come up and 
configure the network in a similar way to before.  Now, all of the services 
seem to be up and running, Rabbit is apparently happy as far as I can tell.  
Nova-scheduler and nova-compute are running and talking to Rabbit.  Now then, 
whenever I log into any nova node and boot an instance or ten, all instances 
get started on a single node; always the same node, never any attempt by any of 
the others to start an instance.  I thought this would be a Rabbit problem, but 
rabbit thinks it's doing fine on all nodes and the cluster is intact.  I've 
even tried completely rebuilding the Rabbit configuration, just in case.  If I 
stop services on the one working node, the other nodes don't even appear to 
receive messages from either themselves or each other, regardless of Rabbit's 
supposedly working state.

Any ideas?  Any clue how you would even test this to see what's happening?

Chris

-- 
Christopher Smith
Systems Engineer, Wolfram Research

___
Mailing list: https://launchpad.net/~openstack
Post to : openstack@lists.launchpad.net
Unsubscribe : https://launchpad.net/~openstack
More help   : https://help.launchpad.net/ListHelp


Re: [Openstack] Ok, how do I debug this problem?

2013-01-22 Thread Belmiro Moreira
Hi Chris,
Are all compute nodes happy?

nova-manage service list

Belmiro

On Jan 21, 2013, at 7:23 PM, Christopher Smith  wrote:

> 
> So we have this four node Nova cluster running Essex.  It has worked fine, 
> but we've added new network interfaces.  Openstack has not yet been 
> configured to use them, but changes have been made that allows the system to 
> come up and configure the network in a similar way to before.  Now, all of 
> the services seem to be up and running, Rabbit is apparently happy as far as 
> I can tell.  Nova-scheduler and nova-compute are running and talking to 
> Rabbit.  Now then, whenever I log into any nova node and boot an instance or 
> ten, all instances get started on a single node; always the same node, never 
> any attempt by any of the others to start an instance.  I thought this would 
> be a Rabbit problem, but rabbit thinks it's doing fine on all nodes and the 
> cluster is intact.  I've even tried completely rebuilding the Rabbit 
> configuration, just in case.  If I stop services on the one working node, the 
> other nodes don't even appear to receive messages from either themselves or 
> each other, regardless of Rabbit's supposedly working state.
> 
> Any ideas?  Any clue how you would even test this to see what's happening?
> 
> Chris
> 
> -- 
> Christopher Smith
> Systems Engineer, Wolfram Research
> 
> ___
> Mailing list: https://launchpad.net/~openstack
> Post to : openstack@lists.launchpad.net
> Unsubscribe : https://launchpad.net/~openstack
> More help   : https://help.launchpad.net/ListHelp


___
Mailing list: https://launchpad.net/~openstack
Post to : openstack@lists.launchpad.net
Unsubscribe : https://launchpad.net/~openstack
More help   : https://help.launchpad.net/ListHelp