Re: HA in CS 4.1

2013-07-29 Thread Salvatore Sciacco
Done :-) Il giorno 29/lug/2013 23:10, "Bryan Whitehead" ha scritto: > Salvatore, Please go vote for and add details to this bug: > https://issues.apache.org/jira/browse/CLOUDSTACK-3535 > > -Bryan > > On Mon, Jul 29, 2013 at 1:30 PM, Salvatore Sciacco > wrote: > > I've powered off a host of a KVM

Re: HA in CS 4.1

2013-07-29 Thread Bryan Whitehead
Salvatore, Please go vote for and add details to this bug: https://issues.apache.org/jira/browse/CLOUDSTACK-3535 -Bryan On Mon, Jul 29, 2013 at 1:30 PM, Salvatore Sciacco wrote: > I've powered off a host of a KVM cluster to simulate the server failure and > I'm experiencing the "Agent state cann

Re: HA in CS 4.1

2013-07-29 Thread Salvatore Sciacco
I've powered off a host of a KVM cluster to simulate the server failure and I'm experiencing the "Agent state cannot be determined, do nothing" loop. How I can tell the manager that the host is died and to start the HA procedure? There is some field on the db I can update? 2013/7/29 Kirk Jant

Re: HA in CS 4.1

2013-07-29 Thread Kirk Jantzer
Great information, thanks so much for sharing!! On Mon, Jul 29, 2013 at 4:28 AM, Ryan Lei wrote: > FYR, I have done some similar tests several weeks ago to test the host HA > functionality. > > CS 4.0.2 + XenServer 6.0.2: Works as expected. HA-enabled VMs (including > System VMs) were automatic

Re: HA in CS 4.1

2013-07-29 Thread Ryan Lei
FYR, I have done some similar tests several weeks ago to test the host HA functionality. CS 4.0.2 + XenServer 6.0.2: Works as expected. HA-enabled VMs (including System VMs) were automatically restarted on HA-dedicated hosts. CS 4.1.0 + XCP 1.6: No HA thing happened at all much like what you descr

Re: HA in CS 4.1

2013-07-25 Thread Bryan Whitehead
blue.com > > -Original Message- > From: Chip Childers [mailto:chip.child...@sungard.com] > Sent: 25 July 2013 19:23 > To: > Subject: Re: HA in CS 4.1 > > On Thu, Jul 25, 2013 at 2:20 PM, Paul Angus wrote: > >> Sounds like https://issues.apache.org/jira/browse

RE: HA in CS 4.1

2013-07-25 Thread Paul Angus
: Subject: Re: HA in CS 4.1 On Thu, Jul 25, 2013 at 2:20 PM, Paul Angus wrote: > Sounds like https://issues.apache.org/jira/browse/CLOUDSTACK-3535 then. > > It's been upgraded to a blocker now so 4.1.1 and 4.2 can't be released > until it's fixed. I hope you can get

Re: HA in CS 4.1

2013-07-25 Thread Kirk Jantzer
hich is blank by default. This > > > is the number of seconds CloudStack waits after it loses communication > > > a host before doing anything. Blank somehow equals 30 mins in this > case. > > > > > > > > > Regards, > > > > > > Paul An

Re: HA in CS 4.1

2013-07-25 Thread Chip Childers
0540 | M: +447711418784 | T: CloudyAngus > paul.an...@shapeblue.com > > -Original Message- > From: Kirk Jantzer [mailto:kirk.jant...@gmail.com] > Sent: 25 July 2013 19:04 > To: Cloudstack users mailing list > Subject: Re: HA in CS 4.1 > > I set alert.wait to 60sec, restarted

RE: HA in CS 4.1

2013-07-25 Thread Paul Angus
dyAngus paul.an...@shapeblue.com -Original Message- From: Kirk Jantzer [mailto:kirk.jant...@gmail.com] Sent: 25 July 2013 19:04 To: Cloudstack users mailing list Subject: Re: HA in CS 4.1 I set alert.wait to 60sec, restarted, shut down host, and nothing :-( On Wed, Jul 24, 2013 at 4:22

RE: HA in CS 4.1

2013-07-25 Thread Geoff Higginbottom
| S: +44 20 3603 0540 | M: +447968161581 geoff.higginbot...@shapeblue.com -Original Message- From: Kirk Jantzer [mailto:kirk.jant...@gmail.com] Sent: 25 July 2013 19:04 To: Cloudstack users mailing list Subject: Re: HA in CS 4.1 I set alert.wait to 60sec, restarted, shut down host, and

Re: HA in CS 4.1

2013-07-25 Thread Kirk Jantzer
11418784 | T: CloudyAngus > paul.an...@shapeblue.com > > -Original Message- > From: Kirk Jantzer [mailto:kirk.jant...@gmail.com] > Sent: 24 July 2013 19:11 > To: Cloudstack users mailing list > Subject: Re: HA in CS 4.1 > > 2013-07-24 10:08:50,973 DEBUG [cloud.ha.AbstractInves

RE: HA in CS 4.1

2013-07-24 Thread Paul Angus
ng. Blank somehow equals 30 mins in this case. Regards, Paul Angus S: +44 20 3603 0540 | M: +447711418784 | T: CloudyAngus paul.an...@shapeblue.com -Original Message- From: Kirk Jantzer [mailto:kirk.jant...@gmail.com] Sent: 24 July 2013 19:11 To: Cloudstack users mailing list Subject: Re

Re: HA in CS 4.1

2013-07-24 Thread Kirk Jantzer
2013-07-24 10:08:50,973 DEBUG [cloud.ha.AbstractInvestigatorImpl] (AgentTaskPool-16:null) host () cannot be pinged, returning null ('I don't know') 2013-07-24 10:08:50,973 DEBUG [cloud.ha.UserVmDomRInvestigator] (AgentTaskPool-16:null) could not reach agent, could not reach agent's host, returning

HA in CS 4.1

2013-07-24 Thread Kirk Jantzer
Can someone "explain like I'm 5" how HA in CS should work? I have 4.1 setup with XCP hosts. To simulate a host failure, I've hard powered off a host through iDRAC and CS doesn't seem to know about it, at all -- the instances still show as running, but I cannot connect to them, and the host shows as