Re: CS 4.5.2: all hosts reboot after 3 days at production

2015-09-21 Thread Vadim Kimlaychuk
. Can you tell us in some more details what happens and how we can reproduce it? Regards, Remi -⁠-⁠-⁠-⁠-⁠Original Message-⁠-⁠-⁠-⁠-⁠ From: Vadim Kimlaychuk [mailto:va...@kickcloud.net] Sent: zondag 13 september 2015 9:32 To: users@cloudstack.apache.org Cc: Remi Bergsma Subject: Re: CS 4.5.2: all hosts reb

Re: CS 4.5.2: all hosts reboot after 3 days at production

2015-09-15 Thread Frank Louwers
mi Bergsma wrote: >>> Hi Vadim, >>> Not sure what the problem is. Although I do know that when shared >>> storage is used, both CloudStack and XenServer will fence (reboot) the >>> box to prevent corruption in case access to the network or the storage >>> is n

Re: CS 4.5.2: all hosts reboot after 3 days at production

2015-09-15 Thread Abhinandan Prateek
vent corruption in case access to the network or the storage >> is not possible. What storage do you use? >> What does this return on a XenServer?: >> xe pool-⁠list params=all | grep -⁠E "ha-⁠enabled|ha-⁠config" >> HA should be on, or else a hypervisor crash will not r

Re: CS 4.5.2: all hosts reboot after 3 days at production

2015-09-15 Thread Vadim Kimlaychuk
, Remi -⁠-⁠-⁠-⁠-⁠Original Message-⁠-⁠-⁠-⁠-⁠ From: Vadim Kimlaychuk [mailto:va...@kickcloud.net] Sent: zondag 13 september 2015 9:32 To: users@cloudstack.apache.org Cc: Remi Bergsma Subject: Re: CS 4.5.2: all hosts reboot after 3 days at production Hello Remi, This issue has nothing to do w

Re: CS 4.5.2: all hosts reboot after 3 days at production

2015-09-15 Thread Remi Bergsma
e what the problem is. Although I do know that when shared >> storage is used, both CloudStack and XenServer will fence (reboot) the >> box to prevent corruption in case access to the network or the storage >> is not possible. What storage do you use? >> What does this return o

Re: CS 4.5.2: all hosts reboot after 3 days at production

2015-09-14 Thread Remi Bergsma
ypervisor crash will not recover properly. >> >> If you search the logs for Fence or reboot, does anything come back? >> >> The logs you mention are nothing to worry about. >> >> Can you tell us in some more details what happens and how we can >> reproduce

Re: CS 4.5.2: all hosts reboot after 3 days at production

2015-09-14 Thread Vadim Kimlaychuk
mlaychuk [mailto:va...@kickcloud.net] Sent: zondag 13 september 2015 9:32 To: users@cloudstack.apache.org Cc: Remi Bergsma Subject: Re: CS 4.5.2: all hosts reboot after 3 days at production Hello Remi, This issue has nothing to do with CS 4.5.2. We got host reboot after precisely 1 week with

Re: CS 4.5.2: all hosts reboot after 3 days at production

2015-09-14 Thread Remi Bergsma
>> >> What does this return on a XenServer?: >> xe pool-⁠list params=all | grep -⁠E "ha-⁠enabled|ha-⁠config" >> >> HA should be on, or else a hypervisor crash will not recover properly. >> >> If you search the logs for Fence or reboot, does anythi

Re: CS 4.5.2: all hosts reboot after 3 days at production

2015-09-14 Thread Vadim Kimlaychuk
Can you tell us in some more details what happens and how we can reproduce it? Regards, Remi -⁠-⁠-⁠-⁠-⁠Original Message-⁠-⁠-⁠-⁠-⁠ From: Vadim Kimlaychuk [mailto:va...@kickcloud.net] Sent: zondag 13 september 2015 9:32 To: users@cloudstack.apache.org Cc: Remi Bergsma Subject: Re: CS 4.5.2: all hosts reboo

RE: CS 4.5.2: all hosts reboot after 3 days at production

2015-09-14 Thread Vadim Kimlaychuk
rry about. Can you tell us in some more details what happens and how we can reproduce it? Regards, Remi -⁠-⁠-⁠-⁠-⁠Original Message-⁠-⁠-⁠-⁠-⁠ From: Vadim Kimlaychuk [mailto:va...@kickcloud.net] Sent: zondag 13 september 2015 9:32 To: users@cloudstack.apache.org Cc: Remi Bergsma Subject: Re: CS 4.5

Re: CS 4.5.2: all hosts reboot after 3 days at production

2015-09-13 Thread Vadim Kimlaychuk
Hello Remi, This issue has nothing to do with CS 4.5.2. We got host reboot after precisely 1 week with previous version of CS (4.5.1). Previous version has been working without restart for 106 days before. So it is not a software issue. What does really make me unhappy --

RE: CS 4.5.2: all hosts reboot after 3 days at production

2015-09-13 Thread Vadim Kimlaychuk
g Cc: Remi Bergsma Subject: Re: CS 4.5.2: all hosts reboot after 3 days at production Hello Remi, This issue has nothing to do with CS 4.5.2. We got host reboot after precisely 1 week with previous version of CS (4.5.1). Previous version has been working without restart for 106 days before. So

RE: CS 4.5.2: all hosts reboot after 3 days at production

2015-09-13 Thread Remi Bergsma
tails what happens and how we can reproduce it? Regards, Remi -Original Message- From: Vadim Kimlaychuk [mailto:va...@kickcloud.net] Sent: zondag 13 september 2015 9:32 To: users@cloudstack.apache.org Cc: Remi Bergsma Subject: Re: CS 4.5.2: all hosts reboot after 3 days at production

Re: CS 4.5.2: all hosts reboot after 3 days at production

2015-09-08 Thread Vadim Kimlaychuk
Hello Remi, First of all I don't have /var/log/xha.log file. I have examined logs in detail and haven't found any trace that heartbeat has failed. The only serious problem I have found in management logs before restart is repeating many times error:

Re: CS 4.5.2: all hosts reboot after 3 days at production

2015-09-07 Thread Remi Bergsma
Hi Vadim, What kind of storage do you use? Can you show /var/log/xha.log (I think that is the name) please? It could be xen-ha that fences the box if the heartbeat cannot be written. You suggest it is CloudStack. Did you see anything in the mgt logs? Regards, Remi Sent from my iPhone > On