Re: [DISCUSS] Out of Band VR migration, should we reboot VR or not?

Jayapal Reddy Uradi Wed, 03 Jun 2015 23:15:55 -0700

In VR configuration persistence (4.6) only iptables rules are persisted ?
There are other configuration (interface ips, routes etc in VR will be lost on 
reboot) are these taken care ?



Thanks,
Jayapal


On 04-Jun-2015, at 11:15 AM, Koushik Das <koushik....@citrix.com>
 wrote:

> 
> On 04-Jun-2015, at 12:04 AM, Remi Bergsma <rberg...@schubergphilis.com> wrote:
> 
>> Hi all,
>> 
>> I just had a look at this more closely and had a chat with Ian about it. The 
>> only way for the original problem to happen (losing iptables rules) is if 
>> the live migrate would fail and the hypervisor rebooted the vm. The cause is 
>> the non-persistance of the router configuration, which is fixed in 4.6 by 
>> the way. I would say failing live migrations does not happen often (I have 
>> never seen it happening).
>> 
> 
> What about native HA in Vmware and HyperV? Say the original host has failed, 
> Vmware will bring up the VR on another host as part of native HA. In this 
> case also the configuration is lost.
> 
>> Anyway, once this happens to the router, it is stuck in a state where it 
>> does not have the linklocal configuration any more. Would CloudStack be able 
>> to issue a aggregate command while it cannot reach it? Rebooting might be 
>> the only way out after all. It’s just that rebooting by default in case of 
>> out-of-band migrations I’d say is a little bit too much. CloudStack would 
>> detect the problem anyway, as it cannot reach the linklocal anymore.
>> 
>> The interesting situation is that we have releases out there that now reboot 
>> by default.
>> 
>> My proposal to solve it in 4.4 and 4.5:
>> - Implement a setting ‘reboot systemvm when out-of-band migration detected’.
>> The default should be false and release notes should mention a changed 
>> behaviour from 4.4.3 and 4.5.1. To get the old behaviour, switch to true. A 
>> small group of people should be interested in this.
>> 
>> I guess this is the best of both worlds. Do you guys agree?
>> 
>> The other option I see is to revert the commit, as I think that serves most 
>> people.
>> 
>> Who is willing to help implement it?
>> 
>> Regards, Remi
>> 
>> 
>> On 03 Jun 2015, at 17:42, Rene Moser 
>> <m...@renemoser.net<mailto:m...@renemoser.net>> wrote:
>> 
>> Hi
>> 
>> On 03.06.2015 17:06, Ian Southam wrote:
>> If the machine crashes and/or rebooted during the oob migration by a party 
>> that is not the orchestrator, (read vCenter) then the rules will be lost.
>> 
>> I fully agree, a reboot due a failing live migration, would cause a
>> reboot. So what? Then we blame VMware, the orchestration will reboot the
>> VR and everything is fine. This would cause seconds of outage.
>> 
>> But then the missing persistence of the iptables would be the problem,
>> not the live migration task, right?
>> 
>> We should fix the persistence of the rules during reboot and not try to
>> be more clever then the hypervisor cluster orchestration.
>> 
>> Just my 2 cents.
>> 
>> 
>> 
>> 
>> 
>

Re: [DISCUSS] Out of Band VR migration, should we reboot VR or not?

Reply via email to