I tested this patch in our environment and it worked. Scalr ignored the
openstack error state.
Thanks for the fix, this was a big help.
On Wednesday, February 8, 2017 at 1:14:14 PM UTC-6, Enrico Kern wrote:
>
> ok nvm. i patched CloudPoller.php myself todo that. If anyone is
> interessted. just add a continue; from line 347 on in
> /opt/scalr-server/embedded/scalr/app/src/Scalr/System/Zmq/Cron/Task/CloudPoller.php
>
> example:
>
> // If openstack server is in ERROR state we need
> more details
> if ($openstackErrorState) {
> try {
> $info =
> $p->GetServerExtendedInformation($DBServer);
> $status = empty($info['Status']) ? false :
> $info['Status'];
> } catch (Exception $e) {}
> // added by Enrico. ignore error (i hope)
> $this->getLogger()->error("openstack error and
> i want to terminate it... oh wait i better let it alone");
> continue;
> }
>
>
> On Tuesday, February 7, 2017 at 8:16:46 PM UTC+1, Enrico Kern wrote:
>>
>> Hello all,
>>
>> was there something implemented already? we have the same issues with the
>> latest open source scalr. Sometimes on hypervisor reboot openstack reports
>> error state or cant boot up vms for whatever reasons and then scalr packs
>> the hammer and terminates the instance. Would love to only allow manual
>> termination in whatever state.
>>
>> On Monday, December 5, 2016 at 11:12:24 PM UTC+1, Marc O'Brien wrote:
>>>
>>> Hi Brandon,
>>>
>>> Thank you for the details. This helps us to better understand our
>>> customer's needs and prioritize future development efforts.
>>>
>>> By the way, welcome to the Scalr Open Source community! We're excited to
>>> have you join us. Please feel free to post any issues or questions you
>>> have moving forward and we will do our best to lend a hand.
>>>
>>> Cheers,
>>> Wm. Marc O'Brien
>>> Scalr Technical Support
>>>
>>> On Monday, December 5, 2016 at 12:34:25 PM UTC-7, Brandon Newport wrote:
>>>>
>>>> We had an incident a few weeks ago with a Farm that was deployed. It
>>>> lost its DHCP address for some reason and while we were troubleshooting,
>>>> we
>>>> attempted a live migration from one host to another in Openstack, when
>>>> this
>>>> happened it forced the instance into Error state, which terminated the
>>>> instance.
>>>>
>>>> Since then we have been testing Scalr actions by forcing instances into
>>>> error state in OpenStack.
>>>>
>>>> However when we were testing today we forced a Hypervisor to shut
>>>> down. Which did nothing to the Instance, until the Hypervisor was
>>>> powered
>>>> on, changed the Instance to shut off state. This forced the Replace.
>>>>
>>>> I hope this helped answer your question.
>>>>
>>>> Brandon
>>>>
>>>> On Monday, December 5, 2016 at 1:21:11 PM UTC-6, Marc O'Brien wrote:
>>>>>
>>>>> Hi Brandon,
>>>>>
>>>>> To clarify, the ignore on missing setting will not impact servers in
>>>>> Error state so this does not directly apply to this case as you stated
>>>>> these instances are in Error state. Could you provide some additional
>>>>> details on why you would not want to terminate servers in Error state?
>>>>> Do
>>>>> you have a specific use case for this?
>>>>>
>>>>> Many thanks,
>>>>> Wm. Marc O'Brien
>>>>> Scalr Technical Support
>>>>>
>>>>> On Monday, December 5, 2016 at 10:55:38 AM UTC-7, Marc O'Brien wrote:
>>>>>>
>>>>>> Hi Brandon,
>>>>>>
>>>>>> This is expected behavior as Scalr implements a Replace vs a Repair
>>>>>> methodology to resource management. You may wish to merge
>>>>>> "scalr.openstack.action_on_missing_server = ignore" in to your
>>>>>> scalr-server.rb
>>>>>> <https://scalr-wiki.atlassian.net/wiki/display/docs/Advanced+Configuration>
>>>>>>
>>>>>> to prevent Scalr from terminating instances that are marked as "Missing"
>>>>>> but there is not currently a method to completely prevent termination of
>>>>>> resources that enter an "Error" state. This is intentional, but we do
>>>>>> understand your perspective/situation and have considered this
>>>>>> previously.
>>>>>> We are exploring possible changes to this behavior in future versions of
>>>>>> Enterprise Scalr, but this is currently a fundamental behavior of our
>>>>>> Desired State Engine and is not something that can be disabled.
>>>>>>
>>>>>> Many thanks,
>>>>>> Wm. Marc O'Brien
>>>>>> Scalr Technical Support
>>>>>>
>>>>>>
>>>>>> On Monday, December 5, 2016 at 8:19:38 AM UTC-7, Brandon Newport
>>>>>> wrote:
>>>>>>>
>>>>>>> We are using this version of Scalr, Version 5.11.22.
>>>>>>>
>>>>>>> On Monday, December 5, 2016 at 9:18:37 AM UTC-6, Brandon Newport
>>>>>>> wrote:
>>>>>>>>
>>>>>>>> Is there a way to disable the automatic termination of an Instance
>>>>>>>> with Scaling function in the Farm Role settings.
>>>>>>>>
>>>>>>>> We currently have a Community version of Scalr deployed, and
>>>>>>>> connected to OpenStack. By default when an instance goes into ERROR
>>>>>>>> state
>>>>>>>> in openstack. Scalr will terminate the existing instance and deploy a
>>>>>>>> new
>>>>>>>> one. Is there a way to disable this or ignore the instance in error
>>>>>>>> state?
>>>>>>>> We have tested the manual Scaling, but it will still terminate the
>>>>>>>> instance and wait for you to redeploy the new instance.
>>>>>>>>
>>>>>>>> We are gradually working our way to using scaling the way it is
>>>>>>>> designed but we are just not there yet.
>>>>>>>>
>>>>>>>> Any help here would be appreciated.
>>>>>>>>
>>>>>>>> Brandon
>>>>>>>>
>>>>>>>
--
You received this message because you are subscribed to the Google Groups
"scalr-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
For more options, visit https://groups.google.com/d/optout.