Are you intending to use server groups to divide the cluster into different 
logical groupings?  Unless the groups host entirely separate data sets I could 
see an asymmetric response pattern depending if the primary was hosted on a 
member with a short / long timeout.

Anthony

> On Sep 6, 2017, at 7:27 AM, Aravind Musigumpula 
> <aravind.musigump...@amdocs.com> wrote:
> 
> Hi Udo,
> 
> For your question: "If you feel that the member timeout is too short for some 
> members, why don't you increase the current member timeout?"
> 
> Yes, for some members I feel that member-timeout is short. I want to increase 
> the timeout for some members. But that timeout is not being used to monitor 
> themselves but instead the increased member timeout may be used to monitor 
> some other member.
> 
> If I want some member to be alive for a little more time even if it is not 
> responding, Now I need to increase the timeout of all the members.
> 
> Do you mean to increase the current member timeout for all the members.
> 
> 
> Thanks,
> Aravind Musigumpula 
> 
> 
> -----Original Message-----
> From: Udo Kohlmeyer [mailto:ukohlme...@pivotal.io] 
> Sent: Friday, September 01, 2017 10:05 PM
> To: dev@geode.apache.org
> Subject: Re: DISCUSS : Monitor the neighbour JVM using neihbour's 
> member-timeout (GEODE-3411)
> 
> Hi there Aravind,
> 
> I have a singular problem with this approach.
> 
> If a some members are designated to do more work, and don't have time to 
> respond to the cluster that they are alive using the current member timeout, 
> then they are not available to accept data. Which means they are not 
> effective members of the cluster and we cannot count on them to host data or 
> replicates.
> 
> This setting is there to safe guard the cluster against non-responsive 
> members that cause the whole cluster to be unhealthy if left unchecked for 
> too long. This can lead to potential data loss....
> 
> If you feel that the member timeout is too short for some members, why don't 
> you increase the current member timeout?
> 
> My opinion is a -1 for changing the current behavior.
> 
> --Udo
> 
> On 9/1/17 03:46, Aravind Musigumpula wrote:
>> Hi Brian,
>> 
>> This will help if the user has some member doing a heavy duty when compared 
>> to others, in this case we need to give such member some extra time to that 
>> member.
>> 
>> Thanks,
>> Aravind Musigumpula
>> 
>> 
>> -----Original Message-----
>> From: Brian Baynes [mailto:bbay...@pivotal.io]
>> Sent: Friday, September 01, 2017 4:39 AM
>> To: dev@geode.apache.org
>> Subject: Re: DISCUSS : Monitor the neighbour JVM using neihbour's 
>> member-timeout (GEODE-3411)
>> 
>> Hi, Aravind.
>> 
>> Can you help me understand why this might be a useful feature for Geode?  I 
>> see that your needs require it, but why would users in general want to allow 
>> longer timeouts for some members?  This is a significant change with 
>> backward-compatibility implications, so would be good for the community to 
>> understand the potential benefit.
>> 
>> Thanks!
>> Brian
>> 
>> On Mon, Aug 28, 2017 at 12:20 AM, Aravind Musigumpula < 
>> aravind.musigump...@amdocs.com> wrote:
>> 
>>> Hi Team,
>>> 
>>> We have a requirement to configure  different member timeout for 
>>> different members as we need some members to survive in the view for 
>>> longer time than the other the members before being kicked out of the 
>>> view in case they aren't responding.
>>> 
>>> 
>>> 1.       Now with the current monitoring system it is not possible to
>>> determine when the member will be kicked out of the view if we 
>>> configure different member-timeout's for some required members.
>>> 
>>> 2.       Because if a member is not responding to any heartbeat requests,
>>> the member who is monitoring the non-responding member will initiate 
>>> check member request.
>>> 
>>> 3.       In this check member request monitoring member pings the
>>> non-responding member and waits for member-timeout of monitoring 
>>> member for a response.
>>> 
>>> 4.       If still there is no response, it will initiate a final suspect
>>> request to coordinator where the coordinator does the final check 
>>> waiting for coordinators member-timeout.
>>> 
>>> 5.       If coordinator did not get any response, it will remove the
>>> non-responding member from the view and publishes it.
>>> 
>>> 6.       So, Here the time period for removing a member depends on its
>>> monitoring member's and coordinator's timeout. But the monitoring 
>>> member depends on the view but it may change from time to time.
>>> 
>>> So, now when a monitoring-member doing the check on a member, if we 
>>> wait for the non-responding member's timeout instead of the 
>>> monitoring member-timeout, then the time when the non-responding 
>>> member will be removed from the view depends on its own 
>>> member-timeout and the coordinators member-timeout.
>>> Hence we can configure different member-timeout for the required members.
>>> 
>>> I created a pull request based on the above scenario:
>>> https://github.com/apache/geode/pull/717
>>> 
>>> Is the above approach correct? Do we have any concerns around this area?
>>> Please give your insights on this issue.
>>> 
>>> Thanks,
>>> Aravind Musigumpula
>>> 
>>> This message and the information contained herein is proprietary and 
>>> confidential and subject to the Amdocs policy statement,
>>> 
>>> you may review at https://www.amdocs.com/about/email-disclaimer < 
>>> https://www.amdocs.com/about/email-disclaimer>
>>> 
>> This message and the information contained herein is proprietary and 
>> confidential and subject to the Amdocs policy statement,
>> 
>> you may review at https://www.amdocs.com/about/email-disclaimer 
>> <https://www.amdocs.com/about/email-disclaimer>
> 
> This message and the information contained herein is proprietary and 
> confidential and subject to the Amdocs policy statement,
> 
> you may review at https://www.amdocs.com/about/email-disclaimer 
> <https://www.amdocs.com/about/email-disclaimer>

Reply via email to