[ 
https://issues.apache.org/jira/browse/KUDU-2435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Grant Henke resolved KUDU-2435.
-------------------------------
    Fix Version/s: NA
       Resolution: Cannot Reproduce

> Consider non-fatal response to "Tried to update clock beyond the max. error"
> ----------------------------------------------------------------------------
>
>                 Key: KUDU-2435
>                 URL: https://issues.apache.org/jira/browse/KUDU-2435
>             Project: Kudu
>          Issue Type: Improvement
>          Components: server
>    Affects Versions: 1.7.0
>            Reporter: Mike Percy
>            Priority: Major
>             Fix For: NA
>
>
> Currently, when one server's clock is skewed and that server tries to 
> replicate to other servers in a cluster, it can cause the rest of the 
> servers in the cluster to crash with the following message:
> {code:java}
> F0428 05:27:23.480379 104613 raft_consensus.cc:1264] Check failed: _s.ok() Bad status: Invalid argument: Tried to update clock beyond the max. error.
> {code}
> We should consider alternative ways of handling this issue. One option is 
> for the replicas to reject requests that would cause this condition until 
> NTP has had a chance to correct the clock of the offending server. We should 
> also consider whether clock skew should be taken into account during leader 
> elections: if a candidate's clock is not within the max clock error of the 
> voter's clock, then maybe the vote should be withheld.
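
The two ideas in the description above (rejecting an update instead of crashing, and withholding a vote from a skewed candidate) can be sketched roughly as follows. This is a toy Python model, not Kudu's implementation: the `Replica` class, its method names, the `REJECTED_CLOCK_SKEW` string, and the `MAX_CLOCK_ERROR_US` value are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass

# Hypothetical bound on tolerated clock error, in microseconds.
# The value is illustrative only; it is not taken from Kudu.
MAX_CLOCK_ERROR_US = 10_000_000


@dataclass
class Replica:
    """Toy model of a replica; not Kudu's actual classes."""

    physical_now_us: int   # local physical clock reading
    clock_ts_us: int = 0   # latest timestamp this replica has accepted

    def handle_update(self, leader_ts_us: int) -> str:
        """Non-fatal handling: reject the request rather than abort the process."""
        if leader_ts_us - self.physical_now_us > MAX_CLOCK_ERROR_US:
            # The leader is too far ahead of the local clock. Refuse the update
            # so the leader can retry once its clock has been corrected.
            return "REJECTED_CLOCK_SKEW"
        self.clock_ts_us = max(self.clock_ts_us, leader_ts_us)
        return "OK"

    def grant_vote(self, candidate_ts_us: int) -> bool:
        """Withhold the vote if the candidate's clock is outside the max error."""
        return abs(candidate_ts_us - self.physical_now_us) <= MAX_CLOCK_ERROR_US
```

In this sketch a skewed leader only sees rejected requests, and a skewed candidate fails to collect votes, so the healthy servers keep running. Whether a rejected leader should step down or simply retry is left open here, as it is in the issue.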



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
