[ https://issues.apache.org/jira/browse/KUDU-2435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Grant Henke resolved KUDU-2435. ------------------------------- Fix Version/s: NA Resolution: Cannot Reproduce > Consider non-fatal response to "Tried to update clock beyond the max. error" > ---------------------------------------------------------------------------- > > Key: KUDU-2435 > URL: https://issues.apache.org/jira/browse/KUDU-2435 > Project: Kudu > Issue Type: Improvement > Components: server > Affects Versions: 1.7.0 > Reporter: Mike Percy > Priority: Major > Fix For: NA > > > Currently when one server is skewed, and it tries to replicate to other > servers in a cluster, it can cause the rest of the servers in the cluster to > crash with the following message: > {code:java} > F0428 05:27:23.480379 104613 raft_consensus.cc:1264] Check failed: _s.ok() > Bad status: Invalid argument: Tried to update clock beyond the max. > error.{code} > We should consider alternative ways of handling this issue. Maybe the > replicas can reject requests that would cause this condition until NTP has a > chance to correct the clock of the offending server. We should also consider > whether clock skew should be taken into account when doing leader > elections... if a server is not within the max clock error of the voter then > maybe the vote should be withheld. -- This message was sent by Atlassian Jira (v8.3.4#803005)