Yu Li created HBASE-14521: ----------------------------- Summary: Uniform the semantic of hbase.client.retries.number Key: HBASE-14521 URL: https://issues.apache.org/jira/browse/HBASE-14521 Project: HBase Issue Type: Bug Affects Versions: 1.1.2, 0.98.14 Reporter: Yu Li Assignee: Yu Li
>From name of the _hbase.client.retries.number_ property, it should be the >number of maximum *retries*, or say if we set the property to 1, there should >be 2 attempts in total. However, there're two different semantics when using >it in current code base. For example, in ConnectionImplementation#locateRegionInMeta: {code} int localNumRetries = (retry ? numTries : 1); for (int tries = 0; true; tries++) { if (tries >= localNumRetries) { throw new NoServerForRegionException("Unable to find region for " + Bytes.toStringBinary(row) + " in " + tableName + " after " + numTries + " tries."); } {code} the retries number is regarded as max times for *tries* While in RpcRetryingCallerImpl#callWithRetries: {code} for (int tries = 0;; tries++) { long expectedSleep; try { callable.prepare(tries != 0); // if called with false, check table status on ZK interceptor.intercept(context.prepare(callable, tries)); return callable.call(getRemainingTime(callTimeout)); } catch (PreemptiveFastFailException e) { throw e; } catch (Throwable t) { ... if (tries >= retries - 1) { throw new RetriesExhaustedException(tries, exceptions); } {code} it's regarded as exactly for *REtry* (try a call first with no condition and then check whether to retry or exceeds maximum retry number) This inconsistency will cause misunderstanding in usage, such as one of our customer set the property to zero expecting one single call but finally received NoServerForRegionException. We should uniform the semantic of the property, and I suggest to keep the original one for retry rather than total tries. -- This message was sent by Atlassian JIRA (v6.3.4#6332)