Hi all,

We have a Kerberized Accumulo 1.7.0 (HDP3) cluster with 25 tservers.
Recently, a couple of clients were reporting errrors writing data (fat
fingered from cluster, apologies for typos):

org.apache.accumulo.core.client.impl.TabletServerBatchWriter.checkForFailures
...
Caused by: 
org.apache.accumulo.core.client.impl.TabletServerBatchWriter$MutationWriter.sendMutationsToTabletServer

Digging into the logs on the problematic tservers, I think the
following was firing, but don't know why:

https://github.com/apache/thrift/blob/0.9.1/lib/java/src/org/apache/thrift/transport/TIOStreamTransport.java#L132

Also, the tserver logs report:

Internal error processing closeUpdate....TException: Commits are held

For now, I have stopped the two problematic tservers but any help
debugging would be much appreciated.

James

Reply via email to