Keith,
r1470734 seems to have resolved things for me. Thanks for the fix.
On 04/22/2013 12:36 PM, Keith Turner wrote:
On Fri, Apr 19, 2013 at 9:47 PM, Josh Elser <[email protected]> wrote:
Thought about it more, and remembered about the JUnit temp dir. Found that
the two TServers both lost their ZK lock.
Perhaps the configuration is just a little too constrained?
Possibly. I noticed the test did not have a timeout set, I added that.
On 04/19/2013 09:27 PM, Josh Elser wrote:
Is anyone else seeing this? I didn't have this happening early this week
(Wednesday, maybe?).
It gets stuck trying to get the Connector:
at org.apache.accumulo.core.util.**UtilWaitThread.sleep(**
UtilWaitThread.java:26)
at org.apache.accumulo.core.**client.impl.ServerClient.**
executeRaw(ServerClient.java:**112)
at org.apache.accumulo.core.**client.impl.ServerClient.**
execute(ServerClient.java:71)
at org.apache.accumulo.core.**client.impl.ConnectorImpl.<**
init>(ConnectorImpl.java:64)
at org.apache.accumulo.core.**client.ZooKeeperInstance.**
getConnector(**ZooKeeperInstance.java:227)
at org.apache.accumulo.core.**client.ZooKeeperInstance.**
getConnector(**ZooKeeperInstance.java:222)
at org.apache.accumulo.test.**TestAccumuloSplitRecovery.**test(**
TestAccumuloSplitRecovery.**java:87)
The ZKMain was running, as was the Master; I'm not quite sure how to
debug it. I grabbed stacktraces from each process when it happened, and it
appears that I can reliably reproduce it (about 3/3 so far).