Keith,

r1470734 seems to have resolved things for me. Thanks for the fix.

On 04/22/2013 12:36 PM, Keith Turner wrote:
On Fri, Apr 19, 2013 at 9:47 PM, Josh Elser <[email protected]> wrote:

Thought about it more, and remembered about the JUnit temp dir. Found that
the two TServers both lost their ZK lock.

Perhaps the configuration is just a little too constrained?

Possibly.  I noticed the test did not have a timeout set, I added that.



On 04/19/2013 09:27 PM, Josh Elser wrote:

Is anyone else seeing this? I didn't have this happening early this week
(Wednesday, maybe?).

It gets stuck trying to get the Connector:

     at org.apache.accumulo.core.util.**UtilWaitThread.sleep(**
UtilWaitThread.java:26)
     at org.apache.accumulo.core.**client.impl.ServerClient.**
executeRaw(ServerClient.java:**112)
     at org.apache.accumulo.core.**client.impl.ServerClient.**
execute(ServerClient.java:71)
     at org.apache.accumulo.core.**client.impl.ConnectorImpl.<**
init>(ConnectorImpl.java:64)
     at org.apache.accumulo.core.**client.ZooKeeperInstance.**
getConnector(**ZooKeeperInstance.java:227)
     at org.apache.accumulo.core.**client.ZooKeeperInstance.**
getConnector(**ZooKeeperInstance.java:222)
     at org.apache.accumulo.test.**TestAccumuloSplitRecovery.**test(**
TestAccumuloSplitRecovery.**java:87)

The ZKMain was running, as was the Master; I'm not quite sure how to
debug it. I grabbed stacktraces from each process when it happened, and it
appears that I can reliably reproduce it (about 3/3 so far).



Reply via email to