Hi Sebastian,

I do not know what is happening, I am also having problems of jobs blocking
while waiting to setup the zookeeper instance.
We should look into this.

Best,
Claudio


On Mon, Jan 21, 2013 at 1:59 PM, Sebastian Schelter <s...@apache.org> wrote:

> Hi,
>
> I'm testing a custom PageRank implementation using trunk on Hadoop
> 1.0.4. I seem to run into a deadlock after the input superstep.
>
> The workers report "finishSuperstep: (all workers done) WORKER_ONLY -
> Attempt=0, Superstep=0" and the master reports that all workers are done
> with superstep -1.
>
> I reconstructed this using a local setup and seems to me that the
> BspServiceMaster hangs in the cleanUpZooKeeper method infinitely.
>
> I'm not using a dedicated zk instance, I just have Giraph start one. Any
> ideas what can be done to fix my problem?
>
> Best,
> Sebastian
>
>
> excerpt from jstack
>
> "org.apache.giraph.master.MasterThread" prio=10 tid=0x00007f29fc385000
> nid=0x29d1 waiting on condition [0x00007f2a09a5f000]
>    java.lang.Thread.State: TIMED_WAITING (parking)
>         at sun.misc.Unsafe.park(Native Method)
>         - parking to wait for  <0x00000000f38967d8> (a
> java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)
>         at
> java.util.concurrent.locks.LockSupport.parkNanos(LockSupport.java:198)
>         at
>
> java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.await(AbstractQueuedSynchronizer.java:2116)
>         at
> org.apache.giraph.zk.PredicateLock.waitMsecs(PredicateLock.java:112)
>         at
> org.apache.giraph.zk.PredicateLock.waitForever(PredicateLock.java:138)
>         at
>
> org.apache.giraph.master.BspServiceMaster.cleanUpZooKeeper(BspServiceMaster.java:1602)
>         at
>
> org.apache.giraph.master.BspServiceMaster.cleanup(BspServiceMaster.java:1692)
>         at org.apache.giraph.master.MasterThread.run(MasterThread.java:144)
>
>
>


-- 
   Claudio Martella
   claudio.marte...@gmail.com

Reply via email to