Can someone please suggest how I can move forward with this?
My spark version is 0.9.1.
The big challenge is that this issue is not reproduced when running in
local mode. What could be the difference?
I would really appreciate any pointers, as currently the job just hangs.
On 4/25/14, 7:37 PM, Yadid Ayzenberg wrote:
Some additional information - maybe this rings a bell with someone:
I suspect this happens when the lookup returns more than one value.
For keys with 0 or 1 values, the function behaves as you would expect.
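For context on what "more than one value" means here: Spark's `lookup(key)` on a pair RDD returns a list of *all* values associated with that key. Below is a minimal plain-Java sketch of that contract (it does not use Spark; the `lookup` helper and the sample data are hypothetical, just to illustrate the 0 / 1 / many-value cases the symptom seems to depend on):

```java
import java.util.*;
import java.util.stream.*;

public class LookupSketch {
    // Mimics the PairRDD.lookup contract over an in-memory list of pairs:
    // return every value whose key equals the requested key.
    static <K, V> List<V> lookup(List<Map.Entry<K, V>> pairs, K key) {
        return pairs.stream()
                    .filter(e -> e.getKey().equals(key))
                    .map(Map.Entry::getValue)
                    .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<Map.Entry<String, String>> pairs = List.of(
            Map.entry("a", "x"),
            Map.entry("a", "y"),   // duplicate key: lookup returns both values
            Map.entry("b", "z"));

        System.out.println(lookup(pairs, "a")); // many values -> [x, y]
        System.out.println(lookup(pairs, "b")); // one value   -> [z]
        System.out.println(lookup(pairs, "c")); // absent key  -> []
    }
}
```

If the hang only occurs in the many-value case, note that a multi-value result can be much larger when serialized back to the driver than a 0- or 1-value result.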
Anyone?
On 4/25/14, 1:55 PM, Yadid Ayzenberg wrote:
Hi All,
I'm running a lookup on a JavaPairRDD&lt;String, Tuple2&gt;.
When running on a local machine, the lookup is successful. However,
when running on a standalone cluster with the exact same dataset, one
of the tasks never ends (it stays in RUNNING status).
When viewing the worker log, it seems that the task has finished
successfully:
14/04/25 13:40:38 INFO BlockManager: Found block rdd_2_0 locally
14/04/25 13:40:38 INFO Executor: Serialized size of result for 2 is
10896794
14/04/25 13:40:38 INFO Executor: Sending result for 2 directly to driver
14/04/25 13:40:38 INFO Executor: Finished task ID 2
But it seems the driver is not aware of this, and hangs indefinitely.
If I execute a count prior to the lookup, I get the correct number,
which suggests that the cluster is operating as expected.
The exact same scenario works with a different type of key (Tuple2):
JavaPairRDD<Tuple2, Tuple2>.
Any ideas on how to debug this problem?
Thanks,
Yadid