Hi,

I am using GraphX on standalone Spark 1.5.1 on a medium-sized cluster (64+1).

I can run PageRank with a large number of iterations on this cluster.
However, when I run SSSP, it always fails at iteration 23 or 24, and always
after about 11 minutes. Note that the PageRank runs take longer than that.

*These are the messages in the log file:*

15/11/06 04:52:41 INFO Master: 172.31.27.133:59109 got disassociated, removing it.
15/11/06 04:52:41 INFO Master: Removing app app-20151106043327-0000
15/11/06 04:52:41 WARN ReliableDeliverySupervisor: Association with remote system [akka.tcp://sparkDriver@172.31.27.133:59109] has failed, address is now gated for [5000] ms. Reason is: [Disassociated].


I tried increasing the Akka and Spark network timeouts to 100 and 600,
respectively, but it did not help.
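
For reference, a minimal sketch of how such timeouts are typically set in
application code. It assumes the two settings meant above are
`spark.akka.timeout` and `spark.network.timeout`, and that the values are in
seconds; the object name and application name are hypothetical, and the
master URL is assumed to be supplied by spark-submit.

```scala
import org.apache.spark.{SparkConf, SparkContext}

// Hypothetical wrapper object; only the two .set(...) calls matter here.
object TimeoutConfigSketch {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf()
      .setAppName("sssp-timeout-sketch")    // hypothetical application name
      .set("spark.akka.timeout", "100")     // assumed setting; seconds in Spark 1.x
      .set("spark.network.timeout", "600")  // assumed setting; seconds

    val sc = new SparkContext(conf)
    try {
      // The GraphX job (e.g. the SSSP run) would go here.
    } finally {
      sc.stop()
    }
  }
}
```

The same properties can also be passed on the command line with
`spark-submit --conf key=value` instead of being hard-coded.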

I would appreciate any comments or suggestions!

-- 
Thanks,
-Khaled
