Hi, I am using GRAPHX in standalone SPARK 1.5.1 in a medium size cluster (64+1).
I could execute PageRank with large number of iterations on this cluster. However, when I run SSSP, it always fail at iteration 23 or 24. This is always at after about 11 mins. Note that PageRank takes more than that. *These are the messages in the log file:* 15/11/06 04:52:41 INFO Master: 172.31.27.133:59109 got disassociated, removing it. 15/11/06 04:52:41 INFO Master: Removing app app-20151106043327-0000 15/11/06 04:52:41 WARN ReliableDeliverySupervisor: Association with remote system [akka.tcp://sparkDriver@172.31.27.133:59109] has failed, address is now gated for [5000] ms. Reason is: [Disassociated]. I tried to increase akka and spark network timeout to 100 and 600, with no help. I appreciate any comment or suggestion ! -- Thanks, -Khaled