Hi I’m trying to run Analytics cc (ConnectedComponents) but it is running without ending. Logs are fine, but I just keep getting Job xyz finished, reduce took some time:
... INFO DAGScheduler: Job 29 finished: reduce at VertexRDDImpl.scala:90, took 14.828033 s INFO DAGScheduler: Job 30 finished: reduce at VertexRDDImpl.scala:90, took 15.341294 s .. .. INFO TaskSetManager: Finished task 299.0 in stage 53059.0 (TID 88025) in 81 ms on 172.16.99.22 (195/480) INFO TaskSetManager: Starting task 47.0 in stage 53059.0 (TID 88075, 172.16.99.31, PROCESS_LOCAL, 5367 bytes) .. I am using Spark 1.5 standalone and input graph http://snap.stanford.edu/data/web-BerkStan.html <http://snap.stanford.edu/data/web-BerkStan.html> It seems there is no convergence, can you help me understand what is wrong in your example? Thanks Best, Ovidiu