Thanks for replying. Why is joining two VertexRDDs without caching slow? What is recomputed unnecessarily? I am not sure what is different here from joining two regular RDDs (where nobody seems to recommend caching before joining, I think...).
On Thu, Jul 3, 2014 at 10:52 PM, Ankur Dave <ankurd...@gmail.com> wrote:
> Oh, I just read your message more carefully and noticed that you're
> joining a regular RDD with a VertexRDD. In that case I'm not sure why the
> warning is occurring, but it might be worth caching both operands
> (graph.vertices and the regular RDD) just to be sure.
>
> Ankur <http://www.ankurdave.com/>