Hi All, I am trying to run RowMatrix.similarity(0.5) on 60K users (n) with 130k features (m) on spark 1.3.0. Using 4 m3.2xlarge 30GB RAM and 8 cores but getting lots of ERROR YarnScheduler: Lost executor 1 on XXX.internal: remote Akka client disassociate
What could be the reason? Is it shuffle memory that I should increase? Thank You Parin Choganwala