from:"pauloangelo"

[GitHub] spark pull request: [SPARK-2251] fix concurrency issues in random ...

2015-10-11 Thread pauloangelo

Github user pauloangelo commented on the pull request: https://github.com/apache/spark/pull/1229#issuecomment-147266494 My RDD comes from a HBase table, which is growing. When I suspend the row insertion, the problem doesn't happen. The RDD is cached, should the pr

[GitHub] spark pull request: [SPARK-2251] fix concurrency issues in random ...

2015-10-11 Thread pauloangelo

Github user pauloangelo commented on the pull request: https://github.com/apache/spark/pull/1229#issuecomment-147232717 Hi all, I'm getting similar problem using kmeans clustering with Spark-1.5.1. The stacktrace is below. To overcome the problem, I'm sett