GitHub user liancheng opened a pull request: https://github.com/apache/spark/pull/96
[SPARK-1194] Fix the same-RDD rule for cache replacement SPARK-1194: https://spark-project.atlassian.net/browse/SPARK-1194 In the current implementation, when selecting candidate blocks to be swapped out, once we find a block from the same RDD that the block to be stored belongs to, cache eviction fails and aborts. In this PR, we keep selecting blocks *not* from the RDD that the block to be stored belongs to until either enough free space can be ensured (cache eviction succeeds) or all such blocks are checked (cache eviction fails). You can merge this pull request into a Git repository by running: $ git pull https://github.com/liancheng/spark fix-spark-1194 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/spark/pull/96.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #96 ---- commit 62c92ac7b8e616529bdaa52b73eb70e50bc01b47 Author: Cheng Lian <lian.cs....@gmail.com> Date: 2014-03-07T08:32:47Z Fixed SPARK-1194 https://spark-project.atlassian.net/browse/SPARK-1194 ---- --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---