GitHub user pwendell opened a pull request:
https://github.com/apache/spark/pull/43
SPARK-1145: Memory mapping with many small blocks can cause JVM allocation
failures
This includes some minor code clean-up as well. The main change is that
small files are not memory mapped. There is a nicer way to write that code
block using Scala's `Try` but to make it easy to back port and as simple as
possible, I opted for the more explicit but less pretty format.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/pwendell/spark block-iter-logging
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/43.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #43
----
commit d238b885d6974283f3b96c26e26693074ff557e6
Author: Patrick Wendell <[email protected]>
Date: 2014-02-26T23:44:18Z
Some logging and clean-up
commit 4e1514e666624d2285b190ff1419a9fae2ae929f
Author: Patrick Wendell <[email protected]>
Date: 2014-02-27T02:34:08Z
Don't memory map for small files
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---