git commit: [SPARK-1065] [PySpark] improve supporting for large broadcast

2014-08-16 Thread joshrosen
Repository: spark
Updated Branches: refs/heads/branch-1.1 721f2fdc9 -> 5dd571c29

[SPARK-1065] [PySpark] improve supporting for large broadcast

Passing large objects through py4j is very slow (and costs a lot of memory), so pass broadcast objects via files (similar to parallelize()). Add an option to keep o

git commit: [SPARK-1065] [PySpark] improve supporting for large broadcast

2014-08-16 Thread joshrosen
Repository: spark
Updated Branches: refs/heads/master 379e7585c -> 2fc8aca08

[SPARK-1065] [PySpark] improve supporting for large broadcast

Passing large objects through py4j is very slow (and costs a lot of memory), so pass broadcast objects via files (similar to parallelize()). Add an option to keep objec
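
The file-based hand-off described in the commit message can be illustrated with a minimal sketch. This is not the code from the commit: the helper names (dump_to_temp_file, load_from_file, roundtrip) are hypothetical, and the use of pickle with a temporary file is an assumption chosen only to show the general idea of writing a large object to disk once and passing the path around instead of the object itself.

```python
import os
import pickle
import tempfile


def dump_to_temp_file(obj, directory=None):
    """Serialize obj to a new temporary file and return its path.

    Hypothetical helper: only the (small) path would need to cross a
    slow bridge, not the serialized bytes.
    """
    fd, path = tempfile.mkstemp(suffix=".pkl", dir=directory)
    with os.fdopen(fd, "wb") as f:
        pickle.dump(obj, f, protocol=pickle.HIGHEST_PROTOCOL)
    return path


def load_from_file(path):
    """Read back an object previously written by dump_to_temp_file."""
    with open(path, "rb") as f:
        return pickle.load(f)


def roundtrip(obj):
    """Write obj to a temp file, read it back, and remove the file."""
    path = dump_to_temp_file(obj)
    try:
        return load_from_file(path)
    finally:
        os.remove(path)
```

In this sketch the receiving side only needs the file path, so the cost of the transfer channel no longer grows with the size of the object.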