spark git commit: [SPARK-10411] [SQL] Move visualization above explain output and hide explain by default

2015-09-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 62b4690d6 -> 0349b5b43 [SPARK-10411] [SQL] Move visualization above explain output and hide explain by default New screenshots after this fix: https://cloud.githubusercontent.com/assets/1000778/9625782/4b2dba36-518b-11e5-9104-c713ff026e3d

spark git commit: [SPARK-10411] [SQL] Move visualization above explain output and hide explain by default

2015-09-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.5 b846a9dc3 -> 94404ee53 [SPARK-10411] [SQL] Move visualization above explain output and hide explain by default New screenshots after this fix: https://cloud.githubusercontent.com/assets/1000778/9625782/4b2dba36-518b-11e5-9104-c713ff02

spark git commit: [SPARK-10379] preserve first page in UnsafeShuffleExternalSorter

2015-09-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 3ddb9b323 -> 62b4690d6 [SPARK-10379] preserve first page in UnsafeShuffleExternalSorter Author: Davies Liu Closes #8543 from davies/preserve_page. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apac

spark git commit: [SPARK-10379] preserve first page in UnsafeShuffleExternalSorter

2015-09-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.5 2fce5d880 -> b846a9dc3 [SPARK-10379] preserve first page in UnsafeShuffleExternalSorter Author: Davies Liu Closes #8543 from davies/preserve_page. (cherry picked from commit 62b4690d6b3016f41292b640ac28644ef31e299d) Signed-off-by: An

spark git commit: [SPARK-10247] [CORE] improve readability of a test case in DAGSchedulerSuite

2015-09-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master f6c447f87 -> 3ddb9b323 [SPARK-10247] [CORE] improve readability of a test case in DAGSchedulerSuite This is pretty minor, just trying to improve the readability of `DAGSchedulerSuite`, I figure every bit helps. Before whenever I read this

spark git commit: Removed code duplication in ShuffleBlockFetcherIterator

2015-09-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 0985d2c30 -> f6c447f87 Removed code duplication in ShuffleBlockFetcherIterator Added fetchUpToMaxBytes() to prevent having to update both code blocks when a change is made. Author: Evan Racah Closes #8514 from eracah/master. Project:

spark git commit: [SPARK-8707] RDD#toDebugString fails if any cached RDD has invalid partitions

2015-09-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 4bd85d06e -> 0985d2c30 [SPARK-8707] RDD#toDebugString fails if any cached RDD has invalid partitions Added numPartitions(evaluate: Boolean) to RDD. With "evaluate=true" the method is same with "partitions.length". With "evaluate=false", it

spark git commit: [SPARK-5945] Spark should not retry a stage infinitely on a FetchFailedException

2015-09-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 44948a2e9 -> 4bd85d06e [SPARK-5945] Spark should not retry a stage infinitely on a FetchFailedException The ```Stage``` class now tracks whether there were a sufficient number of consecutive failures of that stage to trigger an abort. To

spark git commit: [SPARK-9723] [ML] params getordefault should throw more useful error

2015-09-02 Thread meng
Repository: spark Updated Branches: refs/heads/master 03f3e91ff -> 44948a2e9 [SPARK-9723] [ML] params getordefault should throw more useful error Params.getOrDefault should throw a more meaningful exception than what you get from a bad key lookup. Author: Holden Karau Closes #8567 from ho

spark git commit: [SPARK-10422] [SQL] String column in InMemoryColumnarCache needs to override clone method

2015-09-02 Thread davies
Repository: spark Updated Branches: refs/heads/branch-1.5 30efa96af -> 2fce5d880 [SPARK-10422] [SQL] String column in InMemoryColumnarCache needs to override clone method https://issues.apache.org/jira/browse/SPARK-10422 Author: Yin Huai Closes #8578 from yhuai/SPARK-10422. (cherry picked

spark git commit: [SPARK-10422] [SQL] String column in InMemoryColumnarCache needs to override clone method

2015-09-02 Thread davies
Repository: spark Updated Branches: refs/heads/master 6cd98c187 -> 03f3e91ff [SPARK-10422] [SQL] String column in InMemoryColumnarCache needs to override clone method https://issues.apache.org/jira/browse/SPARK-10422 Author: Yin Huai Closes #8578 from yhuai/SPARK-10422. Project: http://g

spark git commit: [SPARK-10417] [SQL] Iterating through Column results in infinite loop

2015-09-02 Thread davies
Repository: spark Updated Branches: refs/heads/master 2da3a9e98 -> 6cd98c187 [SPARK-10417] [SQL] Iterating through Column results in infinite loop `pyspark.sql.column.Column` object has `__getitem__` method, which makes it iterable for Python. In fact it has `__getitem__` to address the case

spark git commit: [SPARK-10353] [MLLIB] (1.3 backport) BLAS gemm not scaling when beta = 0.0 for some subset of matrix multiplications

2015-09-02 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.3 a58c1afe8 -> 29836e284 [SPARK-10353] [MLLIB] (1.3 backport) BLAS gemm not scaling when beta = 0.0 for some subset of matrix multiplications Apply fixes for alpha, beta parameter handling in gemm/gemv from #8525 to branch 1.3 CC mengx

spark git commit: [SPARK-10004] [SHUFFLE] Perform auth checks when clients read shuffle data.

2015-09-02 Thread vanzin
Repository: spark Updated Branches: refs/heads/master fc4830779 -> 2da3a9e98 [SPARK-10004] [SHUFFLE] Perform auth checks when clients read shuffle data. To correctly isolate applications, when requests to read shuffle data arrive at the shuffle service, proper authorization checks need to be p

spark git commit: [SPARK-10389] [SQL] support order by non-attribute grouping expression on Aggregate

2015-09-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 56c4c172e -> fc4830779 [SPARK-10389] [SQL] support order by non-attribute grouping expression on Aggregate For example, we can write `SELECT MAX(value) FROM src GROUP BY key + 1 ORDER BY key + 1` in PostgreSQL, and we should support this

spark git commit: [SPARK-10034] [SQL] add regression test for Sort on Aggregate

2015-09-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master c3b881a7d -> 56c4c172e [SPARK-10034] [SQL] add regression test for Sort on Aggregate Before #8371, there was a bug for `Sort` on `Aggregate` that we can't use aggregate expressions named `_aggOrdering` and can't use more than one ordering

spark git commit: [SPARK-7336] [HISTORYSERVER] Fix bug that applications status incorrect on JobHistory UI.

2015-09-02 Thread vanzin
Repository: spark Updated Branches: refs/heads/master 00d9af5e1 -> c3b881a7d [SPARK-7336] [HISTORYSERVER] Fix bug that applications status incorrect on JobHistory UI. Author: ArcherShao Closes #5886 from ArcherShao/SPARK-7336. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Com