spark git commit: [SPARK-12692][BUILD] Enforce style checking about white space before comma

2016-01-13 Thread rxin
Repository: spark Updated Branches: refs/heads/master cb7b864a2 -> 3d81d63f4 [SPARK-12692][BUILD] Enforce style checking about white space before comma This is the final PR about SPARK-12692. We have removed all white space before commas from the code, so let's enforce style checking. Author:

spark git commit: [SPARK-9297] [SQL] Add covar_pop and covar_samp

2016-01-13 Thread davies
Repository: spark Updated Branches: refs/heads/master d6fd9b376 -> 63eee86cc [SPARK-9297] [SQL] Add covar_pop and covar_samp JIRA: https://issues.apache.org/jira/browse/SPARK-9297 Add two aggregation functions: covar_pop and covar_samp. Author: Liang-Chi Hsieh Author:
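
For readers unfamiliar with the two aggregates named above, here is a minimal sketch of the textbook definitions of population and sample covariance. It is not the Spark implementation (which is not shown in this message); the function names simply mirror the SQL names, and returning `None` for too few rows is an assumption made for the illustration.

```python
def covar_pop(xs, ys):
    """Population covariance: mean of the products of deviations (divide by n)."""
    n = len(xs)
    if n == 0 or n != len(ys):
        return None  # assumption: no result for empty or mismatched input
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / n


def covar_samp(xs, ys):
    """Sample covariance: same sum of products, divided by n - 1."""
    n = len(xs)
    if n < 2 or n != len(ys):
        return None  # assumption: undefined for fewer than two rows
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)
```

The only difference between the two is the divisor, so `covar_samp` is always larger in magnitude than `covar_pop` on the same data.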

spark git commit: [SPARK-9383][PROJECT-INFRA] PR merge script should reset back to previous branch when possible

2016-01-13 Thread joshrosen
Repository: spark Updated Branches: refs/heads/master 38148f737 -> 97e0c7c5a [SPARK-9383][PROJECT-INFRA] PR merge script should reset back to previous branch when possible This patch modifies our PR merge script to reset back to a named branch when restoring the original checkout upon exit.
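
The idea of restoring a named branch rather than a bare commit can be sketched as follows. This is an illustration only, not the code of the merge script: the helper names `git`, `pick_restore_target` and `current_restore_target` are hypothetical. It relies on the fact that `git rev-parse --abbrev-ref HEAD` prints the branch name, or the literal string `HEAD` when the checkout is detached.

```python
import subprocess


def git(*args):
    """Run a git command and return its trimmed stdout (hypothetical helper)."""
    return subprocess.check_output(["git", *args], text=True).strip()


def pick_restore_target(abbrev_ref, commit_hash):
    """Prefer the branch name; fall back to the hash on a detached HEAD."""
    return commit_hash if abbrev_ref == "HEAD" else abbrev_ref


def current_restore_target():
    """Record, before merging, what to check out again on exit."""
    return pick_restore_target(
        git("rev-parse", "--abbrev-ref", "HEAD"),
        git("rev-parse", "HEAD"),
    )
```

Checking out the branch name on exit leaves the user on the branch they started from, whereas checking out the hash would leave them in a detached-HEAD state.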

spark git commit: [SPARK-12268][PYSPARK] Make pyspark shell pythonstartup work under python3

2016-01-13 Thread joshrosen
Repository: spark Updated Branches: refs/heads/master 97e0c7c5a -> e4e0b3f7b [SPARK-12268][PYSPARK] Make pyspark shell pythonstartup work under python3 This replaces the `execfile` used for running custom python shell scripts with explicit open, compile and exec (as recommended by 2to3). The
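
The open, compile and exec pattern mentioned above looks roughly like this in Python 3, where `execfile` no longer exists. This is a minimal sketch rather than the patched shell script itself; the function name `run_startup_file` and its `namespace` parameter are assumptions made for the illustration.

```python
def run_startup_file(path, namespace):
    """Python 3 replacement for execfile(path, namespace)."""
    with open(path) as f:
        # Passing the path to compile() keeps file names in tracebacks.
        code = compile(f.read(), path, "exec")
    exec(code, namespace)
```

A shell would typically call this with its own global namespace, so that names defined by the startup file are visible at the prompt.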

spark git commit: [SPARK-12268][PYSPARK] Make pyspark shell pythonstartup work under python3

2016-01-13 Thread joshrosen
Repository: spark Updated Branches: refs/heads/branch-1.6 364f799cf -> cf6d506c7 [SPARK-12268][PYSPARK] Make pyspark shell pythonstartup work under python3 This replaces the `execfile` used for running custom python shell scripts with explicit open, compile and exec (as recommended by 2to3).

spark git commit: [SPARK-12685][MLLIB][BACKPORT TO 1.4] word2vec trainWordsCount gets overflow

2016-01-13 Thread jkbradley
Repository: spark Updated Branches: refs/heads/branch-1.6 f9ecd3a39 -> 364f799cf [SPARK-12685][MLLIB][BACKPORT TO 1.4] word2vec trainWordsCount gets overflow jira: https://issues.apache.org/jira/browse/SPARK-12685 master PR: https://github.com/apache/spark/pull/10627 the log of word2vec
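
To show what kind of overflow the title refers to, the sketch below emulates a 32-bit signed counter in Python (whose own integers never overflow). It assumes the word counter was held in a 32-bit JVM `Int`; the message preview does not show the actual field or the fix, and the helper names are hypothetical.

```python
def to_int32(value):
    """Wrap an integer to the range of a 32-bit signed JVM Int."""
    value &= 0xFFFFFFFF
    return value - 0x100000000 if value >= 0x80000000 else value


def sum_counts_int32(counts):
    """Sum word counts as a 32-bit counter would, wrapping on overflow."""
    total = 0
    for count in counts:
        total = to_int32(total + count)
    return total


def sum_counts_wide(counts):
    """Sum the same counts without wrapping, as a 64-bit counter would here."""
    return sum(counts)
```

Once the running total passes 2**31 - 1 the 32-bit version turns negative, which is the symptom a wider counter type avoids.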

spark git commit: [SPARK-12761][CORE] Remove duplicated code

2016-01-13 Thread rxin
Repository: spark Updated Branches: refs/heads/master cc91e2187 -> 38148f737 [SPARK-12761][CORE] Remove duplicated code Removes some duplicated code that was reintroduced during a merge. Author: Jakob Odersky Closes #10711 from jodersky/repl-2.11-duplicate. Project:

spark git commit: [SPARK-12805][MESOS] Fixes documentation on Mesos run modes

2016-01-13 Thread rxin
Repository: spark Updated Branches: refs/heads/master 63eee86cc -> cc91e2187 [SPARK-12805][MESOS] Fixes documentation on Mesos run modes The default run has changed, but the documentation didn't fully reflect the change. Author: Luc Bourlier Closes #10740 from

spark git commit: [SPARK-12791][SQL] Simplify CaseWhen by breaking "branches" into "conditions" and "values"

2016-01-13 Thread rxin
Repository: spark Updated Branches: refs/heads/master c2ea79f96 -> cbbcd8e42 [SPARK-12791][SQL] Simplify CaseWhen by breaking "branches" into "conditions" and "values" This pull request rewrites the CaseWhen expression to break the single, monolithic "branches" field into a sequence of tuples
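
The restructuring described above can be pictured with a small sketch. It assumes the old "branches" field was a flat list that interleaves conditions and values, with an optional trailing else value; that layout is not spelled out in this truncated message, and `split_branches` is a hypothetical helper, not part of Spark.

```python
def split_branches(branches):
    """Split a flat [cond1, val1, cond2, val2, ..., else?] list.

    Returns (pairs, else_value), where pairs is a list of
    (condition, value) tuples and else_value is None when absent.
    """
    # zip stops at the shorter slice, so a trailing else is left out of pairs.
    pairs = list(zip(branches[0::2], branches[1::2]))
    else_value = branches[-1] if len(branches) % 2 == 1 else None
    return pairs, else_value
```

With explicit pairs, code that walks the expression no longer needs index arithmetic to tell a condition from a value or to detect the else branch.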

spark git commit: [SPARK-12685][MLLIB][BACKPORT TO 1.4] word2vec trainWordsCount gets overflow

2016-01-13 Thread jkbradley
Repository: spark Updated Branches: refs/heads/branch-1.4 0832530e8 -> 7bd256419 [SPARK-12685][MLLIB][BACKPORT TO 1.4] word2vec trainWordsCount gets overflow jira: https://issues.apache.org/jira/browse/SPARK-12685 master PR: https://github.com/apache/spark/pull/10627 the log of word2vec

spark git commit: [SPARK-12685][MLLIB][BACKPORT TO 1.4] word2vec trainWordsCount gets overflow

2016-01-13 Thread jkbradley
Repository: spark Updated Branches: refs/heads/branch-1.5 253e3eb71 -> 7fdd7cf09 [SPARK-12685][MLLIB][BACKPORT TO 1.4] word2vec trainWordsCount gets overflow jira: https://issues.apache.org/jira/browse/SPARK-12685 master PR: https://github.com/apache/spark/pull/10627 the log of word2vec

spark git commit: [SPARK-12642][SQL] improve the hash expression to be decoupled from unsafe row

2016-01-13 Thread rxin
Repository: spark Updated Branches: refs/heads/master e4e0b3f7b -> c2ea79f96 [SPARK-12642][SQL] improve the hash expression to be decoupled from unsafe row https://issues.apache.org/jira/browse/SPARK-12642 Author: Wenchen Fan Closes #10694 from cloud-fan/hash-expr.

[spark] Git Push Summary

2016-01-13 Thread rxin
Repository: spark Updated Tags: refs/tags/v1.6.0-preview [deleted] 31db36100 - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org For additional commands, e-mail: commits-h...@spark.apache.org

[spark] Git Push Summary

2016-01-13 Thread rxin
Repository: spark Updated Tags: refs/tags/v1.6-snapshot0-test [deleted] 609d6e87a refs/tags/v1.6.0-preview1 [deleted] f8369412d refs/tags/v1.6.0-preview2 [deleted] 308381420

[spark] Git Push Summary

2016-01-13 Thread rxin
Repository: spark Updated Tags: refs/tags/v1.6.0-rc1 [deleted] bf525845c

[spark] Git Push Summary

2016-01-13 Thread rxin
Repository: spark Updated Tags: refs/tags/v1.6.0-rc3 [deleted] 168c89e07

spark git commit: [SPARK-12756][SQL] use hash expression in Exchange

2016-01-13 Thread rxin
Repository: spark Updated Branches: refs/heads/master e2ae7bd04 -> 962e9bcf9 [SPARK-12756][SQL] use hash expression in Exchange This PR makes bucketing and exchange share one common hash algorithm, so that we can guarantee the data distribution is the same between shuffle and bucketed data
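
Why a shared hash gives the same distribution can be seen in a toy sketch. Here `zlib.crc32` is only a deterministic stand-in for whatever hash expression Spark uses, and the function names are hypothetical; the point is that both code paths route a key through the same function.

```python
import zlib


def partition_id(key, num_partitions):
    """Map a key to a partition using one shared, deterministic hash."""
    h = zlib.crc32(repr(key).encode("utf-8"))  # stand-in hash, an assumption
    return h % num_partitions


def shuffle_partition(key, num_partitions):
    """Partition chosen when rows are exchanged between stages."""
    return partition_id(key, num_partitions)


def bucket_id(key, num_buckets):
    """Bucket chosen when rows are written as bucketed data."""
    return partition_id(key, num_buckets)
```

Because both paths agree on where each key lands, data that is already bucketed on a key would not need to be moved again to match a shuffle on that key with the same partition count.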

spark git commit: [SPARK-12707][SPARK SUBMIT] Remove submit python/R scripts through py…

2016-01-13 Thread rxin
Repository: spark Updated Branches: refs/heads/master 962e9bcf9 -> 8f13cd4cc [SPARK-12707][SPARK SUBMIT] Remove submit python/R scripts through py… …spark/sparkR Author: Jeff Zhang Closes #10658 from zjffdu/SPARK-12707. Project:

[spark] Git Push Summary

2016-01-13 Thread rxin
Repository: spark Updated Tags: refs/tags/v1.6.0-rc2 [deleted] 23f8dfd45

spark git commit: [SPARK-12690][CORE] Fix NPE in UnsafeInMemorySorter.free()

2016-01-13 Thread joshrosen
Repository: spark Updated Branches: refs/heads/branch-1.6 cf6d506c7 -> 26f13faa9 [SPARK-12690][CORE] Fix NPE in UnsafeInMemorySorter.free() I hit the exception below. The `UnsafeKVExternalSorter` does pass `null` as the consumer when creating an `UnsafeInMemorySorter`. Normally the NPE

spark git commit: [SPARK-12690][CORE] Fix NPE in UnsafeInMemorySorter.free()

2016-01-13 Thread joshrosen
Repository: spark Updated Branches: refs/heads/master cbbcd8e42 -> eabc7b8ee [SPARK-12690][CORE] Fix NPE in UnsafeInMemorySorter.free() I hit the exception below. The `UnsafeKVExternalSorter` does pass `null` as the consumer when creating an `UnsafeInMemorySorter`. Normally the NPE doesn't

spark git commit: [SPARK-12400][SHUFFLE] Avoid generating temp shuffle files for empty partitions

2016-01-13 Thread joshrosen
Repository: spark Updated Branches: refs/heads/master eabc7b8ee -> cd81fc9e8 [SPARK-12400][SHUFFLE] Avoid generating temp shuffle files for empty partitions The problem lies in `BypassMergeSortShuffleWriter`: an empty partition also generates a temp shuffle file of several bytes. So here

spark git commit: [SPARK-12026][MLLIB] ChiSqTest gets slower and slower over time when number of features is large

2016-01-13 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master cd81fc9e8 -> 021dafc6a [SPARK-12026][MLLIB] ChiSqTest gets slower and slower over time when number of features is large jira: https://issues.apache.org/jira/browse/SPARK-12026 The issue is valid as

spark git commit: [SPARK-12026][MLLIB] ChiSqTest gets slower and slower over time when number of features is large

2016-01-13 Thread jkbradley
Repository: spark Updated Branches: refs/heads/branch-1.6 26f13faa9 -> a490787da [SPARK-12026][MLLIB] ChiSqTest gets slower and slower over time when number of features is large jira: https://issues.apache.org/jira/browse/SPARK-12026 The issue is valid as

spark git commit: [SPARK-12703][MLLIB][DOC][PYTHON] Fixed pyspark.mllib.clustering.KMeans user guide example

2016-01-13 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master 021dafc6a -> 20d8ef858 [SPARK-12703][MLLIB][DOC][PYTHON] Fixed pyspark.mllib.clustering.KMeans user guide example Fixed WSSSE computeCost in Python mllib KMeans user guide example by using new computeCost method API in Python. Author:

spark git commit: [SPARK-12819] Deprecate TaskContext.isRunningLocally()

2016-01-13 Thread rxin
Repository: spark Updated Branches: refs/heads/master 20d8ef858 -> e2ae7bd04 [SPARK-12819] Deprecate TaskContext.isRunningLocally() We've already removed local execution but didn't deprecate `TaskContext.isRunningLocally()`; we should deprecate it for 2.0. Author: Josh Rosen

spark git commit: [SPARK-12692][BUILD][HOT-FIX] Fix the scala style of KinesisBackedBlockRDDSuite.scala.

2016-01-13 Thread yhuai
Repository: spark Updated Branches: refs/heads/master 3d81d63f4 -> d6fd9b376 [SPARK-12692][BUILD][HOT-FIX] Fix the scala style of KinesisBackedBlockRDDSuite.scala. https://github.com/apache/spark/pull/10736 was merged yesterday and caused master to start failing because of the style issue.