spark git commit: [SPARK-19734][PYTHON][ML] Correct OneHotEncoder doc string to say dropLast

2017-03-01 Thread yliang
Repository: spark Updated Branches: refs/heads/master 3bd8ddf7c -> d2a879762 [SPARK-19734][PYTHON][ML] Correct OneHotEncoder doc string to say dropLast ## What changes were proposed in this pull request? Updates the doc string to match up with the code i.e. say dropLast instead of

spark git commit: [MINOR][ML] Fix comments in LSH Examples and Python API

2017-03-01 Thread yliang
Repository: spark Updated Branches: refs/heads/master de2b53df4 -> 3bd8ddf7c [MINOR][ML] Fix comments in LSH Examples and Python API ## What changes were proposed in this pull request? Remove `org.apache.spark.examples.` in Add slash in one of the python doc. ## How was this patch tested?

spark git commit: [SPARK-19583][SQL] CTAS for data source table with a created location should succeed

2017-03-01 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 89990a010 -> de2b53df4 [SPARK-19583][SQL] CTAS for data source table with a created location should succeed ## What changes were proposed in this pull request? ``` spark.sql( s""" |CREATE TABLE t

spark git commit: [SPARK-13931] Stage can hang if an executor fails while speculated tasks are running

2017-03-01 Thread kayousterhout
Repository: spark Updated Branches: refs/heads/master 51be63365 -> 89990a010 [SPARK-13931] Stage can hang if an executor fails while speculated tasks are running ## What changes were proposed in this pull request? When function 'executorLost' is invoked in class 'TaskSetManager', it's

spark git commit: [SPARK-19777] Scan runningTasksSet when check speculatable tasks in TaskSetManager.

2017-03-01 Thread kayousterhout
Repository: spark Updated Branches: refs/heads/master db0ddce52 -> 51be63365 [SPARK-19777] Scan runningTasksSet when check speculatable tasks in TaskSetManager. ## What changes were proposed in this pull request? When check speculatable tasks in `TaskSetManager`, only scan `runningTasksSet`

spark git commit: [SPARK-19775][SQL] Remove an obsolete `partitionBy().insertInto()` test case

2017-03-01 Thread srowen
Repository: spark Updated Branches: refs/heads/master 2ff1467d6 -> db0ddce52 [SPARK-19775][SQL] Remove an obsolete `partitionBy().insertInto()` test case ## What changes were proposed in this pull request? This issue removes [a test

spark git commit: [SPARK-19373][MESOS] Base spark.scheduler.minRegisteredResourceRatio …

2017-03-01 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.1 bbe0d8caa -> 27347b5f2 [SPARK-19373][MESOS] Base spark.scheduler.minRegisteredResourceRatio … …on registered cores rather than accepted cores See JIRA Unit tests, Mesos/Spark integration tests cc skonto susanxhuynh Author:

spark git commit: [DOC][MINOR][SPARKR] Update SparkR doc for names, columns and colnames

2017-03-01 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 417140e44 -> 2ff1467d6 [DOC][MINOR][SPARKR] Update SparkR doc for names, columns and colnames Update R doc: 1. columns, names and colnames returns a vector of strings, not **list** as in current doc. 2. `colnames<-` does allow the subset

spark git commit: [SPARK-19787][ML] Changing the default parameter of regParam.

2017-03-01 Thread mlnick
Repository: spark Updated Branches: refs/heads/master 8aa560b75 -> 417140e44 [SPARK-19787][ML] Changing the default parameter of regParam. ## What changes were proposed in this pull request? In the ALS method the default values of regParam do not match within the same file (lines

spark git commit: [SPARK-19761][SQL] create InMemoryFileIndex with an empty rootPaths when set PARALLEL_PARTITION_DISCOVERY_THRESHOLD to zero failed

2017-03-01 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 5502a9cf8 -> 8aa560b75 [SPARK-19761][SQL] create InMemoryFileIndex with an empty rootPaths when set PARALLEL_PARTITION_DISCOVERY_THRESHOLD to zero failed ## What changes were proposed in this pull request? If we create a

spark git commit: [SPARK-19766][SQL] Constant alias columns in INNER JOIN should not be folded by FoldablePropagation rule

2017-03-01 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.1 f719cccdc -> bbe0d8caa [SPARK-19766][SQL] Constant alias columns in INNER JOIN should not be folded by FoldablePropagation rule ## What changes were proposed in this pull request? This PR fixes the code in Optimizer phase where the

spark git commit: [SPARK-19766][SQL] Constant alias columns in INNER JOIN should not be folded by FoldablePropagation rule

2017-03-01 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 38e783534 -> 5502a9cf8 [SPARK-19766][SQL] Constant alias columns in INNER JOIN should not be folded by FoldablePropagation rule ## What changes were proposed in this pull request? This PR fixes the code in Optimizer phase where the

spark git commit: [SPARK-19736][SQL] refreshByPath should clear all cached plans with the specified path

2017-03-01 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 4913c92c2 -> 38e783534 [SPARK-19736][SQL] refreshByPath should clear all cached plans with the specified path ## What changes were proposed in this pull request? `Catalog.refreshByPath` can refresh the cache entry and the associated