spark git commit: [SPARK-16104] [SQL] Do not creaate CSV writer object for every flush when writing

2016-06-21 Thread davies
Repository: spark Updated Branches: refs/heads/master d77c4e6e2 -> 7580f3041 [SPARK-16104] [SQL] Do not creaate CSV writer object for every flush when writing ## What changes were proposed in this pull request? This PR let `CsvWriter` object is not created for each time but able to be

spark git commit: [MINOR][MLLIB] deprecate setLabelCol in ChiSqSelectorModel

2016-06-21 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master 79aa1d82c -> d77c4e6e2 [MINOR][MLLIB] deprecate setLabelCol in ChiSqSelectorModel ## What changes were proposed in this pull request? Deprecate `labelCol`, which is not used by ChiSqSelectorModel. Author: Xiangrui Meng

spark git commit: [MINOR][MLLIB] deprecate setLabelCol in ChiSqSelectorModel

2016-06-21 Thread jkbradley
Repository: spark Updated Branches: refs/heads/branch-2.0 77d8226df -> f3a2ebe0b [MINOR][MLLIB] deprecate setLabelCol in ChiSqSelectorModel ## What changes were proposed in this pull request? Deprecate `labelCol`, which is not used by ChiSqSelectorModel. Author: Xiangrui Meng

spark git commit: [SQL][DOC] SQL programming guide add deprecated methods in 2.0.0

2016-06-21 Thread lian
Repository: spark Updated Branches: refs/heads/branch-2.0 5a4fce456 -> 77d8226df [SQL][DOC] SQL programming guide add deprecated methods in 2.0.0 ## What changes were proposed in this pull request? Doc changes ## How was this patch tested? manual liancheng Author: Felix Cheung

spark git commit: [SQL][DOC] SQL programming guide add deprecated methods in 2.0.0

2016-06-21 Thread lian
Repository: spark Updated Branches: refs/heads/master 9493b079a -> 79aa1d82c [SQL][DOC] SQL programming guide add deprecated methods in 2.0.0 ## What changes were proposed in this pull request? Doc changes ## How was this patch tested? manual liancheng Author: Felix Cheung

[2/2] spark git commit: Preparing development version 2.0.1-SNAPSHOT

2016-06-21 Thread pwendell
Preparing development version 2.0.1-SNAPSHOT Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/5a4fce45 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/5a4fce45 Diff:

[1/2] spark git commit: Preparing Spark release v2.0.0-rc1

2016-06-21 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-2.0 fe41f68fd -> 5a4fce456 Preparing Spark release v2.0.0-rc1 Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/0c66ca41 Tree:

[spark] Git Push Summary

2016-06-21 Thread pwendell
Repository: spark Updated Tags: refs/tags/v2.0.0-rc1 [created] 0c66ca41a - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org For additional commands, e-mail: commits-h...@spark.apache.org

spark git commit: Update branch-2.0 for 2.0.0 release.

2016-06-21 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 f2413736d -> fe41f68fd Update branch-2.0 for 2.0.0 release. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/fe41f68f Tree:

spark git commit: [SPARK-16118][MLLIB] add getDropLast to OneHotEncoder

2016-06-21 Thread meng
Repository: spark Updated Branches: refs/heads/branch-2.0 8d5a62d5f -> f2413736d [SPARK-16118][MLLIB] add getDropLast to OneHotEncoder ## What changes were proposed in this pull request? We forgot the getter of `dropLast` in `OneHotEncoder` ## How was this patch tested? unit test Author:

spark git commit: [SPARK-16118][MLLIB] add getDropLast to OneHotEncoder

2016-06-21 Thread meng
Repository: spark Updated Branches: refs/heads/master f4e8c31ad -> 9493b079a [SPARK-16118][MLLIB] add getDropLast to OneHotEncoder ## What changes were proposed in this pull request? We forgot the getter of `dropLast` in `OneHotEncoder` ## How was this patch tested? unit test Author:

spark git commit: [SPARK-16117][MLLIB] hide LibSVMFileFormat and move its doc to LibSVMDataSource

2016-06-21 Thread rxin
Repository: spark Updated Branches: refs/heads/master dbfdae4e4 -> f4e8c31ad [SPARK-16117][MLLIB] hide LibSVMFileFormat and move its doc to LibSVMDataSource ## What changes were proposed in this pull request? LibSVMFileFormat implements data source for LIBSVM format. However, users do not

spark git commit: [SPARK-16117][MLLIB] hide LibSVMFileFormat and move its doc to LibSVMDataSource

2016-06-21 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 aeda9a153 -> 8d5a62d5f [SPARK-16117][MLLIB] hide LibSVMFileFormat and move its doc to LibSVMDataSource ## What changes were proposed in this pull request? LibSVMFileFormat implements data source for LIBSVM format. However, users do

spark git commit: [SPARK-15606][CORE] Use non-blocking removeExecutor call to avoid deadlocks

2016-06-21 Thread zsxwing
Repository: spark Updated Branches: refs/heads/branch-1.6 abe36c53d -> d98fb19c1 [SPARK-15606][CORE] Use non-blocking removeExecutor call to avoid deadlocks ## What changes were proposed in this pull request? Set minimum number of dispatcher threads to 3 to avoid deadlocks on machines with

spark git commit: [SPARK-16096][SPARKR] add union and deprecate unionAll

2016-06-21 Thread shivaram
Repository: spark Updated Branches: refs/heads/master 918c91954 -> dbfdae4e4 [SPARK-16096][SPARKR] add union and deprecate unionAll ## What changes were proposed in this pull request? add union and deprecate unionAll, separate roxygen2 doc for rbind (since their usage and parameter lists

spark git commit: [SPARK-16096][SPARKR] add union and deprecate unionAll

2016-06-21 Thread shivaram
Repository: spark Updated Branches: refs/heads/branch-2.0 591bf7909 -> aeda9a153 [SPARK-16096][SPARKR] add union and deprecate unionAll ## What changes were proposed in this pull request? add union and deprecate unionAll, separate roxygen2 doc for rbind (since their usage and parameter

spark git commit: [MINOR][MLLIB] move setCheckpointInterval to non-expert setters

2016-06-21 Thread meng
Repository: spark Updated Branches: refs/heads/master c399c7f0e -> 918c91954 [MINOR][MLLIB] move setCheckpointInterval to non-expert setters ## What changes were proposed in this pull request? The `checkpointInterval` is a non-expert param. This PR moves its setter to non-expert group.

spark git commit: [MINOR][MLLIB] move setCheckpointInterval to non-expert setters

2016-06-21 Thread meng
Repository: spark Updated Branches: refs/heads/branch-2.0 afa14b71b -> 591bf7909 [MINOR][MLLIB] move setCheckpointInterval to non-expert setters ## What changes were proposed in this pull request? The `checkpointInterval` is a non-expert param. This PR moves its setter to non-expert group.

spark git commit: [SPARK-16002][SQL] Sleep when no new data arrives to avoid 100% CPU usage

2016-06-21 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-2.0 0d7e1d11d -> afa14b71b [SPARK-16002][SQL] Sleep when no new data arrives to avoid 100% CPU usage ## What changes were proposed in this pull request? Add a configuration to allow people to set a minimum polling delay when no new data

spark git commit: [SPARK-16002][SQL] Sleep when no new data arrives to avoid 100% CPU usage

2016-06-21 Thread yhuai
Repository: spark Updated Branches: refs/heads/master f4a3d45e3 -> c399c7f0e [SPARK-16002][SQL] Sleep when no new data arrives to avoid 100% CPU usage ## What changes were proposed in this pull request? Add a configuration to allow people to set a minimum polling delay when no new data

spark git commit: [SPARK-16037][SQL] Follow-up: add DataFrameWriter.insertInto() test cases for by position resolution

2016-06-21 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-2.0 f805b989b -> 0d7e1d11d [SPARK-16037][SQL] Follow-up: add DataFrameWriter.insertInto() test cases for by position resolution ## What changes were proposed in this pull request? This PR migrates some test cases introduced in #12313 as

spark git commit: [SPARK-16037][SQL] Follow-up: add DataFrameWriter.insertInto() test cases for by position resolution

2016-06-21 Thread yhuai
Repository: spark Updated Branches: refs/heads/master b76e35537 -> f4a3d45e3 [SPARK-16037][SQL] Follow-up: add DataFrameWriter.insertInto() test cases for by position resolution ## What changes were proposed in this pull request? This PR migrates some test cases introduced in #12313 as a

spark git commit: [SPARK-15741][PYSPARK][ML] Pyspark cleanup of set default seed to None

2016-06-21 Thread meng
Repository: spark Updated Branches: refs/heads/branch-2.0 867baaada -> f805b989b [SPARK-15741][PYSPARK][ML] Pyspark cleanup of set default seed to None ## What changes were proposed in this pull request? Several places set the seed Param default value to None which will translate to a zero

spark git commit: [SPARK-15741][PYSPARK][ML] Pyspark cleanup of set default seed to None

2016-06-21 Thread meng
Repository: spark Updated Branches: refs/heads/master 57746295e -> b76e35537 [SPARK-15741][PYSPARK][ML] Pyspark cleanup of set default seed to None ## What changes were proposed in this pull request? Several places set the seed Param default value to None which will translate to a zero

spark git commit: [SPARK-16109][SPARKR][DOC] R more doc fixes

2016-06-21 Thread shivaram
Repository: spark Updated Branches: refs/heads/branch-2.0 703a526e7 -> 867baaada [SPARK-16109][SPARKR][DOC] R more doc fixes ## What changes were proposed in this pull request? Found these issues while reviewing for SPARK-16090 ## How was this patch tested? roxygen2 doc gen, checked output

spark git commit: [SPARK-16109][SPARKR][DOC] R more doc fixes

2016-06-21 Thread shivaram
Repository: spark Updated Branches: refs/heads/master 2d6919bea -> 57746295e [SPARK-16109][SPARKR][DOC] R more doc fixes ## What changes were proposed in this pull request? Found these issues while reviewing for SPARK-16090 ## How was this patch tested? roxygen2 doc gen, checked output

spark git commit: [SPARK-16086] [SQL] [PYSPARK] create Row without any fields

2016-06-21 Thread davies
Repository: spark Updated Branches: refs/heads/branch-2.0 052779a0c -> 703a526e7 [SPARK-16086] [SQL] [PYSPARK] create Row without any fields ## What changes were proposed in this pull request? This PR allows us to create a Row without any fields. ## How was this patch tested? Added a test

spark git commit: [SPARK-16086] [SQL] [PYSPARK] create Row without any fields

2016-06-21 Thread davies
Repository: spark Updated Branches: refs/heads/master bcb0258ae -> 2d6919bea [SPARK-16086] [SQL] [PYSPARK] create Row without any fields ## What changes were proposed in this pull request? This PR allows us to create a Row without any fields. ## How was this patch tested? Added a test for

spark git commit: [SPARK-16080][YARN] Set correct link name for conf archive in executors.

2016-06-21 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 93338807a -> bcb0258ae [SPARK-16080][YARN] Set correct link name for conf archive in executors. This makes sure the files are in the executor's classpath as they're expected to be. Also update the unit test to make sure the files are there

spark git commit: [SPARK-16080][YARN] Set correct link name for conf archive in executors.

2016-06-21 Thread tgraves
Repository: spark Updated Branches: refs/heads/branch-2.0 943239bf4 -> 052779a0c [SPARK-16080][YARN] Set correct link name for conf archive in executors. This makes sure the files are in the executor's classpath as they're expected to be. Also update the unit test to make sure the files are

spark git commit: [SPARK-13792][SQL] Addendum: Fix Python API

2016-06-21 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 282be71dd -> 943239bf4 [SPARK-13792][SQL] Addendum: Fix Python API ## What changes were proposed in this pull request? This is a follow-up to https://github.com/apache/spark/pull/13795 to properly set CSV options in Python API. As

spark git commit: [SPARK-13792][SQL] Addendum: Fix Python API

2016-06-21 Thread rxin
Repository: spark Updated Branches: refs/heads/master 4f83ca105 -> 93338807a [SPARK-13792][SQL] Addendum: Fix Python API ## What changes were proposed in this pull request? This is a follow-up to https://github.com/apache/spark/pull/13795 to properly set CSV options in Python API. As part of

spark git commit: [SPARK-15177][.1][R] make SparkR model params and default values consistent with MLlib

2016-06-21 Thread meng
Repository: spark Updated Branches: refs/heads/branch-2.0 34a8e23c7 -> 282be71dd [SPARK-15177][.1][R] make SparkR model params and default values consistent with MLlib ## What changes were proposed in this pull request? This PR is a subset of #13023 by yanboliang to make SparkR model param

spark git commit: [SPARK-15177][.1][R] make SparkR model params and default values consistent with MLlib

2016-06-21 Thread meng
Repository: spark Updated Branches: refs/heads/master f3a768b7b -> 4f83ca105 [SPARK-15177][.1][R] make SparkR model params and default values consistent with MLlib ## What changes were proposed in this pull request? This PR is a subset of #13023 by yanboliang to make SparkR model param

spark git commit: [SPARK-16084][SQL] Minor comments update for "DESCRIBE" table

2016-06-21 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 0499ed961 -> 34a8e23c7 [SPARK-16084][SQL] Minor comments update for "DESCRIBE" table ## What changes were proposed in this pull request? 1. FORMATTED is actually supported, but partition is not supported; 2. Remove parenthesis as it

spark git commit: [SPARK-16084][SQL] Minor comments update for "DESCRIBE" table

2016-06-21 Thread srowen
Repository: spark Updated Branches: refs/heads/master a58f40239 -> f3a768b7b [SPARK-16084][SQL] Minor comments update for "DESCRIBE" table ## What changes were proposed in this pull request? 1. FORMATTED is actually supported, but partition is not supported; 2. Remove parenthesis as it is

spark git commit: [SPARK-16045][ML][DOC] Spark 2.0 ML.feature: doc update for stopwords and binarizer

2016-06-21 Thread meng
Repository: spark Updated Branches: refs/heads/master 37494a18e -> a58f40239 [SPARK-16045][ML][DOC] Spark 2.0 ML.feature: doc update for stopwords and binarizer ## What changes were proposed in this pull request? jira: https://issues.apache.org/jira/browse/SPARK-16045 2.0 Audit: Update

spark git commit: [SPARK-16045][ML][DOC] Spark 2.0 ML.feature: doc update for stopwords and binarizer

2016-06-21 Thread meng
Repository: spark Updated Branches: refs/heads/branch-2.0 14e5decc5 -> 0499ed961 [SPARK-16045][ML][DOC] Spark 2.0 ML.feature: doc update for stopwords and binarizer ## What changes were proposed in this pull request? jira: https://issues.apache.org/jira/browse/SPARK-16045 2.0 Audit: Update

spark git commit: [SPARK-10258][DOC][ML] Add @Since annotations to ml.feature

2016-06-21 Thread meng
Repository: spark Updated Branches: refs/heads/master ce49bfc25 -> 37494a18e [SPARK-10258][DOC][ML] Add @Since annotations to ml.feature This PR adds missing `Since` annotations to `ml.feature` package. Closes #8505. ## How was this patch tested? Existing tests. Author: Nick Pentreath

spark git commit: [SPARK-10258][DOC][ML] Add @Since annotations to ml.feature

2016-06-21 Thread meng
Repository: spark Updated Branches: refs/heads/branch-2.0 37d05ec9e -> 14e5decc5 [SPARK-10258][DOC][ML] Add @Since annotations to ml.feature This PR adds missing `Since` annotations to `ml.feature` package. Closes #8505. ## How was this patch tested? Existing tests. Author: Nick Pentreath

spark git commit: Revert "[SPARK-16086] [SQL] fix Python UDF without arguments (for 1.6)"

2016-06-21 Thread meng
Repository: spark Updated Branches: refs/heads/branch-2.0 34feea336 -> 37d05ec9e Revert "[SPARK-16086] [SQL] fix Python UDF without arguments (for 1.6)" This reverts commit 087bd2799366f4914d248e9b1f0fb921adbbdb43. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit:

spark git commit: Revert "[SPARK-16086] [SQL] fix Python UDF without arguments (for 1.6)"

2016-06-21 Thread meng
Repository: spark Updated Branches: refs/heads/master 843a1eba8 -> ce49bfc25 Revert "[SPARK-16086] [SQL] fix Python UDF without arguments (for 1.6)" This reverts commit a46553cbacf0e4012df89fe55385dec5beaa680a. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit:

spark git commit: [SPARK-15319][SPARKR][DOCS] Fix SparkR doc layout for corr and other DataFrame stats functions

2016-06-21 Thread shivaram
Repository: spark Updated Branches: refs/heads/branch-2.0 38f3b76bd -> 34feea336 [SPARK-15319][SPARKR][DOCS] Fix SparkR doc layout for corr and other DataFrame stats functions ## What changes were proposed in this pull request? Doc only changes. Please see screenshots. Before:

spark git commit: [SPARK-15319][SPARKR][DOCS] Fix SparkR doc layout for corr and other DataFrame stats functions

2016-06-21 Thread shivaram
Repository: spark Updated Branches: refs/heads/master 09f4ceaeb -> 843a1eba8 [SPARK-15319][SPARKR][DOCS] Fix SparkR doc layout for corr and other DataFrame stats functions ## What changes were proposed in this pull request? Doc only changes. Please see screenshots. Before:

spark git commit: [SPARKR][DOCS] R code doc cleanup

2016-06-21 Thread shivaram
Repository: spark Updated Branches: refs/heads/branch-2.0 4e193d3da -> 38f3b76bd [SPARKR][DOCS] R code doc cleanup ## What changes were proposed in this pull request? I ran a full pass from A to Z and fixed the obvious duplications, improper grouping etc. There are still more doc issues to

spark git commit: [SPARKR][DOCS] R code doc cleanup

2016-06-21 Thread shivaram
Repository: spark Updated Branches: refs/heads/master 41e0ffb19 -> 09f4ceaeb [SPARKR][DOCS] R code doc cleanup ## What changes were proposed in this pull request? I ran a full pass from A to Z and fixed the obvious duplications, improper grouping etc. There are still more doc issues to be

spark git commit: [SPARK-15894][SQL][DOC] Update docs for controlling #partitions

2016-06-21 Thread lian
Repository: spark Updated Branches: refs/heads/master 58f6e27dd -> 41e0ffb19 [SPARK-15894][SQL][DOC] Update docs for controlling #partitions ## What changes were proposed in this pull request? Update docs for two parameters `spark.sql.files.maxPartitionBytes` and

spark git commit: [SPARK-15894][SQL][DOC] Update docs for controlling #partitions

2016-06-21 Thread lian
Repository: spark Updated Branches: refs/heads/branch-2.0 dbf7f48b6 -> 4e193d3da [SPARK-15894][SQL][DOC] Update docs for controlling #partitions ## What changes were proposed in this pull request? Update docs for two parameters `spark.sql.files.maxPartitionBytes` and