spark git commit: [SPARK-16772] Correct API doc references to PySpark classes + formatting fixes

2016-07-28 Thread rxin
Repository: spark Updated Branches: refs/heads/master 3fd39b87b -> 274f3b9ec [SPARK-16772] Correct API doc references to PySpark classes + formatting fixes ## What's Been Changed The PR corrects several broken or missing class references in the Python API docs. It also corrects formatting

spark git commit: [SPARK-16772] Correct API doc references to PySpark classes + formatting fixes

2016-07-28 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 fb09a693d -> 5cd79c396 [SPARK-16772] Correct API doc references to PySpark classes + formatting fixes ## What's Been Changed The PR corrects several broken or missing class references in the Python API docs. It also corrects

spark git commit: [SPARK-16764][SQL] Recommend disabling vectorized parquet reader on OutOfMemoryError

2016-07-28 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 f46a07451 -> fb09a693d [SPARK-16764][SQL] Recommend disabling vectorized parquet reader on OutOfMemoryError ## What changes were proposed in this pull request? We currently don't bound or manage the data array size used by column

spark git commit: [SPARK-16764][SQL] Recommend disabling vectorized parquet reader on OutOfMemoryError

2016-07-28 Thread rxin
Repository: spark Updated Branches: refs/heads/master 1178d61ed -> 3fd39b87b [SPARK-16764][SQL] Recommend disabling vectorized parquet reader on OutOfMemoryError ## What changes were proposed in this pull request? We currently don't bound or manage the data array size used by column vectors

spark git commit: [SPARK-16740][SQL] Fix Long overflow in LongToUnsafeRowMap

2016-07-28 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 825c83717 -> f46a07451 [SPARK-16740][SQL] Fix Long overflow in LongToUnsafeRowMap Avoid overflow of Long type causing a NegativeArraySizeException a few lines later. Unit tests for HashedRelationSuite still pass. I can confirm the

spark git commit: [SPARK-16740][SQL] Fix Long overflow in LongToUnsafeRowMap

2016-07-28 Thread rxin
Repository: spark Updated Branches: refs/heads/master 9ade77c3f -> 1178d61ed [SPARK-16740][SQL] Fix Long overflow in LongToUnsafeRowMap ## What changes were proposed in this pull request? Avoid overflow of Long type causing a NegativeArraySizeException a few lines later. ## How was this
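The failure mode named in this commit message can be illustrated outside Spark. The following is a minimal Java sketch, not the patch itself: the class `LongRangeOverflowSketch`, its methods, and the `limit` parameter are hypothetical names chosen for illustration. It assumes an array is sized from the difference of two `long` keys; when that subtraction wraps around, the computed size can become negative and the allocation throws `NegativeArraySizeException`.

```java
// Hypothetical illustration only; none of these names come from Spark.
class LongRangeOverflowSketch {

    // Unchecked: maxKey - minKey can wrap around to a negative long.
    static long[] allocateUnchecked(long minKey, long maxKey) {
        long range = maxKey - minKey;
        // If the narrowed size is negative, this throws NegativeArraySizeException.
        return new long[(int) (range + 1)];
    }

    // Checked: reject a wrapped (negative) or too-large range before allocating.
    // Returns null when a dense array should not be built.
    static long[] allocateChecked(long minKey, long maxKey, int limit) {
        long range = maxKey - minKey;
        if (range >= 0 && range < limit) {
            return new long[(int) (range + 1)];
        }
        return null;
    }

    public static void main(String[] args) {
        long minKey = Integer.MIN_VALUE; // -2^31, widened to long
        long maxKey = Long.MAX_VALUE;    // 2^63 - 1
        try {
            allocateUnchecked(minKey, maxKey);
            System.out.println("no exception");
        } catch (NegativeArraySizeException e) {
            System.out.println("unchecked allocation failed: negative size");
        }
        System.out.println("checked result is null: "
                + (allocateChecked(minKey, maxKey, 1024) == null));
        System.out.println("small range length: "
                + allocateChecked(0L, 9L, 1024).length);
    }
}
```

Testing `range >= 0` before any size comparison is one simple way to catch the wrap-around; `Math.subtractExact` would be an alternative that throws `ArithmeticException` instead of silently wrapping.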

spark git commit: [SPARK-15232][SQL] Add subquery SQL building tests to LogicalPlanToSQLSuite

2016-07-28 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 2d56a2136 -> 0fd2dfb6d [SPARK-15232][SQL] Add subquery SQL building tests to LogicalPlanToSQLSuite ## What changes were proposed in this pull request? We currently test subquery SQL building using the `HiveCompatibilitySuite`. This is

spark git commit: [SPARK-15232][SQL] Add subquery SQL building tests to LogicalPlanToSQLSuite

2016-07-28 Thread rxin
Repository: spark Updated Branches: refs/heads/master 11d427c92 -> 5c2ae79bf [SPARK-15232][SQL] Add subquery SQL building tests to LogicalPlanToSQLSuite ## What changes were proposed in this pull request? We currently test subquery SQL building using the `HiveCompatibilitySuite`. This is not

spark git commit: [MINOR][DOC] missing keyword new

2016-07-27 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 8bc2877d8 -> 2f4e06e38 [MINOR][DOC] missing keyword new ## What changes were proposed in this pull request? Added the missing `new` keyword in a Java example. ## How was this patch tested? It wasn't. Author: Bartek Wiśniewski

spark git commit: [MINOR][DOC] missing keyword new

2016-07-27 Thread rxin
Repository: spark Updated Branches: refs/heads/master 70f846a31 -> bc4851ade [MINOR][DOC] missing keyword new ## What changes were proposed in this pull request? Added the missing `new` keyword in a Java example. ## How was this patch tested? It wasn't. Author: Bartek Wiśniewski

spark-website git commit: Fix a few bugs in the release notes.

2016-07-27 Thread rxin
Repository: spark-website Updated Branches: refs/heads/asf-site 46fb65a40 -> 62155dfa6 Fix a few bugs in the release notes. Project: http://git-wip-us.apache.org/repos/asf/spark-website/repo Commit: http://git-wip-us.apache.org/repos/asf/spark-website/commit/62155dfa Tree:

spark git commit: [SPARK-16729][SQL] Throw analysis exception for invalid date casts

2016-07-27 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 4e98e6905 -> 8bc2877d8 [SPARK-16729][SQL] Throw analysis exception for invalid date casts Spark currently throws exceptions for invalid casts for all other data types except date type. Somehow date type returns null. It should be

[1/2] spark git commit: [SPARK-16621][SQL] Generate stable SQLs in SQLBuilder

2016-07-27 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 44234b1c4 -> be9965b07 http://git-wip-us.apache.org/repos/asf/spark/blob/be9965b0/sql/hive/src/test/resources/sqlgen/rollup_cube_4_2.sql -- diff --git

[2/2] spark git commit: [SPARK-16621][SQL] Generate stable SQLs in SQLBuilder

2016-07-27 Thread rxin
[SPARK-16621][SQL] Generate stable SQLs in SQLBuilder Currently, the generated SQLs have unstable IDs for generated attributes. Stable generated SQL makes the queries easier to understand and test. This PR provides stable SQL generation by the following. - Provide unique

[2/2] spark-website git commit: Move 2.0.0 release date to July 26 since I managed to push it out tonight.

2016-07-27 Thread rxin
Move 2.0.0 release date to July 26 since I managed to push it out tonight. Project: http://git-wip-us.apache.org/repos/asf/spark-website/repo Commit: http://git-wip-us.apache.org/repos/asf/spark-website/commit/46fb65a4 Tree: http://git-wip-us.apache.org/repos/asf/spark-website/tree/46fb65a4

[1/2] spark-website git commit: Move 2.0.0 release date to July 26 since I managed to push it out tonight.

2016-07-27 Thread rxin
Repository: spark-website Updated Branches: refs/heads/asf-site 0915efb8d -> 46fb65a40 http://git-wip-us.apache.org/repos/asf/spark-website/blob/46fb65a4/site/news/spark-summit-europe-agenda-posted.html -- diff --git

spark-website git commit: Updated Scala 2.11 note

2016-07-26 Thread rxin
Repository: spark-website Updated Branches: refs/heads/asf-site d7dcb6c8b -> 0915efb8d Updated Scala 2.11 note Project: http://git-wip-us.apache.org/repos/asf/spark-website/repo Commit: http://git-wip-us.apache.org/repos/asf/spark-website/commit/0915efb8 Tree:

spark-website git commit: removed old versions (<1.3) and re-arranged download options.

2016-07-26 Thread rxin
Repository: spark-website Updated Branches: refs/heads/asf-site 33d86d7bf -> d7dcb6c8b removed old versions (<1.3) and re-arranged download options. Project: http://git-wip-us.apache.org/repos/asf/spark-website/repo Commit: http://git-wip-us.apache.org/repos/asf/spark-website/commit/d7dcb6c8

[1/3] spark-website git commit: 2.0.0 release

2016-07-26 Thread rxin
Repository: spark-website Updated Branches: refs/heads/asf-site 214938a57 -> 33d86d7bf http://git-wip-us.apache.org/repos/asf/spark-website/blob/33d86d7b/site/releases/spark-release-1-0-2.html -- diff --git

[3/3] spark-website git commit: 2.0.0 release

2016-07-26 Thread rxin
2.0.0 release Project: http://git-wip-us.apache.org/repos/asf/spark-website/repo Commit: http://git-wip-us.apache.org/repos/asf/spark-website/commit/33d86d7b Tree: http://git-wip-us.apache.org/repos/asf/spark-website/tree/33d86d7b Diff:

[2/3] spark-website git commit: 2.0.0 release

2016-07-26 Thread rxin
http://git-wip-us.apache.org/repos/asf/spark-website/blob/33d86d7b/site/news/spark-1-6-1-released.html -- diff --git a/site/news/spark-1-6-1-released.html b/site/news/spark-1-6-1-released.html index 8c838a0..c0e1d15 100644 ---

spark-website git commit: Update release notes

2016-07-26 Thread rxin
Repository: spark-website Updated Branches: refs/heads/asf-site 33bab055d -> 214938a57 Update release notes Project: http://git-wip-us.apache.org/repos/asf/spark-website/repo Commit: http://git-wip-us.apache.org/repos/asf/spark-website/commit/214938a5 Tree:

[2/2] spark-website git commit: Change title for Documentation -> "Other Resources" to Documentation -> Older Versions and Other Resources.

2016-07-26 Thread rxin
Change title for Documentation -> "Other Resources" to Documentation -> Older Versions and Other Resources. Project: http://git-wip-us.apache.org/repos/asf/spark-website/repo Commit: http://git-wip-us.apache.org/repos/asf/spark-website/commit/33bab055 Tree:

[1/2] spark-website git commit: Change title for Documentation -> "Other Resources" to Documentation -> Older Versions and Other Resources.

2016-07-26 Thread rxin
Repository: spark-website Updated Branches: refs/heads/asf-site f0578ab3f -> 33bab055d http://git-wip-us.apache.org/repos/asf/spark-website/blob/33bab055/site/releases/spark-release-1-5-2.html -- diff --git

spark-website git commit: Add 2.0.0 to documentation page

2016-07-26 Thread rxin
Repository: spark-website Updated Branches: refs/heads/asf-site 7cd1fdf23 -> f0578ab3f Add 2.0.0 to documentation page Project: http://git-wip-us.apache.org/repos/asf/spark-website/repo Commit: http://git-wip-us.apache.org/repos/asf/spark-website/commit/f0578ab3 Tree:

spark-website git commit: More comprehensive new features

2016-07-26 Thread rxin
Repository: spark-website Updated Branches: refs/heads/asf-site 175d31a25 -> 7cd1fdf23 More comprehensive new features Project: http://git-wip-us.apache.org/repos/asf/spark-website/repo Commit: http://git-wip-us.apache.org/repos/asf/spark-website/commit/7cd1fdf2 Tree:

spark-website git commit: Better release notes for 2.0.0

2016-07-26 Thread rxin
Repository: spark-website Updated Branches: refs/heads/asf-site 0323eb078 -> a9ba7a4c6 Better release notes for 2.0.0 Project: http://git-wip-us.apache.org/repos/asf/spark-website/repo Commit: http://git-wip-us.apache.org/repos/asf/spark-website/commit/a9ba7a4c Tree:

svn commit: r14549 - /dev/spark/spark-2.0.0/ /release/spark/spark-2.0.0/

2016-07-26 Thread rxin
Author: rxin Date: Tue Jul 26 22:06:04 2016 New Revision: 14549 Log: Spark 2.0.0 Added: release/spark/spark-2.0.0/ - copied from r14548, dev/spark/spark-2.0.0/ Removed: dev/spark/spark-2.0.0/ - To unsubscribe, e

svn commit: r14548 - /dev/spark/spark-2.0.0/

2016-07-26 Thread rxin
Author: rxin Date: Tue Jul 26 21:58:58 2016 New Revision: 14548 Log: Add Spark 2.0.0 Added: dev/spark/spark-2.0.0/ dev/spark/spark-2.0.0/spark-2.0.0-bin-hadoop2.3.tgz (with props) dev/spark/spark-2.0.0/spark-2.0.0-bin-hadoop2.3.tgz.asc dev/spark/spark-2.0.0/spark-2.0.0-bin

spark-website git commit: Add 2.0.0 release notes

2016-07-26 Thread rxin
Repository: spark-website Updated Branches: refs/heads/asf-site effcd547b -> 0323eb078 Add 2.0.0 release notes Project: http://git-wip-us.apache.org/repos/asf/spark-website/repo Commit: http://git-wip-us.apache.org/repos/asf/spark-website/commit/0323eb07 Tree:

spark-website git commit: Remove test.html

2016-07-26 Thread rxin
Repository: spark-website Updated Branches: refs/heads/asf-site b98c7b9d1 -> effcd547b Remove test.html Project: http://git-wip-us.apache.org/repos/asf/spark-website/repo Commit: http://git-wip-us.apache.org/repos/asf/spark-website/commit/effcd547 Tree:

spark-website git commit: Test

2016-07-26 Thread rxin
Repository: spark-website Updated Branches: refs/heads/asf-site aff7e088c -> b98c7b9d1 Test Project: http://git-wip-us.apache.org/repos/asf/spark-website/repo Commit: http://git-wip-us.apache.org/repos/asf/spark-website/commit/b98c7b9d Tree:

[spark] Git Push Summary

2016-07-25 Thread rxin
Repository: spark Updated Tags: refs/tags/v2.0.0-rc1 [deleted] 0c66ca41a - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org For additional commands, e-mail: commits-h...@spark.apache.org

[spark] Git Push Summary

2016-07-25 Thread rxin
Repository: spark Updated Tags: refs/tags/v2.0.0-rc2 [deleted] 4a55b2326

[spark] Git Push Summary

2016-07-25 Thread rxin
Repository: spark Updated Tags: refs/tags/v2.0.0-rc3 [deleted] 48d1fa3e7

[spark] Git Push Summary

2016-07-25 Thread rxin
Repository: spark Updated Tags: refs/tags/v2.0.0-rc4 [deleted] e5f8c1117

[spark] Git Push Summary

2016-07-25 Thread rxin
Repository: spark Updated Tags: refs/tags/v2.0.0-rc5 [deleted] 13650fc58

[spark] Git Push Summary

2016-07-25 Thread rxin
Repository: spark Updated Tags: refs/tags/v2.0.0 [created] 13650fc58

spark git commit: [SPARK-16724] Expose DefinedByConstructorParams

2016-07-25 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 aeb6d5c05 -> 4b38a6a53 [SPARK-16724] Expose DefinedByConstructorParams We don't generally make things in catalyst/execution private. Instead they are just undocumented due to their lack of stability guarantees. Author: Michael

spark git commit: [SPARK-16724] Expose DefinedByConstructorParams

2016-07-25 Thread rxin
Repository: spark Updated Branches: refs/heads/master 8a8d26f1e -> f99e34e8e [SPARK-16724] Expose DefinedByConstructorParams We don't generally make things in catalyst/execution private. Instead they are just undocumented due to their lack of stability guarantees. Author: Michael Armbrust

spark git commit: [SPARK-16672][SQL] SQLBuilder should not raise exceptions on EXISTS queries

2016-07-25 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 3d3547487 -> aeb6d5c05 [SPARK-16672][SQL] SQLBuilder should not raise exceptions on EXISTS queries ## What changes were proposed in this pull request? Currently, `SQLBuilder` raises `empty.reduceLeft` exceptions on *unoptimized*

spark git commit: [SPARK-16672][SQL] SQLBuilder should not raise exceptions on EXISTS queries

2016-07-25 Thread rxin
Repository: spark Updated Branches: refs/heads/master ba0aade6d -> 8a8d26f1e [SPARK-16672][SQL] SQLBuilder should not raise exceptions on EXISTS queries ## What changes were proposed in this pull request? Currently, `SQLBuilder` raises `empty.reduceLeft` exceptions on *unoptimized* `EXISTS`

spark git commit: Fix description of spark.speculation.quantile

2016-07-25 Thread rxin
Repository: spark Updated Branches: refs/heads/master 3fc456694 -> ba0aade6d Fix description of spark.speculation.quantile ## What changes were proposed in this pull request? Minor doc fix regarding the spark.speculation.quantile configuration parameter. It incorrectly states it should be

spark git commit: Fix description of spark.speculation.quantile

2016-07-25 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 9d581dc61 -> 3d3547487 Fix description of spark.speculation.quantile ## What changes were proposed in this pull request? Minor doc fix regarding the spark.speculation.quantile configuration parameter. It incorrectly states it should

spark git commit: [SQL][DOC] Fix a default name for parquet compression

2016-07-25 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 f0d05f669 -> 1b4f7cf13 [SQL][DOC] Fix a default name for parquet compression ## What changes were proposed in this pull request? This PR fixes an incorrect description of the default Parquet compression. Author: Takeshi YAMAMURO

spark git commit: [SQL][DOC] Fix a default name for parquet compression

2016-07-25 Thread rxin
Repository: spark Updated Branches: refs/heads/master fc17121d5 -> cda4603de [SQL][DOC] Fix a default name for parquet compression ## What changes were proposed in this pull request? This PR fixes an incorrect description of the default Parquet compression. Author: Takeshi YAMAMURO

spark git commit: [SPARK-16703][SQL] Remove extra whitespace in SQL generation for window functions

2016-07-25 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 b52e639a8 -> 57d65e511 [SPARK-16703][SQL] Remove extra whitespace in SQL generation for window functions ## What changes were proposed in this pull request? This PR fixes a minor formatting issue of `WindowSpecDefinition.sql` when no

spark git commit: [SPARK-16703][SQL] Remove extra whitespace in SQL generation for window functions

2016-07-25 Thread rxin
Repository: spark Updated Branches: refs/heads/master 79826f3c7 -> 7ea6d282b [SPARK-16703][SQL] Remove extra whitespace in SQL generation for window functions ## What changes were proposed in this pull request? This PR fixes a minor formatting issue of `WindowSpecDefinition.sql` when no

spark git commit: [SPARK-16699][SQL] Fix performance bug in hash aggregate on long string keys

2016-07-24 Thread rxin
Repository: spark Updated Branches: refs/heads/master daace6014 -> 468a3c3ac [SPARK-16699][SQL] Fix performance bug in hash aggregate on long string keys In the following code in `VectorizedHashMapGenerator.scala`: ``` def hashBytes(b: String): String = { val hash =

spark git commit: [SPARK-16690][TEST] rename SQLTestUtils.withTempTable to withTempView

2016-07-23 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 da34e8e8f -> 31c3bcb46 [SPARK-16690][TEST] rename SQLTestUtils.withTempTable to withTempView After https://github.com/apache/spark/pull/12945, we renamed `registerTempTable` to `createTempView`, since it actually creates a view.

spark git commit: [SPARK-16380][EXAMPLES] Update SQL examples and programming guide for Python language binding

2016-07-23 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 28bb2b044 -> da34e8e8f [SPARK-16380][EXAMPLES] Update SQL examples and programming guide for Python language binding This PR is based on PR #14098 authored by wangmiao1981. ## What changes were proposed in this pull request? This PR

spark git commit: [SPARK-16380][EXAMPLES] Update SQL examples and programming guide for Python language binding

2016-07-23 Thread rxin
Repository: spark Updated Branches: refs/heads/master 86c275206 -> 53b2456d1 [SPARK-16380][EXAMPLES] Update SQL examples and programming guide for Python language binding This PR is based on PR #14098 authored by wangmiao1981. ## What changes were proposed in this pull request? This PR

spark git commit: [SPARK-16334] Maintain single dictionary per row-batch in vectorized parquet reader

2016-07-21 Thread rxin
Repository: spark Updated Branches: refs/heads/master 9abd99b3c -> 46f80a307 [SPARK-16334] Maintain single dictionary per row-batch in vectorized parquet reader ## What changes were proposed in this pull request? As part of the bugfix in https://github.com/apache/spark/pull/12279, if a row

spark git commit: [SPARK-16334] Maintain single dictionary per row-batch in vectorized parquet reader

2016-07-21 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 cd41e6a33 -> 4cb8ff73f [SPARK-16334] Maintain single dictionary per row-batch in vectorized parquet reader ## What changes were proposed in this pull request? As part of the bugfix in https://github.com/apache/spark/pull/12279, if a

spark git commit: [SPARK-16613][CORE] RDD.pipe returns values for empty partitions

2016-07-20 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 83b957e6a -> b177e082f [SPARK-16613][CORE] RDD.pipe returns values for empty partitions ## What changes were proposed in this pull request? Document RDD.pipe semantics; don't execute process for empty input partitions. Note this

spark git commit: [SPARK-16613][CORE] RDD.pipe returns values for empty partitions

2016-07-20 Thread rxin
Repository: spark Updated Branches: refs/heads/master 95abbe537 -> 4b079dc39 [SPARK-16613][CORE] RDD.pipe returns values for empty partitions ## What changes were proposed in this pull request? Document RDD.pipe semantics; don't execute process for empty input partitions. Note this includes

spark git commit: [SPARK-16568][SQL][DOCUMENTATION] update sql programming guide refreshTable API in python code

2016-07-19 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 307f8922b -> f58fd4620 [SPARK-16568][SQL][DOCUMENTATION] update sql programming guide refreshTable API in python code ## What changes were proposed in this pull request? update `refreshTable` API in python code of the

spark git commit: [SPARK-16568][SQL][DOCUMENTATION] update sql programming guide refreshTable API in python code

2016-07-19 Thread rxin
Repository: spark Updated Branches: refs/heads/master 004e29cba -> 9674af6f6 [SPARK-16568][SQL][DOCUMENTATION] update sql programming guide refreshTable API in python code ## What changes were proposed in this pull request? update `refreshTable` API in python code of the

spark git commit: [SPARK-15705][SQL] Change the default value of spark.sql.hive.convertMetastoreOrc to false.

2016-07-19 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 f18f9ca5b -> 80ab8b666 [SPARK-15705][SQL] Change the default value of spark.sql.hive.convertMetastoreOrc to false. ## What changes were proposed in this pull request? In 2.0, we added new logic to convert HiveTableScan on ORC tables

spark git commit: [SPARK-15705][SQL] Change the default value of spark.sql.hive.convertMetastoreOrc to false.

2016-07-19 Thread rxin
Repository: spark Updated Branches: refs/heads/master 162d04a30 -> 2ae7b88a0 [SPARK-15705][SQL] Change the default value of spark.sql.hive.convertMetastoreOrc to false. ## What changes were proposed in this pull request? In 2.0, we added new logic to convert HiveTableScan on ORC tables to

spark git commit: [SPARK-16602][SQL] `Nvl` function should support numeric-string cases

2016-07-19 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 6ca1d941b -> f18f9ca5b [SPARK-16602][SQL] `Nvl` function should support numeric-string cases ## What changes were proposed in this pull request? `Nvl` function should support numeric-string cases like Hive/Spark 1.6. Currently, `Nvl`

spark git commit: [SPARK-16602][SQL] `Nvl` function should support numeric-string cases

2016-07-19 Thread rxin
Repository: spark Updated Branches: refs/heads/master 0bd76e872 -> 162d04a30 [SPARK-16602][SQL] `Nvl` function should support numeric-string cases ## What changes were proposed in this pull request? `Nvl` function should support numeric-string cases like Hive/Spark 1.6. Currently, `Nvl`

spark git commit: [SPARK-16620][CORE] Add back the tokenization process in `RDD.pipe(command: String)`

2016-07-19 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 2c74b6d73 -> 6ca1d941b [SPARK-16620][CORE] Add back the tokenization process in `RDD.pipe(command: String)` ## What changes were proposed in this pull request? Currently `RDD.pipe(command: String)`: - works only when the command is
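As a rough illustration of why tokenizing a command string matters, the sketch below splits a command on whitespace before handing it to `java.lang.ProcessBuilder`, which treats each list element as one argument. This is a sketch under stated assumptions, not Spark's code: the class `CommandTokenizerSketch` and its `tokenize` method are hypothetical names, and plain whitespace splitting with `java.util.StringTokenizer` is assumed here for simplicity (it does not handle quoted arguments).

```java
import java.util.ArrayList;
import java.util.List;
import java.util.StringTokenizer;

// Hypothetical illustration only; not taken from Spark.
class CommandTokenizerSketch {

    // Split a command line on whitespace; quoting is deliberately not handled.
    static List<String> tokenize(String command) {
        List<String> tokens = new ArrayList<>();
        StringTokenizer tokenizer = new StringTokenizer(command);
        while (tokenizer.hasMoreTokens()) {
            tokens.add(tokenizer.nextToken());
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Untokenized, the whole string is treated as the program name.
        ProcessBuilder whole = new ProcessBuilder("wc -l");
        // Tokenized, the program is "wc" and "-l" is its argument.
        ProcessBuilder split = new ProcessBuilder(tokenize("wc -l"));
        // Neither process is started; only the argument lists are printed.
        System.out.println(whole.command());
        System.out.println(split.command());
    }
}
```

Without tokenization, a command with arguments would be looked up as a single executable name, which generally fails for anything other than a bare program name.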

spark git commit: [DOC] improve python doc for rdd.histogram and dataframe.join

2016-07-19 Thread rxin
Repository: spark Updated Branches: refs/heads/master 1426a0805 -> 6ee40d2cc [DOC] improve python doc for rdd.histogram and dataframe.join ## What changes were proposed in this pull request? doc change only ## How was this patch tested? doc change only Author: Mortada Mehyar

spark git commit: [DOC] improve python doc for rdd.histogram and dataframe.join

2016-07-19 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 ef2a6f131 -> 504aa6f7a [DOC] improve python doc for rdd.histogram and dataframe.join ## What changes were proposed in this pull request? doc change only ## How was this patch tested? doc change only Author: Mortada Mehyar

spark git commit: [MINOR] Remove unused arg in als.py

2016-07-18 Thread rxin
Repository: spark Updated Branches: refs/heads/master 69c773052 -> e5fbb182c [MINOR] Remove unused arg in als.py ## What changes were proposed in this pull request? The second argument of the `update()` method is never used, so this patch removes it. ## How was this patch tested? local run with

spark git commit: [SPARK-16615][SQL] Expose sqlContext in SparkSession

2016-07-18 Thread rxin
che-spark-developers-list.1001551.n3.nabble.com/Re-transtition-SQLContext-to-SparkSession-td18342.html ## How was this patch tested? N/A - this is a visibility change. Author: Reynold Xin <r...@databricks.com> Closes #14252 from rxin/SPARK-16615. Project: http://git-wip-us.apache.org/repos/

spark git commit: [SPARK-16615][SQL] Expose sqlContext in SparkSession

2016-07-18 Thread rxin
ttp://apache-spark-developers-list.1001551.n3.nabble.com/Re-transtition-SQLContext-to-SparkSession-td18342.html ## How was this patch tested? N/A - this is a visibility change. Author: Reynold Xin <r...@databricks.com> Closes #14252 from rxin/SPARK-16615. (cherry picked fr

spark git commit: [HOTFIX] Fix Scala 2.10 compilation

2016-07-18 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 aac860802 -> 1dd152656 [HOTFIX] Fix Scala 2.10 compilation (cherry picked from commit c4524f5193e1b3ce1c56c5aed126f4121ce26d23) Signed-off-by: Reynold Xin Project:

spark git commit: [HOTFIX] Fix Scala 2.10 compilation

2016-07-18 Thread rxin
Repository: spark Updated Branches: refs/heads/master ea78edb80 -> c4524f519 [HOTFIX] Fix Scala 2.10 compilation Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/c4524f51 Tree:

[1/2] spark git commit: [SPARK-16590][SQL] Improve LogicalPlanToSQLSuite to check generated SQL directly

2016-07-18 Thread rxin
Repository: spark Updated Branches: refs/heads/master 75f0efe74 -> ea78edb80 http://git-wip-us.apache.org/repos/asf/spark/blob/ea78edb8/sql/hive/src/test/resources/sqlgen/script_transformation_row_format_one.sql -- diff --git

[2/2] spark git commit: [SPARK-16590][SQL] Improve LogicalPlanToSQLSuite to check generated SQL directly

2016-07-18 Thread rxin
[SPARK-16590][SQL] Improve LogicalPlanToSQLSuite to check generated SQL directly ## What changes were proposed in this pull request? This PR improves `LogicalPlanToSQLSuite` to check the generated SQL directly by **structure**. So far, `LogicalPlanToSQLSuite` relies on `checkHiveQl` to ensure

[1/2] spark git commit: [SPARK-16590][SQL] Improve LogicalPlanToSQLSuite to check generated SQL directly

2016-07-18 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 7889585cc -> aac860802 http://git-wip-us.apache.org/repos/asf/spark/blob/aac86080/sql/hive/src/test/resources/sqlgen/script_transformation_row_format_one.sql -- diff

[2/2] spark git commit: [SPARK-16590][SQL] Improve LogicalPlanToSQLSuite to check generated SQL directly

2016-07-18 Thread rxin
[SPARK-16590][SQL] Improve LogicalPlanToSQLSuite to check generated SQL directly ## What changes were proposed in this pull request? This PR improves `LogicalPlanToSQLSuite` to check the generated SQL directly by **structure**. So far, `LogicalPlanToSQLSuite` relies on `checkHiveQl` to ensure

spark git commit: [SPARK-16588][SQL] Deprecate monotonicallyIncreasingId in Scala/Java

2016-07-17 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 a4bf13a74 -> 808d69aaa [SPARK-16588][SQL] Deprecate monotonicallyIncreasingId in Scala/Java This patch deprecates monotonicallyIncreasingId in Scala/Java, as done in Python. This patch was originally written by HyukjinKwon. Closes

spark git commit: [SPARK-16588][SQL] Deprecate monotonicallyIncreasingId in Scala/Java

2016-07-17 Thread rxin
Repository: spark Updated Branches: refs/heads/master d27fe9ba6 -> 480c87064 [SPARK-16588][SQL] Deprecate monotonicallyIncreasingId in Scala/Java This patch deprecates monotonicallyIncreasingId in Scala/Java, as done in Python. This patch was originally written by HyukjinKwon. Closes

spark git commit: [SPARK-16584][SQL] Move regexp unit tests to RegexpExpressionsSuite

2016-07-17 Thread rxin
ite to RegexpExpressionsSuite to match the file name for regexp expressions. ## How was this patch tested? This is a test only change. Author: Reynold Xin <r...@databricks.com> Closes #14230 from rxin/SPARK-16584. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/

spark git commit: [SPARK-16584][SQL] Move regexp unit tests to RegexpExpressionsSuite

2016-07-17 Thread rxin
ite to RegexpExpressionsSuite to match the file name for regexp expressions. ## How was this patch tested? This is a test only change. Author: Reynold Xin <r...@databricks.com> Closes #14230 from rxin/SPARK-16584. (cherry picked from commit 7b84758034b9bceca1168438ef5d0beefd5b5273) Signed-off-by: Reyno

spark git commit: [SPARK-3359][DOCS] More changes to resolve javadoc 8 errors that will help unidoc/genjavadoc compatibility

2016-07-16 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 5d495292b -> cad4693f9 [SPARK-3359][DOCS] More changes to resolve javadoc 8 errors that will help unidoc/genjavadoc compatibility ## What changes were proposed in this pull request? These are yet more changes that resolve problems

spark git commit: [SPARK-3359][DOCS] More changes to resolve javadoc 8 errors that will help unidoc/genjavadoc compatibility

2016-07-16 Thread rxin
Repository: spark Updated Branches: refs/heads/master a1ffbada8 -> 5ec0d692b [SPARK-3359][DOCS] More changes to resolve javadoc 8 errors that will help unidoc/genjavadoc compatibility ## What changes were proposed in this pull request? These are yet more changes that resolve problems with

spark git commit: [SPARK-16582][SQL] Explicitly define isNull = false for non-nullable expressions

2016-07-16 Thread rxin
Repository: spark Updated Branches: refs/heads/master b2f24f945 -> a1ffbada8 [SPARK-16582][SQL] Explicitly define isNull = false for non-nullable expressions ## What changes were proposed in this pull request? This patch is just a slightly safer way to fix the issue we encountered in

spark git commit: [SPARK-16582][SQL] Explicitly define isNull = false for non-nullable expressions

2016-07-16 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 34ac45a34 -> 5d495292b [SPARK-16582][SQL] Explicitly define isNull = false for non-nullable expressions ## What changes were proposed in this pull request? This patch is just a slightly safer way to fix the issue we encountered in

spark git commit: [SPARK-16546][SQL][PYSPARK] update python dataframe.drop

2016-07-14 Thread rxin
Repository: spark Updated Branches: refs/heads/master 2e4075e2e -> 183242382 [SPARK-16546][SQL][PYSPARK] update python dataframe.drop ## What changes were proposed in this pull request? Make the `dataframe.drop` API in Python accept multiple columns, so that it is consistent with Scala

spark git commit: [SPARK-16557][SQL] Remove stale doc in sql/README.md

2016-07-14 Thread rxin
uld be useful to keep the list of projects to explain what's going on, and everything else should be removed. ## How was this patch tested? N/A Author: Reynold Xin <r...@databricks.com> Closes #14211 from rxin/SPARK-16557. (cherry picked from commit 2e4075e2ece9574100c79558cab054485e25c2e

spark git commit: [SPARK-16557][SQL] Remove stale doc in sql/README.md

2016-07-14 Thread rxin
uld be useful to keep the list of projects to explain what's going on, and everything else should be removed. ## How was this patch tested? N/A Author: Reynold Xin <r...@databricks.com> Closes #14211 from rxin/SPARK-16557. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: h

spark git commit: [SPARK-16555] Work around Jekyll error-handling bug which led to silent failures

2016-07-14 Thread rxin
Repository: spark Updated Branches: refs/heads/master 01c4c1fa5 -> 972673aca [SPARK-16555] Work around Jekyll error-handling bug which led to silent failures If a custom Jekyll template tag throws Ruby's equivalent of a "file not found" exception, then Jekyll will stop the doc building

spark git commit: [SPARK-16555] Work around Jekyll error-handling bug which led to silent failures

2016-07-14 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 5c56bc00c -> aa4690b1b [SPARK-16555] Work around Jekyll error-handling bug which led to silent failures If a custom Jekyll template tag throws Ruby's equivalent of a "file not found" exception, then Jekyll will stop the doc building

spark git commit: [SPARK-16553][DOCS] Fix SQL example file name in docs

2016-07-14 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 1fe0bcdd0 -> 5c56bc00c [SPARK-16553][DOCS] Fix SQL example file name in docs ## What changes were proposed in this pull request? Fixes a typo in the sql programming guide ## How was this patch tested? Building docs locally (If this

spark git commit: [SPARK-16528][SQL] Fix NPE problem in HiveClientImpl

2016-07-14 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 741801921 -> 23e1ab9c7 [SPARK-16528][SQL] Fix NPE problem in HiveClientImpl ## What changes were proposed in this pull request? There are some calls to methods or fields (getParameters, properties) which are then passed to Java/Scala

spark git commit: [SPARK-16528][SQL] Fix NPE problem in HiveClientImpl

2016-07-14 Thread rxin
Repository: spark Updated Branches: refs/heads/master c576f9fb9 -> 31ca741ae [SPARK-16528][SQL] Fix NPE problem in HiveClientImpl ## What changes were proposed in this pull request? There are some calls to methods or fields (getParameters, properties) which are then passed to Java/Scala

spark git commit: [SPARK-16503] SparkSession should provide Spark version

2016-07-13 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 5244f86cf -> f6eda6b30 [SPARK-16503] SparkSession should provide Spark version ## What changes were proposed in this pull request? This patch enables SparkSession to provide spark version. ## How was this patch tested? Manual test:

[spark] Git Push Summary

2016-07-13 Thread rxin
Repository: spark Updated Tags: refs/tags/v2.0.0-rc3 [deleted] 47eb9a621

spark git commit: [SPARK-16536][SQL][PYSPARK][MINOR] Expose `sql` in PySpark Shell

2016-07-13 Thread rxin
Repository: spark Updated Branches: refs/heads/master a5f51e216 -> 9c530576a [SPARK-16536][SQL][PYSPARK][MINOR] Expose `sql` in PySpark Shell ## What changes were proposed in this pull request? This PR exposes `sql` in PySpark Shell like Scala/R Shells for consistency. **Background** *

spark git commit: [HOTFIX] Fix build break.

2016-07-13 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.6 7c8a399a2 -> 980db2bd4 [HOTFIX] Fix build break. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/980db2bd Tree:

spark git commit: [SPARK-16489][SQL] Guard against variable reuse mistakes in expression code generation

2016-07-13 Thread rxin
fixes the bug in crc32 expression. This is a test harness change, but I also created a new test suite for testing the test harness. Author: Reynold Xin <r...@databricks.com> Closes #14146 from rxin/SPARK-16489. (cherry picked from commit c377e49e38a290e5c4fbc178278069788674dfb7) Signed-

spark git commit: [SPARK-16488] Fix codegen variable namespace collision in pmod and partitionBy

2016-07-13 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.6 9808735e0 -> d1c992fea [SPARK-16488] Fix codegen variable namespace collision in pmod and partitionBy This patch fixes a variable namespace collision bug in pmod and partitionBy Regression test for one possible occurrence. A more

spark git commit: [SPARK-16514][SQL] Fix various regex codegen bugs

2016-07-13 Thread rxin
aces. ## How was this patch tested? Unit tests. rxin Author: Eric Liang <e...@databricks.com> Closes #14168 from ericl/sc-3906. (cherry picked from commit 1c58fa905b6543d366d00b2e5394dfd633987f6d) Signed-off-by: Reynold Xin <r...@databricks.com> Project: http://git-wip-us.apache.org/re

spark git commit: [SPARK-16514][SQL] Fix various regex codegen bugs

2016-07-13 Thread rxin
aces. ## How was this patch tested? Unit tests. rxin Author: Eric Liang <e...@databricks.com> Closes #14168 from ericl/sc-3906. (cherry picked from commit 1c58fa905b6543d366d00b2e5394dfd633987f6d) Signed-off-by: Reynold Xin <r...@databricks.com> Project: http://git-wip-us.apache.org/re

spark git commit: [SPARK-16405] Add metrics and source for external shuffle service

2016-07-12 Thread rxin
Repository: spark Updated Branches: refs/heads/master d513c99c1 -> 68df47aca [SPARK-16405] Add metrics and source for external shuffle service ## What changes were proposed in this pull request? Since externalShuffleService is essential for spark, better monitoring for shuffle service is

spark git commit: [SPARK-16489][SQL] Guard against variable reuse mistakes in expression code generation

2016-07-12 Thread rxin
fixes the bug in crc32 expression. This is a test harness change, but I also created a new test suite for testing the test harness. Author: Reynold Xin <r...@databricks.com> Closes #14146 from rxin/SPARK-16489. (cherry picked from commit c377e49e38a290e5c4fbc178278069788674dfb7) Signed-
