[spark] branch branch-3.3 updated: [MINOR][TEST][SQL] Add a CTE subquery scope test case

2022-12-23 Thread rxin
This is an automated email from the ASF dual-hosted git repository. rxin pushed a commit to branch branch-3.3 in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/branch-3.3 by this push: new aa39b06462a [MINOR][TEST][SQL] Add a CTE

[spark] branch master updated: [MINOR][TEST][SQL] Add a CTE subquery scope test case

2022-12-23 Thread rxin
This is an automated email from the ASF dual-hosted git repository. rxin pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/master by this push: new 24edf8ecb5e [MINOR][TEST][SQL] Add a CTE subquery

svn commit: r46414 - /dev/spark/v3.1.1-rc3-bin/ /release/spark/spark-3.1.1/

2021-03-02 Thread rxin
Author: rxin Date: Tue Mar 2 11:00:12 2021 New Revision: 46414 Log: Moving Apache Spark 3.1.1 RC3 to Apache Spark 3.1.1 Added: release/spark/spark-3.1.1/ - copied from r46413, dev/spark/v3.1.1-rc3-bin/ Removed: dev/spark/v3.1.1-rc3-bin

svn commit: r46413 - in /dev/spark: v3.1.1-rc3-bin/ v3.1.1-rc3-docs/

2021-03-02 Thread rxin
Author: rxin Date: Tue Mar 2 10:55:39 2021 New Revision: 46413 Log: Recover 3.1.1 RC3 Added: dev/spark/v3.1.1-rc3-bin/ - copied from r46410, dev/spark/v3.1.1-rc3-bin/ dev/spark/v3.1.1-rc3-docs/ - copied from r46410, dev/spark/v3.1.1-rc3-docs

svn commit: r46411 - in /dev/spark: v3.1.1-rc3-bin/ v3.1.1-rc3-docs/

2021-03-02 Thread rxin
Author: rxin Date: Tue Mar 2 10:39:38 2021 New Revision: 46411 Log: Removing RC artifacts. Removed: dev/spark/v3.1.1-rc3-bin/ dev/spark/v3.1.1-rc3-docs/ - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org

svn commit: r46412 - in /dev/spark: v3.1.0-rc1-bin/ v3.1.0-rc1-docs/

2021-03-02 Thread rxin
Author: rxin Date: Tue Mar 2 10:39:58 2021 New Revision: 46412 Log: Removing RC artifacts. Removed: dev/spark/v3.1.0-rc1-bin/ dev/spark/v3.1.0-rc1-docs/ - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org

svn commit: r46410 - in /dev/spark: v3.1.1-rc2-bin/ v3.1.1-rc2-docs/

2021-03-02 Thread rxin
Author: rxin Date: Tue Mar 2 10:39:32 2021 New Revision: 46410 Log: Removing RC artifacts. Removed: dev/spark/v3.1.1-rc2-bin/ dev/spark/v3.1.1-rc2-docs/ - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org

svn commit: r46409 - in /dev/spark: v3.1.1-rc1-bin/ v3.1.1-rc1-docs/

2021-03-02 Thread rxin
Author: rxin Date: Tue Mar 2 10:39:25 2021 New Revision: 46409 Log: Removing RC artifacts. Removed: dev/spark/v3.1.1-rc1-bin/ dev/spark/v3.1.1-rc1-docs/ - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org

svn commit: r40088 - in /dev/spark: v3.0.0-rc1-bin/ v3.0.0-rc1-docs/ v3.0.0-rc2-bin/ v3.0.0-rc2-docs/ v3.0.0-rc3-docs/

2020-06-18 Thread rxin
Author: rxin Date: Thu Jun 18 16:41:27 2020 New Revision: 40088 Log: Removing RC artifacts. Removed: dev/spark/v3.0.0-rc1-bin/ dev/spark/v3.0.0-rc1-docs/ dev/spark/v3.0.0-rc2-bin/ dev/spark/v3.0.0-rc2-docs/ dev/spark/v3.0.0-rc3-docs

svn commit: r40050 - /dev/spark/v3.0.0-rc3-bin/ /release/spark/spark-3.0.0/

2020-06-16 Thread rxin
Author: rxin Date: Tue Jun 16 09:18:02 2020 New Revision: 40050 Log: release 3.0.0 Added: release/spark/spark-3.0.0/ - copied from r40049, dev/spark/v3.0.0-rc3-bin/ Removed: dev/spark/v3.0.0-rc3-bin

[spark] tag v3.0.0 created (now 3fdfce3)

2020-06-14 Thread rxin
This is an automated email from the ASF dual-hosted git repository. rxin pushed a change to tag v3.0.0 in repository https://gitbox.apache.org/repos/asf/spark.git. at 3fdfce3 (commit) No new revisions were added by this update

svn commit: r39960 - in /dev/spark/v3.0.0-rc3-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _site/api/java/org/apache/parqu

2020-06-06 Thread rxin
Author: rxin Date: Sat Jun 6 14:03:25 2020 New Revision: 39960 Log: Apache Spark v3.0.0-rc3 docs [This commit notification would consist of 1920 parts, which exceeds the limit of 50 ones, so it was shortened to the summary

svn commit: r39959 - /dev/spark/v3.0.0-rc3-bin/

2020-06-06 Thread rxin
Author: rxin Date: Sat Jun 6 13:35:40 2020 New Revision: 39959 Log: Apache Spark v3.0.0-rc3 Added: dev/spark/v3.0.0-rc3-bin/ dev/spark/v3.0.0-rc3-bin/SparkR_3.0.0.tar.gz (with props) dev/spark/v3.0.0-rc3-bin/SparkR_3.0.0.tar.gz.asc dev/spark/v3.0.0-rc3-bin/SparkR_3.0.0

svn commit: r39958 - /dev/spark/v3.0.0-rc3-bin/

2020-06-06 Thread rxin
Author: rxin Date: Sat Jun 6 11:18:32 2020 New Revision: 39958 Log: remove 3.0 rc3 binary Removed: dev/spark/v3.0.0-rc3-bin/ - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org For additional commands, e-mail

[spark] branch branch-3.0 updated (fa608b9 -> 3ea461d)

2020-06-05 Thread rxin
This is an automated email from the ASF dual-hosted git repository. rxin pushed a change to branch branch-3.0 in repository https://gitbox.apache.org/repos/asf/spark.git. from fa608b9 [SPARK-31904][SQL] Fix case sensitive problem of char and varchar partition columns add 3fdfce3

[spark] 01/01: Preparing development version 3.0.1-SNAPSHOT

2020-06-05 Thread rxin
This is an automated email from the ASF dual-hosted git repository. rxin pushed a commit to branch branch-3.0 in repository https://gitbox.apache.org/repos/asf/spark.git commit 3ea461d61e635835c07bacb5a0c403ae2a3099a0 Author: Reynold Xin AuthorDate: Sat Jun 6 02:57:41 2020 + Preparing

[spark] 01/01: Preparing Spark release v3.0.0-rc3

2020-06-05 Thread rxin
This is an automated email from the ASF dual-hosted git repository. rxin pushed a commit to tag v3.0.0-rc3 in repository https://gitbox.apache.org/repos/asf/spark.git commit 3fdfce3120f307147244e5eaf46d61419a723d50 Author: Reynold Xin AuthorDate: Sat Jun 6 02:57:35 2020 + Preparing

[spark] tag v3.0.0-rc3 created (now 3fdfce3)

2020-06-05 Thread rxin
This is an automated email from the ASF dual-hosted git repository. rxin pushed a change to tag v3.0.0-rc3 in repository https://gitbox.apache.org/repos/asf/spark.git. at 3fdfce3 (commit) This tag includes the following new commits: new 3fdfce3 Preparing Spark release v3.0.0-rc3

svn commit: r39951 - /dev/spark/v3.0.0-rc3-bin/

2020-06-05 Thread rxin
Author: rxin Date: Fri Jun 5 19:08:09 2020 New Revision: 39951 Log: Apache Spark v3.0.0-rc3 Added: dev/spark/v3.0.0-rc3-bin/ dev/spark/v3.0.0-rc3-bin/SparkR_3.0.0.tar.gz (with props) dev/spark/v3.0.0-rc3-bin/SparkR_3.0.0.tar.gz.asc dev/spark/v3.0.0-rc3-bin/SparkR_3.0.0

svn commit: r39657 - in /dev/spark/v3.0.0-rc2-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _site/api/java/org/apache/parqu

2020-05-18 Thread rxin
Author: rxin Date: Mon May 18 16:11:38 2020 New Revision: 39657 Log: Apache Spark v3.0.0-rc2 docs [This commit notification would consist of 1921 parts, which exceeds the limit of 50 ones, so it was shortened to the summary

svn commit: r39656 - /dev/spark/v3.0.0-rc2-bin/

2020-05-18 Thread rxin
Author: rxin Date: Mon May 18 15:42:56 2020 New Revision: 39656 Log: Apache Spark v3.0.0-rc2 Added: dev/spark/v3.0.0-rc2-bin/ dev/spark/v3.0.0-rc2-bin/SparkR_3.0.0.tar.gz (with props) dev/spark/v3.0.0-rc2-bin/SparkR_3.0.0.tar.gz.asc dev/spark/v3.0.0-rc2-bin/SparkR_3.0.0

[spark] branch branch-3.0 updated (740da34 -> f6053b9)

2020-05-18 Thread rxin
This is an automated email from the ASF dual-hosted git repository. rxin pushed a change to branch branch-3.0 in repository https://gitbox.apache.org/repos/asf/spark.git. from 740da34 [SPARK-31738][SQL][DOCS] Describe 'L' and 'M' month pattern letters add 29853ec Preparing Spark

[spark] 01/01: Preparing Spark release v3.0.0-rc2

2020-05-18 Thread rxin
This is an automated email from the ASF dual-hosted git repository. rxin pushed a commit to tag v3.0.0-rc2 in repository https://gitbox.apache.org/repos/asf/spark.git commit 29853eca69bceefd227cbe8421a09c116b7b753a Author: Reynold Xin AuthorDate: Mon May 18 13:21:37 2020 + Preparing

[spark] tag v3.0.0-rc2 created (now 29853ec)

2020-05-18 Thread rxin
This is an automated email from the ASF dual-hosted git repository. rxin pushed a change to tag v3.0.0-rc2 in repository https://gitbox.apache.org/repos/asf/spark.git. at 29853ec (commit) This tag includes the following new commits: new 29853ec Preparing Spark release v3.0.0-rc2

[spark] 01/01: Preparing development version 3.0.1-SNAPSHOT

2020-05-18 Thread rxin
This is an automated email from the ASF dual-hosted git repository. rxin pushed a commit to branch branch-3.0 in repository https://gitbox.apache.org/repos/asf/spark.git commit f6053b94f874c62856baa7bfa35df14c78bebc9f Author: Reynold Xin AuthorDate: Mon May 18 13:21:43 2020 + Preparing

svn commit: r38759 - in /dev/spark/v3.0.0-rc1-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _site/api/java/org/apache/parqu

2020-03-31 Thread rxin
Author: rxin Date: Tue Mar 31 13:45:27 2020 New Revision: 38759 Log: Apache Spark v3.0.0-rc1 docs [This commit notification would consist of 1911 parts, which exceeds the limit of 50 ones, so it was shortened to the summary

svn commit: r38754 - /dev/spark/v3.0.0-rc1-bin/

2020-03-31 Thread rxin
Author: rxin Date: Tue Mar 31 09:57:10 2020 New Revision: 38754 Log: Apache Spark v3.0.0-rc1 Added: dev/spark/v3.0.0-rc1-bin/ dev/spark/v3.0.0-rc1-bin/SparkR_3.0.0.tar.gz (with props) dev/spark/v3.0.0-rc1-bin/SparkR_3.0.0.tar.gz.asc dev/spark/v3.0.0-rc1-bin/SparkR_3.0.0

svn commit: r38753 - /dev/spark/v3.0.0-rc1-bin/

2020-03-31 Thread rxin
Author: rxin Date: Tue Mar 31 07:25:15 2020 New Revision: 38753 Log: retry Removed: dev/spark/v3.0.0-rc1-bin/ - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org For additional commands, e-mail: commits-h

svn commit: r38740 - /dev/spark/v3.0.0-rc1-bin/

2020-03-30 Thread rxin
Author: rxin Date: Mon Mar 30 16:00:46 2020 New Revision: 38740 Log: Apache Spark v3.0.0-rc1 Added: dev/spark/v3.0.0-rc1-bin/ dev/spark/v3.0.0-rc1-bin/SparkR_3.0.0.tar.gz (with props) dev/spark/v3.0.0-rc1-bin/SparkR_3.0.0.tar.gz.asc dev/spark/v3.0.0-rc1-bin/SparkR_3.0.0

[spark] 01/01: Preparing development version 3.0.1-SNAPSHOT

2020-03-30 Thread rxin
This is an automated email from the ASF dual-hosted git repository. rxin pushed a commit to branch branch-3.0 in repository https://gitbox.apache.org/repos/asf/spark.git commit fc5079841907443369af98b17c20f1ac24b3727d Author: Reynold Xin AuthorDate: Mon Mar 30 08:42:27 2020 + Preparing

[spark] branch branch-3.0 updated (5687b31 -> fc50798)

2020-03-30 Thread rxin
This is an automated email from the ASF dual-hosted git repository. rxin pushed a change to branch branch-3.0 in repository https://gitbox.apache.org/repos/asf/spark.git. from 5687b31 [SPARK-30532] DataFrameStatFunctions to work with TABLE.COLUMN syntax add 6550d0d Preparing Spark

[spark] tag v3.0.0-rc1 created (now 6550d0d)

2020-03-30 Thread rxin
This is an automated email from the ASF dual-hosted git repository. rxin pushed a change to tag v3.0.0-rc1 in repository https://gitbox.apache.org/repos/asf/spark.git. at 6550d0d (commit) This tag includes the following new commits: new 6550d0d Preparing Spark release v3.0.0-rc1

[spark] 01/01: Preparing Spark release v3.0.0-rc1

2020-03-30 Thread rxin
This is an automated email from the ASF dual-hosted git repository. rxin pushed a commit to tag v3.0.0-rc1 in repository https://gitbox.apache.org/repos/asf/spark.git commit 6550d0d5283efdbbd838f3aeaf0476c7f52a0fb1 Author: Reynold Xin AuthorDate: Mon Mar 30 08:42:10 2020 + Preparing

svn commit: r38725 - /dev/spark/KEYS

2020-03-30 Thread rxin
Author: rxin Date: Mon Mar 30 07:26:00 2020 New Revision: 38725 Log: Update KEYS Modified: dev/spark/KEYS Modified: dev/spark/KEYS == --- dev/spark/KEYS (original) +++ dev/spark/KEYS Mon Mar 30 07:26:00 2020

[spark] branch test-branch deleted (was 0f8b07e)

2019-02-01 Thread rxin
This is an automated email from the ASF dual-hosted git repository. rxin pushed a change to branch test-branch in repository https://gitbox.apache.org/repos/asf/spark.git. was 0f8b07e test This change permanently discards the following revisions: discard 0f8b07e test

[spark] branch test-branch created (now 0f8b07e)

2019-02-01 Thread rxin
This is an automated email from the ASF dual-hosted git repository. rxin pushed a change to branch test-branch in repository https://gitbox.apache.org/repos/asf/spark.git. at 0f8b07e test This branch includes the following new commits: new 0f8b07e test The 1 revisions listed

[spark] 01/01: test

2019-02-01 Thread rxin
This is an automated email from the ASF dual-hosted git repository. rxin pushed a commit to branch test-branch in repository https://gitbox.apache.org/repos/asf/spark.git commit 0f8b07e5034af2819b75b53aadffda82ae0c31b8 Author: Reynold Xin AuthorDate: Fri Feb 1 13:28:18 2019 -0800 test

spark git commit: [SPARK-26142] followup: Move sql shuffle read metrics relatives to SQLShuffleMetricsReporter

2018-11-29 Thread rxin
Repository: spark Updated Branches: refs/heads/master 9fdc7a840 -> cb368f2c2 [SPARK-26142] followup: Move sql shuffle read metrics relatives to SQLShuffleMetricsReporter ## What changes were proposed in this pull request? Follow up for https://github.com/apache/spark/pull/23128, move sql

spark git commit: [SPARK-26141] Enable custom metrics implementation in shuffle write

2018-11-26 Thread rxin
How was this patch tested? No behavior change expected, as it is a straightforward refactoring. Updated all existing test cases. Closes #23106 from rxin/SPARK-26141. Authored-by: Reynold Xin Signed-off-by: Reynold Xin Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-

spark git commit: [SPARK-26129][SQL] Instrumentation for per-query planning time

2018-11-21 Thread rxin
vs physical planning). This patch adds a simple utility to track the runtime of various rules and various planning phases. ## How was this patch tested? Added unit tests and end-to-end integration tests. Closes #23096 from rxin/SPARK-26129. Authored-by: Reynold Xin Signed-off-by: Reynold Xin Proj

spark-website git commit: Use Heilmeier Catechism for SPIP template.

2018-10-25 Thread rxin
Repository: spark-website Updated Branches: refs/heads/asf-site e4b87718d -> 005a2a0d1 Use Heilmeier Catechism for SPIP template. Project: http://git-wip-us.apache.org/repos/asf/spark-website/repo Commit: http://git-wip-us.apache.org/repos/asf/spark-website/commit/005a2a0d Tree:

spark git commit: [SPARK-24157][SS][FOLLOWUP] Rename to spark.sql.streaming.noDataMicroBatches.enabled

2018-09-19 Thread rxin
How was this patch tested? Made sure no other references to this config are in the code base: ``` > git grep "noDataMicro" sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala: buildConf("spark.sql.streaming.noDataMicroBatches.enabled") ``` Closes #2

spark git commit: [SPARK-24157][SS][FOLLOWUP] Rename to spark.sql.streaming.noDataMicroBatches.enabled

2018-09-19 Thread rxin
How was this patch tested? Made sure no other references to this config are in the code base: ``` > git grep "noDataMicro" sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala: buildConf("spark.sql.streaming.noDataMicroBatches.enabled") ``` Closes #2

spark git commit: add one supported type missing from the javadoc

2018-06-15 Thread rxin
Repository: spark Updated Branches: refs/heads/master e4fee395e -> c7c0b086a add one supported type missing from the javadoc ## What changes were proposed in this pull request? The supported java.math.BigInteger type is not mentioned in the javadoc of Encoders.bean() ## How was this patch

[1/2] spark-website git commit: Update text/wording to more "modern" Spark and more consistent.

2018-04-12 Thread rxin
Repository: spark-website Updated Branches: refs/heads/asf-site 91b561749 -> 658467248 http://git-wip-us.apache.org/repos/asf/spark-website/blob/65846724/site/news/strata-exercises-now-available-online.html -- diff --git

[2/2] spark-website git commit: Update text/wording to more "modern" Spark and more consistent.

2018-04-12 Thread rxin
Update text/wording to more "modern" Spark and more consistent. 1. Use DataFrame examples. 2. Reduce explicit comparison with MapReduce, since the topic does not really come up. 3. More focus on analytics rather than "cluster compute". 4. Update committer affiliation. 5. Make it more clear

[2/2] spark-website git commit: Squashed commit of the following:

2018-03-16 Thread rxin
Squashed commit of the following: commit 8e2dd71cf5613be6f019bb76b46226771422a40e Merge: 8bd24fb6d 01f0b4e0c Author: Reynold Xin Date: Fri Mar 16 10:24:54 2018 -0700 Merge pull request #104 from mateiz/history Add a project history page commit

[1/2] spark-website git commit: Squashed commit of the following:

2018-03-16 Thread rxin
Repository: spark-website Updated Branches: refs/heads/asf-site 8bd24fb6d -> a1d84bcbf http://git-wip-us.apache.org/repos/asf/spark-website/blob/a1d84bcb/site/news/spark-summit-june-2016-agenda-posted.html -- diff --git

spark git commit: [SPARK-22648][K8S] Spark on Kubernetes - Documentation

2017-12-21 Thread rxin
our fork. Rest is documentation. cc rxin mateiz (shepherd) k8s-big-data SIG members & contributors: foxish ash211 mccheah liyinan926 erikerlandson ssuchter varunkatta kimoonkim tnachen ifilonenko reviewers: vanzin felixcheung jiangxb1987 mridulm TODO: - [x] Add dockerfiles directory t

[1/2] spark git commit: [SPARK-18278][SCHEDULER] Spark on Kubernetes - Basic Scheduler Backend

2017-11-28 Thread rxin
Repository: spark Updated Branches: refs/heads/master 475a29f11 -> e9b2070ab http://git-wip-us.apache.org/repos/asf/spark/blob/e9b2070a/resource-managers/kubernetes/core/src/test/scala/org/apache/spark/scheduler/cluster/k8s/KubernetesClusterSchedulerBackendSuite.scala

[2/2] spark git commit: [SPARK-18278][SCHEDULER] Spark on Kubernetes - Basic Scheduler Backend

2017-11-28 Thread rxin
-on-k8s.github.io/userdocs/running-on-kubernetes.html cc rxin felixcheung mateiz (shepherd) k8s-big-data SIG members & contributors: mccheah ash211 ssuchter varunkatta kimoonkim erikerlandson liyinan926 tnachen ifilonenko Author: Yinan Li <liyinan...@gmail.com> Author: foxish

spark git commit: [SPARK-22369][PYTHON][DOCS] Exposes catalog API documentation in PySpark

2017-11-02 Thread rxin
Repository: spark Updated Branches: refs/heads/master b2463fad7 -> 41b60125b [SPARK-22369][PYTHON][DOCS] Exposes catalog API documentation in PySpark ## What changes were proposed in this pull request? This PR proposes to add a link from `spark.catalog(..)` to `Catalog` and expose Catalog

spark git commit: [SPARK-22408][SQL] RelationalGroupedDataset's distinct pivot value calculation launches unnecessary stages

2017-11-02 Thread rxin
Repository: spark Updated Branches: refs/heads/master 849b465bb -> 277b1924b [SPARK-22408][SQL] RelationalGroupedDataset's distinct pivot value calculation launches unnecessary stages ## What changes were proposed in this pull request? Adding a global limit on top of the distinct values

spark git commit: [MINOR] Data source v2 docs update.

2017-11-01 Thread rxin
How was this patch tested? This is a doc only change. Author: Reynold Xin <r...@databricks.com> Closes #19626 from rxin/dsv2-update. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/d43e1f06 Tree: http://git-wip-us.apache.

spark git commit: [SPARK-22160][SQL] Make sample points per partition (in range partitioner) configurable and bump the default value up to 100

2017-09-28 Thread rxin
sed on chi square test ... Author: Reynold Xin <r...@databricks.com> Closes #19387 from rxin/SPARK-22160. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/323806e6 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/3238

spark git commit: [MINOR][TYPO] Fix typos: runnning and Excecutors

2017-08-18 Thread rxin
Repository: spark Updated Branches: refs/heads/master 7880909c4 -> a2db5c576 [MINOR][TYPO] Fix typos: runnning and Excecutors ## What changes were proposed in this pull request? Fix typos ## How was this patch tested? Existing tests Author: Andrew Ash Closes #18996

spark git commit: [SPARK-21699][SQL] Remove unused getTableOption in ExternalCatalog

2017-08-10 Thread rxin
log. getTableOption. ## How was this patch tested? Removed the test case. Author: Reynold Xin <r...@databricks.com> Closes #18912 from rxin/remove-getTableOption. (cherry picked from commit 584c7f14370cdfafdc6cd554b2760b7ce7709368) Signed-off-by: Reynold Xin <r...@databricks.com> Proj

spark git commit: [SPARK-21699][SQL] Remove unused getTableOption in ExternalCatalog

2017-08-10 Thread rxin
log. getTableOption. ## How was this patch tested? Removed the test case. Author: Reynold Xin <r...@databricks.com> Closes #18912 from rxin/remove-getTableOption. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/584c7f14 Tree: h

spark git commit: [SPARK-21669] Internal API for collecting metrics/stats during FileFormatWriter jobs

2017-08-10 Thread rxin
Repository: spark Updated Branches: refs/heads/master 84454d7d3 -> 95ad960ca [SPARK-21669] Internal API for collecting metrics/stats during FileFormatWriter jobs ## What changes were proposed in this pull request? This patch introduces an internal interface for tracking metrics and/or

spark git commit: [SPARK-21551][PYTHON] Increase timeout for PythonRDD.serveIterator

2017-08-09 Thread rxin
lly configurable). This fixes timeout issues in pyspark when using `collect` and similar functions, in cases where Python may take more than a couple seconds to connect. See https://issues.apache.org/jira/browse/SPARK-21551 ## How was this patch tested? Ran the tests. cc rxin Author: peay

spark git commit: [SPARK-21485][SQL][DOCS] Spark SQL documentation generation for built-in functions

2017-07-26 Thread rxin
Repository: spark Updated Branches: refs/heads/master cf29828d7 -> 60472dbfd [SPARK-21485][SQL][DOCS] Spark SQL documentation generation for built-in functions ## What changes were proposed in this pull request? This generates a documentation for Spark SQL built-in functions. One drawback

spark git commit: [SPARK-21382] The note about Scala 2.10 in building-spark.md is wrong.

2017-07-12 Thread rxin
Repository: spark Updated Branches: refs/heads/master 2cbfc975b -> 24367f23f [SPARK-21382] The note about Scala 2.10 in building-spark.md is wrong. [https://issues.apache.org/jira/browse/SPARK-21382](https://issues.apache.org/jira/browse/SPARK-21382) There should be "Note that support for

spark git commit: [SPARK-21358][EXAMPLES] Argument of repartitionandsortwithinpartitions at pyspark

2017-07-10 Thread rxin
Repository: spark Updated Branches: refs/heads/master d03aebbe6 -> c3713fde8 [SPARK-21358][EXAMPLES] Argument of repartitionandsortwithinpartitions at pyspark ## What changes were proposed in this pull request? At example of repartitionAndSortWithinPartitions at rdd.py, third argument

spark git commit: [SPARK-21323][SQL] Rename plans.logical.statsEstimation.Range to ValueInterval

2017-07-06 Thread rxin
Repository: spark Updated Branches: refs/heads/master 48e44b24a -> bf66335ac [SPARK-21323][SQL] Rename plans.logical.statsEstimation.Range to ValueInterval ## What changes were proposed in this pull request? Rename org.apache.spark.sql.catalyst.plans.logical.statsEstimation.Range to

spark git commit: [SPARK-21103][SQL] QueryPlanConstraints should be part of LogicalPlan

2017-06-20 Thread rxin
nce the constraint framework is only used for query plan rewriting and not for physical planning. ## How was this patch tested? Should be covered by existing tests, since it is a simple refactoring. Author: Reynold Xin <r...@databricks.com> Closes #18310 from rxin/SPARK-21103. Project: http:

spark git commit: [SPARK-21092][SQL] Wire SQLConf in logical plan and expressions

2017-06-14 Thread rxin
<r...@databricks.com> Closes #18299 from rxin/SPARK-21092. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/fffeb6d7 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/fffeb6d7 Diff: http://git-wip-us.apache.org/

spark git commit: [SPARK-21091][SQL] Move constraint code into QueryPlanConstraints

2017-06-14 Thread rxin
n't litter QueryPlan with a lot of constraint private functions. ## How was this patch tested? This is a simple move refactoring and should be covered by existing tests. Author: Reynold Xin <r...@databricks.com> Closes #18298 from rxin/SPARK-21091. Project: http://git-wip-us.apache.org/

spark git commit: [SPARK-21042][SQL] Document Dataset.union is resolution by position

2017-06-09 Thread rxin
een a confusing point for a lot of users. ## How was this patch tested? N/A - doc only change. Author: Reynold Xin <r...@databricks.com> Closes #18256 from rxin/SPARK-21042. (cherry picked from commit b78e3849b20d0d09b7146efd7ce8f203ef67b890) Signed-off-by: Reynold Xin <r...@databricks.com>

spark git commit: [SPARK-21042][SQL] Document Dataset.union is resolution by position

2017-06-09 Thread rxin
ing point for a lot of users. ## How was this patch tested? N/A - doc only change. Author: Reynold Xin <r...@databricks.com> Closes #18256 from rxin/SPARK-21042. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/b78e

spark git commit: [SPARK-20854][TESTS] Removing duplicate test case

2017-06-06 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.2 421d8ecb8 -> 3f93d076b [SPARK-20854][TESTS] Removing duplicate test case ## What changes were proposed in this pull request? Removed a duplicate case in "SPARK-20854: select hint syntax with expressions" ## How was this patch tested?

spark git commit: [SPARK-20854][TESTS] Removing duplicate test case

2017-06-06 Thread rxin
Repository: spark Updated Branches: refs/heads/master c92949ac2 -> cb83ca143 [SPARK-20854][TESTS] Removing duplicate test case ## What changes were proposed in this pull request? Removed a duplicate case in "SPARK-20854: select hint syntax with expressions" ## How was this patch tested?

spark git commit: [SPARK-8184][SQL] Add additional function description for weekofyear

2017-05-29 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.2 26640a269 -> 3b79e4cda [SPARK-8184][SQL] Add additional function description for weekofyear ## What changes were proposed in this pull request? Add additional function description for weekofyear. ## How was this patch tested?

spark git commit: [SPARK-8184][SQL] Add additional function description for weekofyear

2017-05-29 Thread rxin
Repository: spark Updated Branches: refs/heads/master c9749068e -> 1c7db00c7 [SPARK-8184][SQL] Add additional function description for weekofyear ## What changes were proposed in this pull request? Add additional function description for weekofyear. ## How was this patch tested? manual

spark git commit: [SPARK-20857][SQL] Generic resolved hint node

2017-05-23 Thread rxin
ore generic and would allow us to introduce other hint types in the future without introducing new hint nodes. ## How was this patch tested? Updated test cases. Author: Reynold Xin <r...@databricks.com> Closes #18072 from rxin/SPARK-20857. (cherry picked fr

spark git commit: [SPARK-20857][SQL] Generic resolved hint node

2017-05-23 Thread rxin
ric and would allow us to introduce other hint types in the future without introducing new hint nodes. ## How was this patch tested? Updated test cases. Author: Reynold Xin <r...@databricks.com> Closes #18072 from rxin/SPARK-20857. Project: http://git-wip-us.apache.org/repos/asf/spark/re

spark git commit: Revert "[SPARK-12297][SQL] Hive compatibility for Parquet Timestamps"

2017-05-09 Thread rxin
Repository: spark Updated Branches: refs/heads/master 1b85bcd92 -> ac1ab6b9d Revert "[SPARK-12297][SQL] Hive compatibility for Parquet Timestamps" This reverts commit 22691556e5f0dfbac81b8cc9ca0a67c70c1711ca. See JIRA ticket for more information. Project:

spark git commit: [SPARK-20616] RuleExecutor logDebug of batch results should show diff to start of batch

2017-05-05 Thread rxin
Repository: spark Updated Branches: refs/heads/master b31648c08 -> 5d75b14bf [SPARK-20616] RuleExecutor logDebug of batch results should show diff to start of batch ## What changes were proposed in this pull request? Due to a likely typo, the logDebug msg printing the diff of query plans

spark git commit: [SPARK-20616] RuleExecutor logDebug of batch results should show diff to start of batch

2017-05-05 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.2 f59c74a94 -> 1d9b7a74a [SPARK-20616] RuleExecutor logDebug of batch results should show diff to start of batch ## What changes were proposed in this pull request? Due to a likely typo, the logDebug msg printing the diff of query

spark git commit: [SPARK-20616] RuleExecutor logDebug of batch results should show diff to start of batch

2017-05-05 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.1 704b249b6 -> a1112c615 [SPARK-20616] RuleExecutor logDebug of batch results should show diff to start of batch ## What changes were proposed in this pull request? Due to a likely typo, the logDebug msg printing the diff of query

spark git commit: [SPARK-20584][PYSPARK][SQL] Python generic hint support

2017-05-03 Thread rxin
Repository: spark Updated Branches: refs/heads/master 13eb37c86 -> 02bbe7311 [SPARK-20584][PYSPARK][SQL] Python generic hint support ## What changes were proposed in this pull request? Adds `hint` method to PySpark `DataFrame`. ## How was this patch tested? Unit tests, doctests. Author:

spark git commit: [SPARK-20584][PYSPARK][SQL] Python generic hint support

2017-05-03 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.2 a3a5fcfef -> d8bd213f1 [SPARK-20584][PYSPARK][SQL] Python generic hint support ## What changes were proposed in this pull request? Adds `hint` method to PySpark `DataFrame`. ## How was this patch tested? Unit tests, doctests.

spark git commit: [MINOR][SQL] Fix the test title from =!= to <=>, remove a duplicated test and add a test for =!=

2017-05-03 Thread rxin
Repository: spark Updated Branches: refs/heads/master 6b9e49d12 -> 13eb37c86 [MINOR][SQL] Fix the test title from =!= to <=>, remove a duplicated test and add a test for =!= ## What changes were proposed in this pull request? This PR proposes three things as below: - This test looks not

spark git commit: [MINOR][SQL] Fix the test title from =!= to <=>, remove a duplicated test and add a test for =!=

2017-05-03 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.2 36d807906 -> 2629e7c7a [MINOR][SQL] Fix the test title from =!= to <=>, remove a duplicated test and add a test for =!= ## What changes were proposed in this pull request? This PR proposes three things as below: - This test looks

spark git commit: [SPARK-20576][SQL] Support generic hint function in Dataset/DataFrame

2017-05-03 Thread rxin
s well as SQL. As an example, after this patch, the following will apply a broadcast hint on a DataFrame using the new hint function: ``` df1.join(df2.hint("broadcast")) ``` ## How was this patch tested? Added a test case in DataFrameJoinSuite. Author: Reynold Xin <r...@databricks.com

spark git commit: [SPARK-20576][SQL] Support generic hint function in Dataset/DataFrame

2017-05-03 Thread rxin
s well as SQL. As an example, after this patch, the following will apply a broadcast hint on a DataFrame using the new hint function: ``` df1.join(df2.hint("broadcast")) ``` ## How was this patch tested? Added a test case in DataFrameJoinSuite. Author: Reynold Xin <r...@databricks.com

spark git commit: [SPARK-20474] Fixing OnHeapColumnVector reallocation

2017-04-26 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.2 6709bcf6e -> e278876ba [SPARK-20474] Fixing OnHeapColumnVector reallocation ## What changes were proposed in this pull request? OnHeapColumnVector reallocation copies to the new storage data up to 'elementsAppended'. This variable is

spark git commit: [SPARK-20474] Fixing OnHeapColumnVector reallocation

2017-04-26 Thread rxin
Repository: spark Updated Branches: refs/heads/master 99c6cf9ef -> a277ae80a [SPARK-20474] Fixing OnHeapColumnVector reallocation ## What changes were proposed in this pull request? OnHeapColumnVector reallocation copies to the new storage data up to 'elementsAppended'. This variable is only

spark git commit: [SPARK-20473] Enabling missing types in ColumnVector.Array

2017-04-26 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.2 b65858bb3 -> 6709bcf6e [SPARK-20473] Enabling missing types in ColumnVector.Array ## What changes were proposed in this pull request? ColumnVector implementations originally did not support some Catalyst types (float, short, and

spark git commit: [SPARK-20473] Enabling missing types in ColumnVector.Array

2017-04-26 Thread rxin
Repository: spark Updated Branches: refs/heads/master 66dd5b83f -> 99c6cf9ef [SPARK-20473] Enabling missing types in ColumnVector.Array ## What changes were proposed in this pull request? ColumnVector implementations originally did not support some Catalyst types (float, short, and boolean).

spark git commit: [SPARK-20453] Bump master branch version to 2.3.0-SNAPSHOT

2017-04-24 Thread rxin
Repository: spark Updated Branches: refs/heads/master 5280d93e6 -> f44c8a843 [SPARK-20453] Bump master branch version to 2.3.0-SNAPSHOT This patch bumps the master branch version to `2.3.0-SNAPSHOT`. Author: Josh Rosen Closes #17753 from JoshRosen/SPARK-20453.

spark git commit: [SPARK-20420][SQL] Add events to the external catalog

2017-04-21 Thread rxin
Repository: spark Updated Branches: refs/heads/master 48d760d02 -> e2b3d2367 [SPARK-20420][SQL] Add events to the external catalog ## What changes were proposed in this pull request? It is often useful to be able to track changes to the `ExternalCatalog`. This PR makes the `ExternalCatalog`

spark git commit: [SPARK-20420][SQL] Add events to the external catalog

2017-04-21 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.2 6cd2f16b1 -> cddb4b7db [SPARK-20420][SQL] Add events to the external catalog ## What changes were proposed in this pull request? It is often useful to be able to track changes to the `ExternalCatalog`. This PR makes the

spark git commit: Fixed typos in docs

2017-04-19 Thread rxin
Repository: spark Updated Branches: refs/heads/master dd6d55d5d -> bdc605691 Fixed typos in docs ## What changes were proposed in this pull request? Typos at a couple of place in the docs. ## How was this patch tested? build including docs Please review

spark git commit: Fixed typos in docs

2017-04-19 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.2 e6bbdb0c5 -> 8d658b90b Fixed typos in docs ## What changes were proposed in this pull request? Typos at a couple of place in the docs. ## How was this patch tested? build including docs Please review

spark git commit: [SPARK-20398][SQL] range() operator should include cancellation reason when killed

2017-04-19 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.2 af9f18c31 -> e6bbdb0c5 [SPARK-20398][SQL] range() operator should include cancellation reason when killed ## What changes were proposed in this pull request? https://issues.apache.org/jira/browse/SPARK-19820 adds a reason field for

spark git commit: [SPARK-20398][SQL] range() operator should include cancellation reason when killed

2017-04-19 Thread rxin
Repository: spark Updated Branches: refs/heads/master 39e303a8b -> dd6d55d5d [SPARK-20398][SQL] range() operator should include cancellation reason when killed ## What changes were proposed in this pull request? https://issues.apache.org/jira/browse/SPARK-19820 adds a reason field for why

spark git commit: [TEST][MINOR] Replace repartitionBy with distribute in CollapseRepartitionSuite

2017-04-17 Thread rxin
Repository: spark Updated Branches: refs/heads/master 0075562dd -> 33ea908af [TEST][MINOR] Replace repartitionBy with distribute in CollapseRepartitionSuite ## What changes were proposed in this pull request? Replace non-existent `repartitionBy` with `distribute` in

spark git commit: [SPARK-20349][SQL][REVERT-BRANCH2.1] ListFunctions returns duplicate functions after using persistent functions

2017-04-17 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.1 622d7a8bf -> 3808b4728 [SPARK-20349][SQL][REVERT-BRANCH2.1] ListFunctions returns duplicate functions after using persistent functions Revert the changes of https://github.com/apache/spark/pull/17646 made in Branch 2.1, because it

spark git commit: Typo fix: distitrbuted -> distributed

2017-04-17 Thread rxin
Repository: spark Updated Branches: refs/heads/master e5fee3e4f -> 0075562dd Typo fix: distitrbuted -> distributed ## What changes were proposed in this pull request? Typo fix: distitrbuted -> distributed ## How was this patch tested? Existing tests Author: Andrew Ash

spark git commit: [HOTFIX] Fix compilation.

2017-04-17 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.1 db9517c16 -> 622d7a8bf [HOTFIX] Fix compilation. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/622d7a8b Tree:

  1   2   3   4   5   6   7   8   9   10   >