[GitHub] [spark] attilapiros edited a comment on pull request #26901: [SPARK-29152][CORE][2.4] Executor Plugin shutdown when dynamic allocation is enabled

2021-06-02 Thread GitBox
attilapiros edited a comment on pull request #26901: URL: https://github.com/apache/spark/pull/26901#issuecomment-853624270 I know why this caused OOM. Here is the reason and the fix: https://github.com/apache/spark/pull/32748 -- This is an automated message from the Apache Git Service.

[GitHub] [spark] attilapiros commented on pull request #26901: [SPARK-29152][CORE][2.4] Executor Plugin shutdown when dynamic allocation is enabled

2021-06-02 Thread GitBox
attilapiros commented on pull request #26901: URL: https://github.com/apache/spark/pull/26901#issuecomment-853624270 I know why. Here is the reason and the fix: https://github.com/apache/spark/pull/32748 -- This is an automated message from the Apache Git Service. To respond to the messa

[GitHub] [spark] SparkQA commented on pull request #32726: [SPARK-35587][PYTHON][DOCS] Initial porting of Koalas documentation

2021-06-02 Thread GitBox
SparkQA commented on pull request #32726: URL: https://github.com/apache/spark/pull/32726#issuecomment-853621665 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/43795/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #32693: [SPARK-35556][SQL][TESTS] Avoid log NoSuchMethodError when running multiple Hive version related tests

2021-06-02 Thread GitBox
SparkQA commented on pull request #32693: URL: https://github.com/apache/spark/pull/32693#issuecomment-853620621 Kubernetes integration test unable to build dist. exiting with code: 1 URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/43798/ -- This

[GitHub] [spark] SparkQA removed a comment on pull request #32761: [SPARK-35621][SQL] Add rule id pruning to the TypeCoercion rule

2021-06-02 Thread GitBox
SparkQA removed a comment on pull request #32761: URL: https://github.com/apache/spark/pull/32761#issuecomment-853525514 **[Test build #139261 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139261/testReport)** for PR 32761 at commit [`a38fd54`](https://gi

[GitHub] [spark] SparkQA commented on pull request #32738: [SPARK-35474] Enable disallow_untyped_defs mypy check for pyspark.pandas.indexing.

2021-06-02 Thread GitBox
SparkQA commented on pull request #32738: URL: https://github.com/apache/spark/pull/32738#issuecomment-853619841 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/43797/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #32761: [SPARK-35621][SQL] Add rule id pruning to the TypeCoercion rule

2021-06-02 Thread GitBox
SparkQA commented on pull request #32761: URL: https://github.com/apache/spark/pull/32761#issuecomment-853619415 **[Test build #139261 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139261/testReport)** for PR 32761 at commit [`a38fd54`](https://github.co

[GitHub] [spark] SparkQA removed a comment on pull request #32741: [SPARK-35568][SQL] Add the BroadcastExchange after re-optimizing the physical plan to fix the UnsupportedOperationException when ena

2021-06-02 Thread GitBox
SparkQA removed a comment on pull request #32741: URL: https://github.com/apache/spark/pull/32741#issuecomment-853543621 **[Test build #139266 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139266/testReport)** for PR 32741 at commit [`9f43eef`](https://gi

[GitHub] [spark] ulysses-you commented on pull request #32742: [SPARK-35608][SQL] Support AQE optimizer side transformUpWithPruning

2021-06-02 Thread GitBox
ulysses-you commented on pull request #32742: URL: https://github.com/apache/spark/pull/32742#issuecomment-853616190 cc @cloud-fan @sigmod @gengliangwang -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[GitHub] [spark] SparkQA commented on pull request #32741: [SPARK-35568][SQL] Add the BroadcastExchange after re-optimizing the physical plan to fix the UnsupportedOperationException when enabling bo

2021-06-02 Thread GitBox
SparkQA commented on pull request #32741: URL: https://github.com/apache/spark/pull/32741#issuecomment-853615980 **[Test build #139266 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139266/testReport)** for PR 32741 at commit [`9f43eef`](https://github.co

[GitHub] [spark] zhouyejoe commented on pull request #32007: [SPARK-33350][SHUFFLE] Add support to DiskBlockManager to create merge directory and to get the local shuffle merged data

2021-06-02 Thread GitBox
zhouyejoe commented on pull request #32007: URL: https://github.com/apache/spark/pull/32007#issuecomment-853615497 Updated with a slim version, which excludes the handling for multiple attempts case. @Ngone51 Would like to share a little bit more context. We had multiple round of disc

[GitHub] [spark] SparkQA removed a comment on pull request #32726: [SPARK-35587][PYTHON][DOCS] Initial porting of Koalas documentation

2021-06-02 Thread GitBox
SparkQA removed a comment on pull request #32726: URL: https://github.com/apache/spark/pull/32726#issuecomment-853547190 **[Test build #139271 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139271/testReport)** for PR 32726 at commit [`2312bef`](https://gi

[GitHub] [spark] SparkQA commented on pull request #32726: [SPARK-35587][PYTHON][DOCS] Initial porting of Koalas documentation

2021-06-02 Thread GitBox
SparkQA commented on pull request #32726: URL: https://github.com/apache/spark/pull/32726#issuecomment-853612867 **[Test build #139271 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139271/testReport)** for PR 32726 at commit [`2312bef`](https://github.co

[GitHub] [spark] SparkQA commented on pull request #32763: [SPARK-35058][SQL] Group exception messages in hive/client

2021-06-02 Thread GitBox
SparkQA commented on pull request #32763: URL: https://github.com/apache/spark/pull/32763#issuecomment-853612263 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/43794/ -- This is an automated message from the A

[GitHub] [spark] SparkQA removed a comment on pull request #32726: [SPARK-35587][PYTHON][DOCS] Initial porting of Koalas documentation

2021-06-02 Thread GitBox
SparkQA removed a comment on pull request #32726: URL: https://github.com/apache/spark/pull/32726#issuecomment-853543625 **[Test build #139267 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139267/testReport)** for PR 32726 at commit [`e7a4af3`](https://gi

[GitHub] [spark] zhengruifeng commented on pull request #32734: [SPARK-35423][ML] PCA results should be consistent, If the Matrix contains both Sparse and Dense vectors

2021-06-02 Thread GitBox
zhengruifeng commented on pull request #32734: URL: https://github.com/apache/spark/pull/32734#issuecomment-853608593 Besides PCA, there are other impls that rely on the estimation of sparsity. what about just adding a boolean param named `preferSpare` to choose the code path? -- This i

[GitHub] [spark] zhengruifeng commented on pull request #32734: [SPARK-35423][ML] PCA results should be consistent, If the Matrix contains both Sparse and Dense vectors

2021-06-02 Thread GitBox
zhengruifeng commented on pull request #32734: URL: https://github.com/apache/spark/pull/32734#issuecomment-853607458 current estimation of data sparsity/density by `first().isInstanceOf[DenseVector]` is too simple `!rows.filter(_.isInstanceOf[DenseVector]` is better, but it still ma

[GitHub] [spark] SparkQA commented on pull request #32726: [SPARK-35587][PYTHON][DOCS] Initial porting of Koalas documentation

2021-06-02 Thread GitBox
SparkQA commented on pull request #32726: URL: https://github.com/apache/spark/pull/32726#issuecomment-853606468 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/43791/ -- This is an automated message from the A

[GitHub] [spark] SparkQA commented on pull request #32303: [SPARK-34382][SQL] Support LATERAL subqueries

2021-06-02 Thread GitBox
SparkQA commented on pull request #32303: URL: https://github.com/apache/spark/pull/32303#issuecomment-853606443 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/43792/ -- This is an automated message from the A

[GitHub] [spark] zhouyejoe commented on a change in pull request #32007: [SPARK-33350][SHUFFLE] Add support to DiskBlockManager to create merge directory and to get the local shuffle merged data

2021-06-02 Thread GitBox
zhouyejoe commented on a change in pull request #32007: URL: https://github.com/apache/spark/pull/32007#discussion_r644520610 ## File path: core/src/main/scala/org/apache/spark/storage/BlockManager.scala ## @@ -728,6 +736,27 @@ private[spark] class BlockManager( } } +

[GitHub] [spark] zhouyejoe commented on a change in pull request #32007: [SPARK-33350][SHUFFLE] Add support to DiskBlockManager to create merge directory and to get the local shuffle merged data

2021-06-02 Thread GitBox
zhouyejoe commented on a change in pull request #32007: URL: https://github.com/apache/spark/pull/32007#discussion_r644519782 ## File path: core/src/main/scala/org/apache/spark/storage/BlockManager.scala ## @@ -728,6 +736,27 @@ private[spark] class BlockManager( } } +

[GitHub] [spark] zhouyejoe commented on a change in pull request #32007: [SPARK-33350][SHUFFLE] Add support to DiskBlockManager to create merge directory and to get the local shuffle merged data

2021-06-02 Thread GitBox
zhouyejoe commented on a change in pull request #32007: URL: https://github.com/apache/spark/pull/32007#discussion_r644519782 ## File path: core/src/main/scala/org/apache/spark/storage/BlockManager.scala ## @@ -728,6 +736,27 @@ private[spark] class BlockManager( } } +

[GitHub] [spark] SparkQA commented on pull request #32741: [SPARK-35568][SQL] Add the BroadcastExchange after re-optimizing the physical plan to fix the UnsupportedOperationException when enabling bo

2021-06-02 Thread GitBox
SparkQA commented on pull request #32741: URL: https://github.com/apache/spark/pull/32741#issuecomment-853604946 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/43790/ -- This is an automated message from the A

[GitHub] [spark] SparkQA commented on pull request #32726: [SPARK-35587][PYTHON][DOCS] Initial porting of Koalas documentation

2021-06-02 Thread GitBox
SparkQA commented on pull request #32726: URL: https://github.com/apache/spark/pull/32726#issuecomment-853604463 **[Test build #139267 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139267/testReport)** for PR 32726 at commit [`e7a4af3`](https://github.co

[GitHub] [spark] zhouyejoe commented on a change in pull request #32007: [SPARK-33350][SHUFFLE] Add support to DiskBlockManager to create merge directory and to get the local shuffle merged data

2021-06-02 Thread GitBox
zhouyejoe commented on a change in pull request #32007: URL: https://github.com/apache/spark/pull/32007#discussion_r644517373 ## File path: core/src/main/scala/org/apache/spark/storage/BlockManager.scala ## @@ -504,7 +504,8 @@ private[spark] class BlockManager( hostLocal

[GitHub] [spark] wangyum commented on a change in pull request #32741: [SPARK-35568][SQL] Add the BroadcastExchange after re-optimizing the physical plan to fix the UnsupportedOperationException when

2021-06-02 Thread GitBox
wangyum commented on a change in pull request #32741: URL: https://github.com/apache/spark/pull/32741#discussion_r644516090 ## File path: sql/core/src/test/scala/org/apache/spark/sql/DynamicPartitionPruningSuite.scala ## @@ -1505,6 +1505,27 @@ abstract class DynamicPartitionPr

[GitHub] [spark] AmplabJenkins commented on pull request #31102: [SPARK-34054][CORE] BlockManagerDecommissioner code cleanup

2021-06-02 Thread GitBox
AmplabJenkins commented on pull request #31102: URL: https://github.com/apache/spark/pull/31102#issuecomment-853599863 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139268/ -- This

[GitHub] [spark] SparkQA removed a comment on pull request #31102: [SPARK-34054][CORE] BlockManagerDecommissioner code cleanup

2021-06-02 Thread GitBox
SparkQA removed a comment on pull request #31102: URL: https://github.com/apache/spark/pull/31102#issuecomment-853543985 **[Test build #139268 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139268/testReport)** for PR 31102 at commit [`bae07ce`](https://gi

[GitHub] [spark] SparkQA commented on pull request #31102: [SPARK-34054][CORE] BlockManagerDecommissioner code cleanup

2021-06-02 Thread GitBox
SparkQA commented on pull request #31102: URL: https://github.com/apache/spark/pull/31102#issuecomment-853599206 **[Test build #139268 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139268/testReport)** for PR 31102 at commit [`bae07ce`](https://github.co

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32693: [SPARK-35556][SQL][TESTS] Avoid log NoSuchMethodError when running multiple Hive version related tests

2021-06-02 Thread GitBox
AmplabJenkins removed a comment on pull request #32693: URL: https://github.com/apache/spark/pull/32693#issuecomment-853598030 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139274/ -

[GitHub] [spark] SparkQA removed a comment on pull request #32693: [SPARK-35556][SQL][TESTS] Avoid log NoSuchMethodError when running multiple Hive version related tests

2021-06-02 Thread GitBox
SparkQA removed a comment on pull request #32693: URL: https://github.com/apache/spark/pull/32693#issuecomment-853595002 **[Test build #139274 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139274/testReport)** for PR 32693 at commit [`9310db7`](https://gi

[GitHub] [spark] SparkQA commented on pull request #32693: [SPARK-35556][SQL][TESTS] Avoid log NoSuchMethodError when running multiple Hive version related tests

2021-06-02 Thread GitBox
SparkQA commented on pull request #32693: URL: https://github.com/apache/spark/pull/32693#issuecomment-853598000 **[Test build #139274 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139274/testReport)** for PR 32693 at commit [`9310db7`](https://github.co

[GitHub] [spark] AmplabJenkins commented on pull request #32693: [SPARK-35556][SQL][TESTS] Avoid log NoSuchMethodError when running multiple Hive version related tests

2021-06-02 Thread GitBox
AmplabJenkins commented on pull request #32693: URL: https://github.com/apache/spark/pull/32693#issuecomment-853598030 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139274/ -- This

[GitHub] [spark] SparkQA commented on pull request #30135: [SPARK-29250][BUILD] Upgrade to Hadoop 3.3.1

2021-06-02 Thread GitBox
SparkQA commented on pull request #30135: URL: https://github.com/apache/spark/pull/30135#issuecomment-853595736 **[Test build #139275 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139275/testReport)** for PR 30135 at commit [`547ccb0`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #32693: [SPARK-35556][SQL][TESTS] Avoid log NoSuchMethodError when running multiple Hive version related tests

2021-06-02 Thread GitBox
SparkQA commented on pull request #32693: URL: https://github.com/apache/spark/pull/32693#issuecomment-853595002 **[Test build #139274 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139274/testReport)** for PR 32693 at commit [`9310db7`](https://github.com

[GitHub] [spark] zhengruifeng commented on pull request #32759: [SPARK-35619][ML] Refactor LinearRegression - make huber support virtual centering

2021-06-02 Thread GitBox
zhengruifeng commented on pull request #32759: URL: https://github.com/apache/spark/pull/32759#issuecomment-853593720 friendly ping @srowen @WeichenXu123 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[GitHub] [spark] SparkQA removed a comment on pull request #32738: [SPARK-35474] Enable disallow_untyped_defs mypy check for pyspark.pandas.indexing.

2021-06-02 Thread GitBox
SparkQA removed a comment on pull request #32738: URL: https://github.com/apache/spark/pull/32738#issuecomment-853590541 **[Test build #139272 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139272/testReport)** for PR 32738 at commit [`6d1a050`](https://gi

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32737: [SPARK-35606][PYTHON][INFRA] List Python 3.9 installed libraries in build_and_test workflow

2021-06-02 Thread GitBox
AmplabJenkins removed a comment on pull request #32737: URL: https://github.com/apache/spark/pull/32737#issuecomment-853593205 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139264/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32738: [SPARK-35474] Enable disallow_untyped_defs mypy check for pyspark.pandas.indexing.

2021-06-02 Thread GitBox
AmplabJenkins removed a comment on pull request #32738: URL: https://github.com/apache/spark/pull/32738#issuecomment-853593200 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139272/ -

[GitHub] [spark] AmplabJenkins commented on pull request #32737: [SPARK-35606][PYTHON][INFRA] List Python 3.9 installed libraries in build_and_test workflow

2021-06-02 Thread GitBox
AmplabJenkins commented on pull request #32737: URL: https://github.com/apache/spark/pull/32737#issuecomment-853593205 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139264/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #32738: [SPARK-35474] Enable disallow_untyped_defs mypy check for pyspark.pandas.indexing.

2021-06-02 Thread GitBox
AmplabJenkins commented on pull request #32738: URL: https://github.com/apache/spark/pull/32738#issuecomment-853593200 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139272/ -- This

[GitHub] [spark] SparkQA commented on pull request #32738: [SPARK-35474] Enable disallow_untyped_defs mypy check for pyspark.pandas.indexing.

2021-06-02 Thread GitBox
SparkQA commented on pull request #32738: URL: https://github.com/apache/spark/pull/32738#issuecomment-853593163 **[Test build #139272 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139272/testReport)** for PR 32738 at commit [`6d1a050`](https://github.co

[GitHub] [spark] SparkQA commented on pull request #32764: [SPARK-35390][SQL] Handle type coercion when resolving V2 functions

2021-06-02 Thread GitBox
SparkQA commented on pull request #32764: URL: https://github.com/apache/spark/pull/32764#issuecomment-853592547 **[Test build #139273 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139273/testReport)** for PR 32764 at commit [`2add960`](https://github.com

[GitHub] [spark] SparkQA removed a comment on pull request #32737: [SPARK-35606][PYTHON][INFRA] List Python 3.9 installed libraries in build_and_test workflow

2021-06-02 Thread GitBox
SparkQA removed a comment on pull request #32737: URL: https://github.com/apache/spark/pull/32737#issuecomment-853530191 **[Test build #139264 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139264/testReport)** for PR 32737 at commit [`5ba0b32`](https://gi

[GitHub] [spark] SparkQA commented on pull request #32763: [SPARK-35058][SQL] Group exception messages in hive/client

2021-06-02 Thread GitBox
SparkQA commented on pull request #32763: URL: https://github.com/apache/spark/pull/32763#issuecomment-853592315 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/43794/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #32737: [SPARK-35606][PYTHON][INFRA] List Python 3.9 installed libraries in build_and_test workflow

2021-06-02 Thread GitBox
SparkQA commented on pull request #32737: URL: https://github.com/apache/spark/pull/32737#issuecomment-853591907 **[Test build #139264 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139264/testReport)** for PR 32737 at commit [`5ba0b32`](https://github.co

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31102: [SPARK-34054][CORE] BlockManagerDecommissioner code cleanup

2021-06-02 Thread GitBox
AmplabJenkins removed a comment on pull request #31102: URL: https://github.com/apache/spark/pull/31102#issuecomment-853591318 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/43793/

[GitHub] [spark] SparkQA commented on pull request #31102: [SPARK-34054][CORE] BlockManagerDecommissioner code cleanup

2021-06-02 Thread GitBox
SparkQA commented on pull request #31102: URL: https://github.com/apache/spark/pull/31102#issuecomment-853591298 Kubernetes integration test unable to build dist. exiting with code: 1 URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/43793/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #31102: [SPARK-34054][CORE] BlockManagerDecommissioner code cleanup

2021-06-02 Thread GitBox
AmplabJenkins commented on pull request #31102: URL: https://github.com/apache/spark/pull/31102#issuecomment-853591318 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/43793/ -- T

[GitHub] [spark] sunchao commented on a change in pull request #32764: [SPARK-35390][SQL] Handle type coercion when resolving V2 functions

2021-06-02 Thread GitBox
sunchao commented on a change in pull request #32764: URL: https://github.com/apache/spark/pull/32764#discussion_r644507916 ## File path: sql/catalyst/src/main/java/org/apache/spark/sql/connector/catalog/functions/ScalarFunction.java ## @@ -117,6 +120,8 @@ * {@link org.apa

[GitHub] [spark] sunchao opened a new pull request #32764: [SPARK-35390][SQL] Handle type coercion when resolving V2 functions

2021-06-02 Thread GitBox
sunchao opened a new pull request #32764: URL: https://github.com/apache/spark/pull/32764 ### What changes were proposed in this pull request? Handle type coercion when resolving V2 function. In particular: - prior to evaluating function arguments, insert cast whenever

[GitHub] [spark] SparkQA commented on pull request #32738: [SPARK-35474] Enable disallow_untyped_defs mypy check for pyspark.pandas.indexing.

2021-06-02 Thread GitBox
SparkQA commented on pull request #32738: URL: https://github.com/apache/spark/pull/32738#issuecomment-853590541 **[Test build #139272 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139272/testReport)** for PR 32738 at commit [`6d1a050`](https://github.com

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32751: [SPARK-35612][SQL] Support LZ4 compression in ORC data source

2021-06-02 Thread GitBox
AmplabJenkins removed a comment on pull request #32751: URL: https://github.com/apache/spark/pull/32751#issuecomment-853589832 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/43789/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32754: [SPARK-35613][CORE][SQL] Cache commonly occurring strings in SQLMetrics, JSONProtocol and AccumulatorV2 classes

2021-06-02 Thread GitBox
AmplabJenkins removed a comment on pull request #32754: URL: https://github.com/apache/spark/pull/32754#issuecomment-853589833 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139262/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32737: [SPARK-35606][PYTHON][INFRA] List Python 3.9 installed libraries in build_and_test workflow

2021-06-02 Thread GitBox
AmplabJenkins removed a comment on pull request #32737: URL: https://github.com/apache/spark/pull/32737#issuecomment-853589834 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/43788/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32761: [SPARK-35621][SQL] Add rule id pruning to the TypeCoercion rule

2021-06-02 Thread GitBox
AmplabJenkins removed a comment on pull request #32761: URL: https://github.com/apache/spark/pull/32761#issuecomment-853589830 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/43785/

[GitHub] [spark] AmplabJenkins commented on pull request #32754: [SPARK-35613][CORE][SQL] Cache commonly occurring strings in SQLMetrics, JSONProtocol and AccumulatorV2 classes

2021-06-02 Thread GitBox
AmplabJenkins commented on pull request #32754: URL: https://github.com/apache/spark/pull/32754#issuecomment-853589833 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139262/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #32751: [SPARK-35612][SQL] Support LZ4 compression in ORC data source

2021-06-02 Thread GitBox
AmplabJenkins commented on pull request #32751: URL: https://github.com/apache/spark/pull/32751#issuecomment-853589832 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/43789/ -- T

[GitHub] [spark] AmplabJenkins commented on pull request #32761: [SPARK-35621][SQL] Add rule id pruning to the TypeCoercion rule

2021-06-02 Thread GitBox
AmplabJenkins commented on pull request #32761: URL: https://github.com/apache/spark/pull/32761#issuecomment-853589830 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/43785/ -- T

[GitHub] [spark] AmplabJenkins commented on pull request #32737: [SPARK-35606][PYTHON][INFRA] List Python 3.9 installed libraries in build_and_test workflow

2021-06-02 Thread GitBox
AmplabJenkins commented on pull request #32737: URL: https://github.com/apache/spark/pull/32737#issuecomment-853589834 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/43788/ -- T

[GitHub] [spark] SparkQA commented on pull request #32726: [SPARK-35587][PYTHON][DOCS] Initial porting of Koalas documentation

2021-06-02 Thread GitBox
SparkQA commented on pull request #32726: URL: https://github.com/apache/spark/pull/32726#issuecomment-853588189 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/43791/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #32303: [SPARK-34382][SQL] Support LATERAL subqueries

2021-06-02 Thread GitBox
SparkQA commented on pull request #32303: URL: https://github.com/apache/spark/pull/32303#issuecomment-853588123 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/43792/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #32761: [SPARK-35621][SQL] Add rule id pruning to the TypeCoercion rule

2021-06-02 Thread GitBox
SparkQA commented on pull request #32761: URL: https://github.com/apache/spark/pull/32761#issuecomment-853586808 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/43785/ -- This is an automated message from the A

[GitHub] [spark] SparkQA commented on pull request #32741: [SPARK-35568][SQL] Add the BroadcastExchange after re-optimizing the physical plan to fix the UnsupportedOperationException when enabling bo

2021-06-02 Thread GitBox
SparkQA commented on pull request #32741: URL: https://github.com/apache/spark/pull/32741#issuecomment-853586016 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/43790/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA removed a comment on pull request #32754: [SPARK-35613][CORE][SQL] Cache commonly occurring strings in SQLMetrics, JSONProtocol and AccumulatorV2 classes

2021-06-02 Thread GitBox
SparkQA removed a comment on pull request #32754: URL: https://github.com/apache/spark/pull/32754#issuecomment-853525564 **[Test build #139262 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139262/testReport)** for PR 32754 at commit [`2cb9f8a`](https://gi

[GitHub] [spark] SparkQA commented on pull request #32754: [SPARK-35613][CORE][SQL] Cache commonly occurring strings in SQLMetrics, JSONProtocol and AccumulatorV2 classes

2021-06-02 Thread GitBox
SparkQA commented on pull request #32754: URL: https://github.com/apache/spark/pull/32754#issuecomment-853580364 **[Test build #139262 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139262/testReport)** for PR 32754 at commit [`2cb9f8a`](https://github.co

[GitHub] [spark] SparkQA commented on pull request #32737: [SPARK-35606][PYTHON][INFRA] List Python 3.9 installed libraries in build_and_test workflow

2021-06-02 Thread GitBox
SparkQA commented on pull request #32737: URL: https://github.com/apache/spark/pull/32737#issuecomment-853578682 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/43788/ -- This is an automated message from the A

[GitHub] [spark] HyukjinKwon commented on a change in pull request #32745: [SPARK-35523] Fix the default value in Data Source Options page

2021-06-02 Thread GitBox
HyukjinKwon commented on a change in pull request #32745: URL: https://github.com/apache/spark/pull/32745#discussion_r644497726 ## File path: docs/sql-data-sources-text.md ## @@ -57,7 +57,7 @@ Data source options of text can be set via: Review comment: Can we change

[GitHub] [spark] pingsutw commented on a change in pull request #32738: [SPARK-35474] Enable disallow_untyped_defs mypy check for pyspark.pandas.indexing.

2021-06-02 Thread GitBox
pingsutw commented on a change in pull request #32738: URL: https://github.com/apache/spark/pull/32738#discussion_r644490173 ## File path: python/pyspark/pandas/indexing.py ## @@ -608,7 +608,9 @@ def __setitem__(self, key, value): if cond is None:

[GitHub] [spark] HyukjinKwon commented on pull request #32745: [SPARK-35523] Fix the default value in Data Source Options page

2021-06-02 Thread GitBox
HyukjinKwon commented on pull request #32745: URL: https://github.com/apache/spark/pull/32745#issuecomment-853577413 can we add default values at https://github.com/apache/spark/blob/5ff5770e5c4aeeec9c5f0ab173c49dfe003e5eba/docs/sql-data-sources-jdbc.md too? -- This is an automated mess

[GitHub] [spark] pingsutw commented on a change in pull request #32738: [SPARK-35474] Enable disallow_untyped_defs mypy check for pyspark.pandas.indexing.

2021-06-02 Thread GitBox
pingsutw commented on a change in pull request #32738: URL: https://github.com/apache/spark/pull/32738#discussion_r644495912 ## File path: python/pyspark/pandas/indexing.py ## @@ -1246,7 +1255,8 @@ def _select_cols_by_iterable( % (len(cast(Sized, cols_sel))

[GitHub] [spark] SparkQA commented on pull request #32751: [SPARK-35612][SQL] Support LZ4 compression in ORC data source

2021-06-02 Thread GitBox
SparkQA commented on pull request #32751: URL: https://github.com/apache/spark/pull/32751#issuecomment-853575054 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/43789/ -- This is an automated message from the A

[GitHub] [spark] pingsutw commented on a change in pull request #32738: [SPARK-35474] Enable disallow_untyped_defs mypy check for pyspark.pandas.indexing.

2021-06-02 Thread GitBox
pingsutw commented on a change in pull request #32738: URL: https://github.com/apache/spark/pull/32738#discussion_r644493906 ## File path: python/pyspark/pandas/indexing.py ## @@ -1138,7 +1146,8 @@ def _select_rows_else( ) def _get_from_multiindex_column( -

[GitHub] [spark] AmplabJenkins commented on pull request #32735: [SPARK-35580][SQL] Implement canonicalized method for HigherOrderFunction

2021-06-02 Thread GitBox
AmplabJenkins commented on pull request #32735: URL: https://github.com/apache/spark/pull/32735#issuecomment-853570290 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/43787/ -- T

[GitHub] [spark] SparkQA commented on pull request #32735: [SPARK-35580][SQL] Implement canonicalized method for HigherOrderFunction

2021-06-02 Thread GitBox
SparkQA commented on pull request #32735: URL: https://github.com/apache/spark/pull/32735#issuecomment-853570260 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/43787/ -- This is an automated message from the A

[GitHub] [spark] pingsutw commented on a change in pull request #32738: [SPARK-35474] Enable disallow_untyped_defs mypy check for pyspark.pandas.indexing.

2021-06-02 Thread GitBox
pingsutw commented on a change in pull request #32738: URL: https://github.com/apache/spark/pull/32738#discussion_r644490173 ## File path: python/pyspark/pandas/indexing.py ## @@ -608,7 +608,9 @@ def __setitem__(self, key, value): if cond is None:

[GitHub] [spark] pingsutw commented on a change in pull request #32738: [SPARK-35474] Enable disallow_untyped_defs mypy check for pyspark.pandas.indexing.

2021-06-02 Thread GitBox
pingsutw commented on a change in pull request #32738: URL: https://github.com/apache/spark/pull/32738#discussion_r644489741 ## File path: python/pyspark/pandas/indexing.py ## @@ -514,7 +514,7 @@ def __getitem__(self, key) -> Union["Series", "DataFrame"]: except Analys

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32750: [SPARK-33696][BUILD][SQL] Upgrade built-in Hive to 2.3.9

2021-06-02 Thread GitBox
AmplabJenkins removed a comment on pull request #32750: URL: https://github.com/apache/spark/pull/32750#issuecomment-853566086 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139258/ -

[GitHub] [spark] HyukjinKwon commented on a change in pull request #32737: [SPARK-35606][PYTHON][INFRA] List Python 3.9 installed libraries in build_and_test workflow

2021-06-02 Thread GitBox
HyukjinKwon commented on a change in pull request #32737: URL: https://github.com/apache/spark/pull/32737#discussion_r644487409 ## File path: .github/workflows/build_and_test.yml ## @@ -217,6 +217,11 @@ jobs: run: | python3.6 -m pip install numpy 'pyarrow<3.0.0'

[GitHub] [spark] LuciferYang commented on pull request #32710: [SPARK-35574][BUILD] Add a compile arg to turn compilation warnings related to `procedure syntax` to compilation errors in Scala 2.13

2021-06-02 Thread GitBox
LuciferYang commented on pull request #32710: URL: https://github.com/apache/spark/pull/32710#issuecomment-853566158 thx @srowen @HyukjinKwon @gengliangwang -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL abo

[GitHub] [spark] AmplabJenkins commented on pull request #32750: [SPARK-33696][BUILD][SQL] Upgrade built-in Hive to 2.3.9

2021-06-02 Thread GitBox
AmplabJenkins commented on pull request #32750: URL: https://github.com/apache/spark/pull/32750#issuecomment-853566086 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139258/ -- This

[GitHub] [spark] SparkQA removed a comment on pull request #32750: [SPARK-33696][BUILD][SQL] Upgrade built-in Hive to 2.3.9

2021-06-02 Thread GitBox
SparkQA removed a comment on pull request #32750: URL: https://github.com/apache/spark/pull/32750#issuecomment-853507209 **[Test build #139258 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139258/testReport)** for PR 32750 at commit [`c7194cf`](https://gi

[GitHub] [spark] SparkQA commented on pull request #32750: [SPARK-33696][BUILD][SQL] Upgrade built-in Hive to 2.3.9

2021-06-02 Thread GitBox
SparkQA commented on pull request #32750: URL: https://github.com/apache/spark/pull/32750#issuecomment-853565054 **[Test build #139258 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139258/testReport)** for PR 32750 at commit [`c7194cf`](https://github.co

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32754: [SPARK-35613][CORE][SQL] Cache commonly occurring strings in SQLMetrics, JSONProtocol and AccumulatorV2 classes

2021-06-02 Thread GitBox
AmplabJenkins removed a comment on pull request #32754: URL: https://github.com/apache/spark/pull/32754#issuecomment-853564361 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/43786/

[GitHub] [spark] AmplabJenkins commented on pull request #32754: [SPARK-35613][CORE][SQL] Cache commonly occurring strings in SQLMetrics, JSONProtocol and AccumulatorV2 classes

2021-06-02 Thread GitBox
AmplabJenkins commented on pull request #32754: URL: https://github.com/apache/spark/pull/32754#issuecomment-853564361 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/43786/ -- T

[GitHub] [spark] SparkQA commented on pull request #32737: [SPARK-35606][PYTHON][INFRA] List Python 3.9 installed libraries in build_and_test workflow

2021-06-02 Thread GitBox
SparkQA commented on pull request #32737: URL: https://github.com/apache/spark/pull/32737#issuecomment-853564322 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/43788/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #32754: [SPARK-35613][CORE][SQL] Cache commonly occurring strings in SQLMetrics, JSONProtocol and AccumulatorV2 classes

2021-06-02 Thread GitBox
SparkQA commented on pull request #32754: URL: https://github.com/apache/spark/pull/32754#issuecomment-853564346 Kubernetes integration test unable to build dist. exiting with code: 1 URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/43786/ -- This

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32726: [SPARK-35587][PYTHON][DOCS] Initial porting of Koalas documentation

2021-06-02 Thread GitBox
AmplabJenkins removed a comment on pull request #32726: URL: https://github.com/apache/spark/pull/32726#issuecomment-853563766 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139259/ -

[GitHub] [spark] AmplabJenkins commented on pull request #32726: [SPARK-35587][PYTHON][DOCS] Initial porting of Koalas documentation

2021-06-02 Thread GitBox
AmplabJenkins commented on pull request #32726: URL: https://github.com/apache/spark/pull/32726#issuecomment-853563766 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139259/ -- This

[GitHub] [spark] SparkQA removed a comment on pull request #32726: [SPARK-35587][PYTHON][DOCS] Initial porting of Koalas documentation

2021-06-02 Thread GitBox
SparkQA removed a comment on pull request #32726: URL: https://github.com/apache/spark/pull/32726#issuecomment-853508975 **[Test build #139259 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139259/testReport)** for PR 32726 at commit [`0a7b613`](https://gi

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32743: [SPARK-35396][CORE][FOLLOWUP] Free memory entry immediately

2021-06-02 Thread GitBox
AmplabJenkins removed a comment on pull request #32743: URL: https://github.com/apache/spark/pull/32743#issuecomment-853563089 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139255/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32762: [SPARK-35081][DOCS] Add Data Source Option links to missing documents

2021-06-02 Thread GitBox
AmplabJenkins removed a comment on pull request #32762: URL: https://github.com/apache/spark/pull/32762#issuecomment-853563087 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/43784/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32756: [SPARK-35589][CORE][3.1] BlockManagerMasterEndpoint should not ignore index-only shuffle file during updating

2021-06-02 Thread GitBox
AmplabJenkins removed a comment on pull request #32756: URL: https://github.com/apache/spark/pull/32756#issuecomment-853563093 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139254/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32760: [SPARK-35620][BUILD][PYTHON] Remove documentation build in Python linter

2021-06-02 Thread GitBox
AmplabJenkins removed a comment on pull request #32760: URL: https://github.com/apache/spark/pull/32760#issuecomment-853563086 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139257/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32726: [SPARK-35587][PYTHON][DOCS] Initial porting of Koalas documentation

2021-06-02 Thread GitBox
AmplabJenkins removed a comment on pull request #32726: URL: https://github.com/apache/spark/pull/32726#issuecomment-853563090 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/43783/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32750: [SPARK-33696][BUILD][SQL] Upgrade built-in Hive to 2.3.9

2021-06-02 Thread GitBox
AmplabJenkins removed a comment on pull request #32750: URL: https://github.com/apache/spark/pull/32750#issuecomment-853563085 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/43782/

[GitHub] [spark] AmplabJenkins commented on pull request #32726: [SPARK-35587][PYTHON][DOCS] Initial porting of Koalas documentation

2021-06-02 Thread GitBox
AmplabJenkins commented on pull request #32726: URL: https://github.com/apache/spark/pull/32726#issuecomment-853563090 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/43783/ -- T

[GitHub] [spark] SparkQA commented on pull request #32726: [SPARK-35587][PYTHON][DOCS] Initial porting of Koalas documentation

2021-06-02 Thread GitBox
SparkQA commented on pull request #32726: URL: https://github.com/apache/spark/pull/32726#issuecomment-853563062 **[Test build #139259 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139259/testReport)** for PR 32726 at commit [`0a7b613`](https://github.co

[GitHub] [spark] AmplabJenkins commented on pull request #32743: [SPARK-35396][CORE][FOLLOWUP] Free memory entry immediately

2021-06-02 Thread GitBox
AmplabJenkins commented on pull request #32743: URL: https://github.com/apache/spark/pull/32743#issuecomment-853563089 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139255/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #32762: [SPARK-35081][DOCS] Add Data Source Option links to missing documents

2021-06-02 Thread GitBox
AmplabJenkins commented on pull request #32762: URL: https://github.com/apache/spark/pull/32762#issuecomment-853563087 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/43784/ -- T

  1   2   3   4   5   6   7   8   9   >