[GitHub] [spark] SparkQA removed a comment on pull request #31348: [SPARK-34245][CORE] Ensure Master removes executors that failed to send finished state

2021-02-18 Thread GitBox
SparkQA removed a comment on pull request #31348: URL: https://github.com/apache/spark/pull/31348#issuecomment-781823524 **[Test build #135258 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135258/testReport)** for PR 31348 at commit

[GitHub] [spark] SparkQA commented on pull request #31348: [SPARK-34245][CORE] Ensure Master removes executors that failed to send finished state

2021-02-18 Thread GitBox
SparkQA commented on pull request #31348: URL: https://github.com/apache/spark/pull/31348#issuecomment-781894513 **[Test build #135258 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135258/testReport)** for PR 31348 at commit

[GitHub] [spark] SparkQA commented on pull request #31545: [SPARK-34417] [SQL] org.apache.spark.sql.DataFrameNaFunctions.fillMap(values: Seq[(String, Any)]) fails for column name having a dot

2021-02-18 Thread GitBox
SparkQA commented on pull request #31545: URL: https://github.com/apache/spark/pull/31545#issuecomment-781893314 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39840/

[GitHub] [spark] Ngone51 commented on pull request #31480: [SPARK-32384][CORE] repartitionAndSortWithinPartitions avoid shuffle with same partitioner

2021-02-18 Thread GitBox
Ngone51 commented on pull request #31480: URL: https://github.com/apache/spark/pull/31480#issuecomment-781891397 LGTM This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] Ngone51 commented on pull request #31480: [SPARK-32384][CORE] repartitionAndSortWithinPartitions avoid shuffle with same partitioner

2021-02-18 Thread GitBox
Ngone51 commented on pull request #31480: URL: https://github.com/apache/spark/pull/31480#issuecomment-781891645 cc @tgravescs @mridulm This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] Ngone51 commented on a change in pull request #31480: [SPARK-32384][CORE] repartitionAndSortWithinPartitions avoid shuffle with same partitioner

2021-02-18 Thread GitBox
Ngone51 commented on a change in pull request #31480: URL: https://github.com/apache/spark/pull/31480#discussion_r578979477 ## File path: core/src/main/scala/org/apache/spark/rdd/OrderedRDDFunctions.scala ## @@ -73,7 +75,23 @@ class OrderedRDDFunctions[K : Ordering : ClassTag,

[GitHub] [spark] SparkQA commented on pull request #31548: [SPARK-34127][SQL] Support table valued command

2021-02-18 Thread GitBox
SparkQA commented on pull request #31548: URL: https://github.com/apache/spark/pull/31548#issuecomment-781887505 **[Test build #135262 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135262/testReport)** for PR 31548 at commit

[GitHub] [spark] SparkQA commented on pull request #31588: [SPARK-34470][ML] VectorSlicer use ordering if possible

2021-02-18 Thread GitBox
SparkQA commented on pull request #31588: URL: https://github.com/apache/spark/pull/31588#issuecomment-781887521 **[Test build #135261 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135261/testReport)** for PR 31588 at commit

[GitHub] [spark] SparkQA commented on pull request #31549: [SPARK-34314][SQL] Fix partitions schema inference

2021-02-18 Thread GitBox
SparkQA commented on pull request #31549: URL: https://github.com/apache/spark/pull/31549#issuecomment-781886866 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39839/

[GitHub] [spark] SparkQA commented on pull request #31495: [SPARK-34383][SS] Optimize WAL commit phase via reducing cost of filesystem operations

2021-02-18 Thread GitBox
SparkQA commented on pull request #31495: URL: https://github.com/apache/spark/pull/31495#issuecomment-781880633 **[Test build #135263 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135263/testReport)** for PR 31495 at commit

[GitHub] [spark] HeartSaVioR commented on pull request #31495: [SPARK-34383][SS] Optimize WAL commit phase via reducing cost of filesystem operations

2021-02-18 Thread GitBox
HeartSaVioR commented on pull request #31495: URL: https://github.com/apache/spark/pull/31495#issuecomment-781878903 retest this, please This is an automated message from the Apache Git Service. To respond to the

[GitHub] [spark] zhengruifeng commented on pull request #31588: [SPARK-34470][ML] VectorSlicer use ordering if possible

2021-02-18 Thread GitBox
zhengruifeng commented on pull request #31588: URL: https://github.com/apache/spark/pull/31588#issuecomment-781877440 test: ``` test("performance") { val rng = new Random(123) val n = 10 val dim = 1 val nnz = 100 val vectors =

[GitHub] [spark] SparkQA commented on pull request #31545: [SPARK-34417] [SQL] org.apache.spark.sql.DataFrameNaFunctions.fillMap(values: Seq[(String, Any)]) fails for column name having a dot

2021-02-18 Thread GitBox
SparkQA commented on pull request #31545: URL: https://github.com/apache/spark/pull/31545#issuecomment-781877375 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39840/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31545: [SPARK-34417] [SQL] org.apache.spark.sql.DataFrameNaFunctions.fillMap(values: Seq[(String, Any)]) fails for column name having

2021-02-18 Thread GitBox
AmplabJenkins removed a comment on pull request #31545: URL: https://github.com/apache/spark/pull/31545#issuecomment-781876771 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/135260/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31570: [WIP][SPARK-10816][SS] SessionWindow support for Structure Streaming

2021-02-18 Thread GitBox
AmplabJenkins removed a comment on pull request #31570: URL: https://github.com/apache/spark/pull/31570#issuecomment-781876774 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39836/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31495: [SPARK-34383][SS] Optimize WAL commit phase via reducing cost of filesystem operations

2021-02-18 Thread GitBox
AmplabJenkins removed a comment on pull request #31495: URL: https://github.com/apache/spark/pull/31495#issuecomment-781876772 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/135250/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31348: [SPARK-34245][CORE] Ensure Master removes executors that failed to send finished state

2021-02-18 Thread GitBox
AmplabJenkins removed a comment on pull request #31348: URL: https://github.com/apache/spark/pull/31348#issuecomment-781876773 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39838/

[GitHub] [spark] zhengruifeng opened a new pull request #31588: [SPARK-34470][ML] VectorSlicer use ordering if possible

2021-02-18 Thread GitBox
zhengruifeng opened a new pull request #31588: URL: https://github.com/apache/spark/pull/31588 ### What changes were proposed in this pull request? 1, add a new method `sliceSorted` for `SparseVector`; 2, in `VectorSlicer`, switch to `sliceSorted` if input indices are ordered.

[GitHub] [spark] AmplabJenkins commented on pull request #31495: [SPARK-34383][SS] Optimize WAL commit phase via reducing cost of filesystem operations

2021-02-18 Thread GitBox
AmplabJenkins commented on pull request #31495: URL: https://github.com/apache/spark/pull/31495#issuecomment-781876772 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/135250/

[GitHub] [spark] AmplabJenkins commented on pull request #31348: [SPARK-34245][CORE] Ensure Master removes executors that failed to send finished state

2021-02-18 Thread GitBox
AmplabJenkins commented on pull request #31348: URL: https://github.com/apache/spark/pull/31348#issuecomment-781876773 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39838/

[GitHub] [spark] AmplabJenkins commented on pull request #31545: [SPARK-34417] [SQL] org.apache.spark.sql.DataFrameNaFunctions.fillMap(values: Seq[(String, Any)]) fails for column name having a dot

2021-02-18 Thread GitBox
AmplabJenkins commented on pull request #31545: URL: https://github.com/apache/spark/pull/31545#issuecomment-781876771 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/135260/

[GitHub] [spark] AmplabJenkins commented on pull request #31570: [WIP][SPARK-10816][SS] SessionWindow support for Structure Streaming

2021-02-18 Thread GitBox
AmplabJenkins commented on pull request #31570: URL: https://github.com/apache/spark/pull/31570#issuecomment-781876774 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39836/

[GitHub] [spark] HyukjinKwon removed a comment on pull request #31496: [SPARK-34384][CORE] Add missing docs for ResourceProfile APIs

2021-02-18 Thread GitBox
HyukjinKwon removed a comment on pull request #31496: URL: https://github.com/apache/spark/pull/31496#issuecomment-781871010 @tgravescs and @holdenk, I plan to cut the RC as soon as possible but it seems like this PR includes two small changes that might matter in compatibility: -

[GitHub] [spark] HyukjinKwon commented on pull request #31496: [SPARK-34384][CORE] Add missing docs for ResourceProfile APIs

2021-02-18 Thread GitBox
HyukjinKwon commented on pull request #31496: URL: https://github.com/apache/spark/pull/31496#issuecomment-781874079 @tgravescs and @holdenk, I plan to cut the RC as soon as possible (after the blocker #31550 is merged), but it seems like this PR includes two small changes that might

[GitHub] [spark] SparkQA commented on pull request #31549: [SPARK-34314][SQL] Fix partitions schema inference

2021-02-18 Thread GitBox
SparkQA commented on pull request #31549: URL: https://github.com/apache/spark/pull/31549#issuecomment-781871787 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39839/

[GitHub] [spark] HyukjinKwon commented on pull request #31496: [SPARK-34384][CORE] Add missing docs for ResourceProfile APIs

2021-02-18 Thread GitBox
HyukjinKwon commented on pull request #31496: URL: https://github.com/apache/spark/pull/31496#issuecomment-781871010 @tgravescs and @holdenk, I plan to cut the RC as soon as possible but it seems like this PR includes two small changes that might matter in compatibility: -

[GitHub] [spark] SparkQA commented on pull request #31570: [WIP][SPARK-10816][SS] SessionWindow support for Structure Streaming

2021-02-18 Thread GitBox
SparkQA commented on pull request #31570: URL: https://github.com/apache/spark/pull/31570#issuecomment-781870068 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39836/

[GitHub] [spark] beliefer commented on pull request #31316: [SPARK-33599][SQL][FOLLOWUP] Group exception messages in catalyst/analysis

2021-02-18 Thread GitBox
beliefer commented on pull request #31316: URL: https://github.com/apache/spark/pull/31316#issuecomment-781866313 ping @cloud-fan This is an automated message from the Apache Git Service. To respond to the message, please

[GitHub] [spark] zhengruifeng commented on a change in pull request #31480: [SPARK-32384][CORE] repartitionAndSortWithinPartitions avoid shuffle with same partitioner

2021-02-18 Thread GitBox
zhengruifeng commented on a change in pull request #31480: URL: https://github.com/apache/spark/pull/31480#discussion_r578958152 ## File path: core/src/main/scala/org/apache/spark/rdd/OrderedRDDFunctions.scala ## @@ -73,7 +75,23 @@ class OrderedRDDFunctions[K : Ordering :

[GitHub] [spark] SparkQA commented on pull request #31348: [SPARK-34245][CORE] Ensure Master removes executors that failed to send finished state

2021-02-18 Thread GitBox
SparkQA commented on pull request #31348: URL: https://github.com/apache/spark/pull/31348#issuecomment-781860361 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39838/

[GitHub] [spark] SparkQA removed a comment on pull request #31495: [SPARK-34383][SS] Optimize WAL commit phase via reducing cost of filesystem operations

2021-02-18 Thread GitBox
SparkQA removed a comment on pull request #31495: URL: https://github.com/apache/spark/pull/31495#issuecomment-781807866 **[Test build #135250 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135250/testReport)** for PR 31495 at commit

[GitHub] [spark] SparkQA commented on pull request #31495: [SPARK-34383][SS] Optimize WAL commit phase via reducing cost of filesystem operations

2021-02-18 Thread GitBox
SparkQA commented on pull request #31495: URL: https://github.com/apache/spark/pull/31495#issuecomment-781859857 **[Test build #135250 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135250/testReport)** for PR 31495 at commit

[GitHub] [spark] SparkQA commented on pull request #31549: [SPARK-34314][SQL] Fix partitions schema inference

2021-02-18 Thread GitBox
SparkQA commented on pull request #31549: URL: https://github.com/apache/spark/pull/31549#issuecomment-781859366 **[Test build #135259 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135259/testReport)** for PR 31549 at commit

[GitHub] [spark] SparkQA commented on pull request #31545: [SPARK-34417] [SQL] org.apache.spark.sql.DataFrameNaFunctions.fillMap(values: Seq[(String, Any)]) fails for column name having a dot

2021-02-18 Thread GitBox
SparkQA commented on pull request #31545: URL: https://github.com/apache/spark/pull/31545#issuecomment-781857670 **[Test build #135260 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135260/testReport)** for PR 31545 at commit

[GitHub] [spark] viirya commented on a change in pull request #31468: [SPARK-34353][SQL] CollectLimitExec avoid shuffle if input rdd has 0/1 partition

2021-02-18 Thread GitBox
viirya commented on a change in pull request #31468: URL: https://github.com/apache/spark/pull/31468#discussion_r578951440 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala ## @@ -52,16 +53,25 @@ case class CollectLimitExec(limit: Int, child:

[GitHub] [spark] SparkQA commented on pull request #31570: [WIP][SPARK-10816][SS] SessionWindow support for Structure Streaming

2021-02-18 Thread GitBox
SparkQA commented on pull request #31570: URL: https://github.com/apache/spark/pull/31570#issuecomment-781851450 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39836/

[GitHub] [spark] viirya commented on a change in pull request #31468: [SPARK-34353][SQL] CollectLimitExec avoid shuffle if input rdd has 0/1 partition

2021-02-18 Thread GitBox
viirya commented on a change in pull request #31468: URL: https://github.com/apache/spark/pull/31468#discussion_r578951088 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala ## @@ -52,16 +53,25 @@ case class CollectLimitExec(limit: Int, child:

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31545: [SPARK-34417] [SQL] org.apache.spark.sql.DataFrameNaFunctions.fillMap(values: Seq[(String, Any)]) fails for column name having

2021-02-18 Thread GitBox
AmplabJenkins removed a comment on pull request #31545: URL: https://github.com/apache/spark/pull/31545#issuecomment-781848151 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31479: [SPARK-34373][SQL] HiveThriftServer2 startWithContext may hang with a race issue

2021-02-18 Thread GitBox
AmplabJenkins removed a comment on pull request #31479: URL: https://github.com/apache/spark/pull/31479#issuecomment-781848152 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39831/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31273: [SPARK-34152][SQL] Make CreateViewStatement.child to be LogicalPlan's children so that it's resolved in analyze phase

2021-02-18 Thread GitBox
AmplabJenkins removed a comment on pull request #31273: URL: https://github.com/apache/spark/pull/31273#issuecomment-781848158 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39834/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31495: [SPARK-34383][SS] Optimize WAL commit phase via reducing cost of filesystem operations

2021-02-18 Thread GitBox
AmplabJenkins removed a comment on pull request #31495: URL: https://github.com/apache/spark/pull/31495#issuecomment-781848148 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39830/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31587: [SPARK-34469][K8S] Ignore RegisterExecutor when SparkContext is stopped

2021-02-18 Thread GitBox
AmplabJenkins removed a comment on pull request #31587: URL: https://github.com/apache/spark/pull/31587#issuecomment-781848147 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39835/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31496: [SPARK-34384][CORE] Add missing docs for ResourceProfile APIs

2021-02-18 Thread GitBox
AmplabJenkins removed a comment on pull request #31496: URL: https://github.com/apache/spark/pull/31496#issuecomment-781848149 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31476: [SPARK-34366][SQL] Add interface for DS v2 metrics

2021-02-18 Thread GitBox
AmplabJenkins removed a comment on pull request #31476: URL: https://github.com/apache/spark/pull/31476#issuecomment-781848150 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39832/

[GitHub] [spark] AmplabJenkins commented on pull request #31476: [SPARK-34366][SQL] Add interface for DS v2 metrics

2021-02-18 Thread GitBox
AmplabJenkins commented on pull request #31476: URL: https://github.com/apache/spark/pull/31476#issuecomment-781848150 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39832/

[GitHub] [spark] AmplabJenkins commented on pull request #31495: [SPARK-34383][SS] Optimize WAL commit phase via reducing cost of filesystem operations

2021-02-18 Thread GitBox
AmplabJenkins commented on pull request #31495: URL: https://github.com/apache/spark/pull/31495#issuecomment-781848148 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39830/

[GitHub] [spark] AmplabJenkins commented on pull request #31587: [SPARK-34469][K8S] Ignore RegisterExecutor when SparkContext is stopped

2021-02-18 Thread GitBox
AmplabJenkins commented on pull request #31587: URL: https://github.com/apache/spark/pull/31587#issuecomment-781848147 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39835/

[GitHub] [spark] AmplabJenkins commented on pull request #31273: [SPARK-34152][SQL] Make CreateViewStatement.child to be LogicalPlan's children so that it's resolved in analyze phase

2021-02-18 Thread GitBox
AmplabJenkins commented on pull request #31273: URL: https://github.com/apache/spark/pull/31273#issuecomment-781848158 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39834/

[GitHub] [spark] AmplabJenkins commented on pull request #31496: [SPARK-34384][CORE] Add missing docs for ResourceProfile APIs

2021-02-18 Thread GitBox
AmplabJenkins commented on pull request #31496: URL: https://github.com/apache/spark/pull/31496#issuecomment-781848149 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #31545: [SPARK-34417] [SQL] org.apache.spark.sql.DataFrameNaFunctions.fillMap(values: Seq[(String, Any)]) fails for column name having a dot

2021-02-18 Thread GitBox
AmplabJenkins commented on pull request #31545: URL: https://github.com/apache/spark/pull/31545#issuecomment-781848151 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #31479: [SPARK-34373][SQL] HiveThriftServer2 startWithContext may hang with a race issue

2021-02-18 Thread GitBox
AmplabJenkins commented on pull request #31479: URL: https://github.com/apache/spark/pull/31479#issuecomment-781848152 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39831/

[GitHub] [spark] SparkQA commented on pull request #31348: [SPARK-34245][CORE] Ensure Master removes executors that failed to send finished state

2021-02-18 Thread GitBox
SparkQA commented on pull request #31348: URL: https://github.com/apache/spark/pull/31348#issuecomment-781845567 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39838/

[GitHub] [spark] SparkQA removed a comment on pull request #31496: [SPARK-34384][CORE] Add missing docs for ResourceProfile APIs

2021-02-18 Thread GitBox
SparkQA removed a comment on pull request #31496: URL: https://github.com/apache/spark/pull/31496#issuecomment-781788234 **[Test build #135246 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135246/testReport)** for PR 31496 at commit

[GitHub] [spark] SparkQA commented on pull request #31496: [SPARK-34384][CORE] Add missing docs for ResourceProfile APIs

2021-02-18 Thread GitBox
SparkQA commented on pull request #31496: URL: https://github.com/apache/spark/pull/31496#issuecomment-781841961 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39826/

[GitHub] [spark] zhengruifeng commented on a change in pull request #31468: [SPARK-34353][SQL] CollectLimitExec avoid shuffle if input rdd has 0/1 partition

2021-02-18 Thread GitBox
zhengruifeng commented on a change in pull request #31468: URL: https://github.com/apache/spark/pull/31468#discussion_r578943394 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala ## @@ -52,16 +53,25 @@ case class CollectLimitExec(limit: Int,

[GitHub] [spark] SparkQA commented on pull request #31496: [SPARK-34384][CORE] Add missing docs for ResourceProfile APIs

2021-02-18 Thread GitBox
SparkQA commented on pull request #31496: URL: https://github.com/apache/spark/pull/31496#issuecomment-781839349 **[Test build #135246 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135246/testReport)** for PR 31496 at commit

[GitHub] [spark] SparkQA commented on pull request #31587: [SPARK-34469][K8S] Ignore RegisterExecutor when SparkContext is stopped

2021-02-18 Thread GitBox
SparkQA commented on pull request #31587: URL: https://github.com/apache/spark/pull/31587#issuecomment-781836772 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39835/

[GitHub] [spark] zhengruifeng commented on a change in pull request #31468: [SPARK-34353][SQL] CollectLimitExec avoid shuffle if input rdd has 0/1 partition

2021-02-18 Thread GitBox
zhengruifeng commented on a change in pull request #31468: URL: https://github.com/apache/spark/pull/31468#discussion_r578940920 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala ## @@ -52,16 +53,25 @@ case class CollectLimitExec(limit: Int,

[GitHub] [spark] SparkQA commented on pull request #31273: [SPARK-34152][SQL] Make CreateViewStatement.child to be LogicalPlan's children so that it's resolved in analyze phase

2021-02-18 Thread GitBox
SparkQA commented on pull request #31273: URL: https://github.com/apache/spark/pull/31273#issuecomment-781835341 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39834/

[GitHub] [spark] SparkQA commented on pull request #31495: [SPARK-34383][SS] Optimize WAL commit phase via reducing cost of filesystem operations

2021-02-18 Thread GitBox
SparkQA commented on pull request #31495: URL: https://github.com/apache/spark/pull/31495#issuecomment-781833776 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39830/

[GitHub] [spark] SparkQA removed a comment on pull request #31545: [SPARK-34417] [SQL] org.apache.spark.sql.DataFrameNaFunctions.fillMap(values: Seq[(String, Any)]) fails for column name having a dot

2021-02-18 Thread GitBox
SparkQA removed a comment on pull request #31545: URL: https://github.com/apache/spark/pull/31545#issuecomment-781829338 **[Test build #135257 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135257/testReport)** for PR 31545 at commit

[GitHub] [spark] SparkQA commented on pull request #31545: [SPARK-34417] [SQL] org.apache.spark.sql.DataFrameNaFunctions.fillMap(values: Seq[(String, Any)]) fails for column name having a dot

2021-02-18 Thread GitBox
SparkQA commented on pull request #31545: URL: https://github.com/apache/spark/pull/31545#issuecomment-781832030 **[Test build #135257 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135257/testReport)** for PR 31545 at commit

[GitHub] [spark] SparkQA commented on pull request #31479: [SPARK-34373][SQL] HiveThriftServer2 startWithContext may hang with a race issue

2021-02-18 Thread GitBox
SparkQA commented on pull request #31479: URL: https://github.com/apache/spark/pull/31479#issuecomment-781832039 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39831/

[GitHub] [spark] SparkQA commented on pull request #31570: [WIP][SPARK-10816][SS] SessionWindow support for Structure Streaming

2021-02-18 Thread GitBox
SparkQA commented on pull request #31570: URL: https://github.com/apache/spark/pull/31570#issuecomment-781829637 **[Test build #135256 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135256/testReport)** for PR 31570 at commit

[GitHub] [spark] SparkQA commented on pull request #31545: [SPARK-34417] [SQL] org.apache.spark.sql.DataFrameNaFunctions.fillMap(values: Seq[(String, Any)]) fails for column name having a dot

2021-02-18 Thread GitBox
SparkQA commented on pull request #31545: URL: https://github.com/apache/spark/pull/31545#issuecomment-781829338 **[Test build #135257 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135257/testReport)** for PR 31545 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31390: [SPARK-28123][SQL] String Functions: support btrim

2021-02-18 Thread GitBox
AmplabJenkins removed a comment on pull request #31390: URL: https://github.com/apache/spark/pull/31390#issuecomment-781826004 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39833/

[GitHub] [spark] AmplabJenkins commented on pull request #31390: [SPARK-28123][SQL] String Functions: support btrim

2021-02-18 Thread GitBox
AmplabJenkins commented on pull request #31390: URL: https://github.com/apache/spark/pull/31390#issuecomment-781826004 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39833/

[GitHub] [spark] SparkQA commented on pull request #31390: [SPARK-28123][SQL] String Functions: support btrim

2021-02-18 Thread GitBox
SparkQA commented on pull request #31390: URL: https://github.com/apache/spark/pull/31390#issuecomment-781825994 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39833/

[GitHub] [spark] SparkQA commented on pull request #31587: [SPARK-34469][K8S] Ignore RegisterExecutor when SparkContext is stopped

2021-02-18 Thread GitBox
SparkQA commented on pull request #31587: URL: https://github.com/apache/spark/pull/31587#issuecomment-781824605 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39835/

[GitHub] [spark] SparkQA commented on pull request #31390: [SPARK-28123][SQL] String Functions: support btrim

2021-02-18 Thread GitBox
SparkQA commented on pull request #31390: URL: https://github.com/apache/spark/pull/31390#issuecomment-781824395 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39833/

[GitHub] [spark] SparkQA commented on pull request #31348: [SPARK-34245][CORE] Ensure Master removes executors that failed to send finished state

2021-02-18 Thread GitBox
SparkQA commented on pull request #31348: URL: https://github.com/apache/spark/pull/31348#issuecomment-781823524 **[Test build #135258 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135258/testReport)** for PR 31348 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31587: [SPARK-34469][K8S] Ignore RegisterExecutor when SparkContext is stopped

2021-02-18 Thread GitBox
AmplabJenkins removed a comment on pull request #31587: URL: https://github.com/apache/spark/pull/31587#issuecomment-781822666 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/135255/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31404: [SPARK-34283][SQL] Combines all adjacent 'Union' operators into a single 'Union' when using 'Dataset.union.distinct.union.disti

2021-02-18 Thread GitBox
AmplabJenkins removed a comment on pull request #31404: URL: https://github.com/apache/spark/pull/31404#issuecomment-781822664 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39828/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31479: [SPARK-34373][SQL] HiveThriftServer2 startWithContext may hang with a race issue

2021-02-18 Thread GitBox
AmplabJenkins removed a comment on pull request #31479: URL: https://github.com/apache/spark/pull/31479#issuecomment-781822661 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30841: [SPARK-28191][SS] New data source - state - reader part

2021-02-18 Thread GitBox
AmplabJenkins removed a comment on pull request #30841: URL: https://github.com/apache/spark/pull/30841#issuecomment-781822668 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39829/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31348: [SPARK-34245][CORE] Ensure Master removes executors that failed to send finished state

2021-02-18 Thread GitBox
AmplabJenkins removed a comment on pull request #31348: URL: https://github.com/apache/spark/pull/31348#issuecomment-781822662 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/135245/

[GitHub] [spark] AmplabJenkins commented on pull request #31479: [SPARK-34373][SQL] HiveThriftServer2 startWithContext may hang with a race issue

2021-02-18 Thread GitBox
AmplabJenkins commented on pull request #31479: URL: https://github.com/apache/spark/pull/31479#issuecomment-781822665 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #31273: [SPARK-34152][SQL] Make CreateViewStatement.child to be LogicalPlan's children so that it's resolved in analyze phase

2021-02-18 Thread GitBox
SparkQA commented on pull request #31273: URL: https://github.com/apache/spark/pull/31273#issuecomment-781822794 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39834/

[GitHub] [spark] AmplabJenkins commented on pull request #31404: [SPARK-34283][SQL] Combines all adjacent 'Union' operators into a single 'Union' when using 'Dataset.union.distinct.union.distinct'

2021-02-18 Thread GitBox
AmplabJenkins commented on pull request #31404: URL: https://github.com/apache/spark/pull/31404#issuecomment-781822664 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39828/

[GitHub] [spark] AmplabJenkins commented on pull request #31587: [SPARK-34469][K8S] Ignore RegisterExecutor when SparkContext is stopped

2021-02-18 Thread GitBox
AmplabJenkins commented on pull request #31587: URL: https://github.com/apache/spark/pull/31587#issuecomment-781822666 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/135255/

[GitHub] [spark] AmplabJenkins commented on pull request #31348: [SPARK-34245][CORE] Ensure Master removes executors that failed to send finished state

2021-02-18 Thread GitBox
AmplabJenkins commented on pull request #31348: URL: https://github.com/apache/spark/pull/31348#issuecomment-781822662 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/135245/

[GitHub] [spark] AmplabJenkins commented on pull request #30841: [SPARK-28191][SS] New data source - state - reader part

2021-02-18 Thread GitBox
AmplabJenkins commented on pull request #30841: URL: https://github.com/apache/spark/pull/30841#issuecomment-781822668 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39829/

[GitHub] [spark] baibaichen edited a comment on pull request #30483: [SPARK-33449][SQL] Add File Metadata cache support for Parquet and Orc

2021-02-18 Thread GitBox
baibaichen edited a comment on pull request #30483: URL: https://github.com/apache/spark/pull/30483#issuecomment-781817208 @LuciferYang, where footer is cached, driver or executor? As I understand, the footer will be used at executor side, are you caching the footer at executor side?

[GitHub] [spark] SparkQA commented on pull request #31495: [SPARK-34383][SS] Optimize WAL commit phase via reducing cost of filesystem operations

2021-02-18 Thread GitBox
SparkQA commented on pull request #31495: URL: https://github.com/apache/spark/pull/31495#issuecomment-781819057 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39830/

[GitHub] [spark] SparkQA commented on pull request #31479: [SPARK-34373][SQL] HiveThriftServer2 startWithContext may hang with a race issue

2021-02-18 Thread GitBox
SparkQA commented on pull request #31479: URL: https://github.com/apache/spark/pull/31479#issuecomment-781818065 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39831/

[GitHub] [spark] amandeep-sharma commented on a change in pull request #31545: [SPARK-34417] [SQL] org.apache.spark.sql.DataFrameNaFunctions.fillMap(values: Seq[(String, Any)]) fails for column name h

2021-02-18 Thread GitBox
amandeep-sharma commented on a change in pull request #31545: URL: https://github.com/apache/spark/pull/31545#discussion_r578925797 ## File path: sql/core/src/main/scala/org/apache/spark/sql/DataFrameNaFunctions.scala ## @@ -394,10 +395,11 @@ final class DataFrameNaFunctions

[GitHub] [spark] baibaichen commented on pull request #30483: [SPARK-33449][SQL] Add File Metadata cache support for Parquet and Orc

2021-02-18 Thread GitBox
baibaichen commented on pull request #30483: URL: https://github.com/apache/spark/pull/30483#issuecomment-781817208 @LuciferYang, where footer is cached, driver or executor? As I understand, the footer will be used at executor side, are you caching the footer at executor side?

[GitHub] [spark] SparkQA removed a comment on pull request #31479: [SPARK-34373][SQL] HiveThriftServer2 startWithContext may hang with a race issue

2021-02-18 Thread GitBox
SparkQA removed a comment on pull request #31479: URL: https://github.com/apache/spark/pull/31479#issuecomment-781806582 **[Test build #135251 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135251/testReport)** for PR 31479 at commit

[GitHub] [spark] SparkQA commented on pull request #31479: [SPARK-34373][SQL] HiveThriftServer2 startWithContext may hang with a race issue

2021-02-18 Thread GitBox
SparkQA commented on pull request #31479: URL: https://github.com/apache/spark/pull/31479#issuecomment-781816957 **[Test build #135251 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135251/testReport)** for PR 31479 at commit

[GitHub] [spark] cloud-fan closed pull request #31586: [SPARK-34466][SQL][DOCS] Improve docs for `ALTER TABLE .. RENAME TO`

2021-02-18 Thread GitBox
cloud-fan closed pull request #31586: URL: https://github.com/apache/spark/pull/31586 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [spark] cloud-fan commented on pull request #31586: [SPARK-34466][SQL][DOCS] Improve docs for `ALTER TABLE .. RENAME TO`

2021-02-18 Thread GitBox
cloud-fan commented on pull request #31586: URL: https://github.com/apache/spark/pull/31586#issuecomment-781815508 thanks, merging to master! This is an automated message from the Apache Git Service. To respond to the

[GitHub] [spark] SparkQA commented on pull request #31479: [SPARK-34373][SQL] HiveThriftServer2 startWithContext may hang with a race issue

2021-02-18 Thread GitBox
SparkQA commented on pull request #31479: URL: https://github.com/apache/spark/pull/31479#issuecomment-781814471 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39827/

[GitHub] [spark] MaxGekk commented on pull request #31586: [SPARK-34466][SQL][DOCS] Improve docs for `ALTER TABLE .. RENAME TO`

2021-02-18 Thread GitBox
MaxGekk commented on pull request #31586: URL: https://github.com/apache/spark/pull/31586#issuecomment-781814151 @cloud-fan Do the changes make sense to you? This is an automated message from the Apache Git Service. To

[GitHub] [spark] SparkQA removed a comment on pull request #31587: [SPARK-34469][K8S] Ignore RegisterExecutor when SparkContext is stopped

2021-02-18 Thread GitBox
SparkQA removed a comment on pull request #31587: URL: https://github.com/apache/spark/pull/31587#issuecomment-781809577 **[Test build #135255 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135255/testReport)** for PR 31587 at commit

[GitHub] [spark] SparkQA commented on pull request #31587: [SPARK-34469][K8S] Ignore RegisterExecutor when SparkContext is stopped

2021-02-18 Thread GitBox
SparkQA commented on pull request #31587: URL: https://github.com/apache/spark/pull/31587#issuecomment-781813769 **[Test build #135255 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135255/testReport)** for PR 31587 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #31348: [SPARK-34245][CORE] Ensure Master removes executors that failed to send finished state

2021-02-18 Thread GitBox
SparkQA removed a comment on pull request #31348: URL: https://github.com/apache/spark/pull/31348#issuecomment-781762077 **[Test build #135245 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135245/testReport)** for PR 31348 at commit

[GitHub] [spark] SparkQA commented on pull request #31348: [SPARK-34245][CORE] Ensure Master removes executors that failed to send finished state

2021-02-18 Thread GitBox
SparkQA commented on pull request #31348: URL: https://github.com/apache/spark/pull/31348#issuecomment-781812504 **[Test build #135245 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135245/testReport)** for PR 31348 at commit

[GitHub] [spark] SparkQA commented on pull request #30841: [SPARK-28191][SS] New data source - state - reader part

2021-02-18 Thread GitBox
SparkQA commented on pull request #30841: URL: https://github.com/apache/spark/pull/30841#issuecomment-781811974 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39829/

[GitHub] [spark] SparkQA commented on pull request #31587: [SPARK-34469][K8S] Ignore RegisterExecutor when SparkContext is stopped

2021-02-18 Thread GitBox
SparkQA commented on pull request #31587: URL: https://github.com/apache/spark/pull/31587#issuecomment-781809577 **[Test build #135255 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135255/testReport)** for PR 31587 at commit

[GitHub] [spark] SparkQA commented on pull request #31404: [SPARK-34283][SQL] Combines all adjacent 'Union' operators into a single 'Union' when using 'Dataset.union.distinct.union.distinct'

2021-02-18 Thread GitBox
SparkQA commented on pull request #31404: URL: https://github.com/apache/spark/pull/31404#issuecomment-781808687 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39828/

  1   2   3   4   5   6   >