[GitHub] [spark] SparkQA removed a comment on pull request #34734: [SPARK-37480][K8S][DOC] Sync Kubernetes configuration to latest in running-on-k8s.md

2021-11-28 Thread GitBox
SparkQA removed a comment on pull request #34734: URL: https://github.com/apache/spark/pull/34734#issuecomment-981357525 **[Test build #145700 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145700/testReport)** for PR 34734 at commit [`1f24055`](https://gi

[GitHub] [spark] yaooqinn opened a new pull request #34735: [SPARK-37481][Core] Fix disappearance of skipped stages after they retry

2021-11-28 Thread GitBox
yaooqinn opened a new pull request #34735: URL: https://github.com/apache/spark/pull/34735 ### What changes were proposed in this pull request? When skipped stages retry, their skipped info will be lost on the UI, and then we may see a stage with 200 tasks indeed,

[GitHub] [spark] Yikun commented on pull request #34646: [SPARK-37372][K8S] Removing redundant label addition and refactoring related test case

2021-11-28 Thread GitBox
Yikun commented on pull request #34646: URL: https://github.com/apache/spark/pull/34646#issuecomment-981371597 @dongjoon-hyun Would you mind taking a look again? Or I misundertanded your suggestion, it's not enough to update the PR message, I should split this PR to 2 PRs: 1. Remove the

[GitHub] [spark] SparkQA commented on pull request #34715: [SPARK-37445][BUILD] Rename the maven hadoop profile to hadoop-3 and hadoop-2

2021-11-28 Thread GitBox
SparkQA commented on pull request #34715: URL: https://github.com/apache/spark/pull/34715#issuecomment-981369425 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50165/ -- This is an automated message from the A

[GitHub] [spark] SparkQA commented on pull request #34734: [SPARK-37480][K8S][DOC] Sync Kubernetes configuration to latest in running-on-k8s.md

2021-11-28 Thread GitBox
SparkQA commented on pull request #34734: URL: https://github.com/apache/spark/pull/34734#issuecomment-981367713 **[Test build #145700 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145700/testReport)** for PR 34734 at commit [`1f24055`](https://github.co

[GitHub] [spark] gengliangwang commented on a change in pull request #34712: [SPARK-37463][SQL] Read/Write Timestamp ntz or ltz to Orc uses UTC timestamp

2021-11-28 Thread GitBox
gengliangwang commented on a change in pull request #34712: URL: https://github.com/apache/spark/pull/34712#discussion_r758099882 ## File path: sql/core/src/main/java/org/apache/spark/sql/execution/datasources/orc/OrcColumnarBatchReader.java ## @@ -48,6 +48,9 @@ // The capa

[GitHub] [spark] SparkQA commented on pull request #34734: [SPARK-37480][K8S][DOC] Sync Kubernetes configurations to latest in doc

2021-11-28 Thread GitBox
SparkQA commented on pull request #34734: URL: https://github.com/apache/spark/pull/34734#issuecomment-981357525 **[Test build #145700 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145700/testReport)** for PR 34734 at commit [`1f24055`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
SparkQA commented on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981357549 **[Test build #145701 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145701/testReport)** for PR 34732 at commit [`4598e8b`](https://github.com

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34367: [SPARK-37099][SQL] Impl a rank-based filter to optimize top-k computation

2021-11-28 Thread GitBox
AmplabJenkins removed a comment on pull request #34367: URL: https://github.com/apache/spark/pull/34367#issuecomment-981357027 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145692/ -

[GitHub] [spark] AmplabJenkins commented on pull request #34367: [SPARK-37099][SQL] Impl a rank-based filter to optimize top-k computation

2021-11-28 Thread GitBox
AmplabJenkins commented on pull request #34367: URL: https://github.com/apache/spark/pull/34367#issuecomment-981357027 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145692/ -- This

[GitHub] [spark] Yikun opened a new pull request #34734: [SPARK-37480][K8S][DOC] Sync Kubernetes configurations to latest in doc

2021-11-28 Thread GitBox
Yikun opened a new pull request #34734: URL: https://github.com/apache/spark/pull/34734 ### What changes were proposed in this pull request? Sync Kubernetes configurations to latest in doc ### Why are the changes needed? Configurations in docs/running-on-kubernetes.md are not up

[GitHub] [spark] SparkQA commented on pull request #34367: [SPARK-37099][SQL] Impl a rank-based filter to optimize top-k computation

2021-11-28 Thread GitBox
SparkQA commented on pull request #34367: URL: https://github.com/apache/spark/pull/34367#issuecomment-981356532 **[Test build #145692 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145692/testReport)** for PR 34367 at commit [`354b445`](https://github.co

[GitHub] [spark] SparkQA removed a comment on pull request #34367: [SPARK-37099][SQL] Impl a rank-based filter to optimize top-k computation

2021-11-28 Thread GitBox
SparkQA removed a comment on pull request #34367: URL: https://github.com/apache/spark/pull/34367#issuecomment-981263291 **[Test build #145692 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145692/testReport)** for PR 34367 at commit [`354b445`](https://gi

[GitHub] [spark] SparkQA commented on pull request #34733: [SPARK-36346][SQL][FOLLOWUP] Rename `withAllOrcReaders` to `withAllNativeOrcReaders`

2021-11-28 Thread GitBox
SparkQA commented on pull request #34733: URL: https://github.com/apache/spark/pull/34733#issuecomment-981351668 **[Test build #145698 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145698/testReport)** for PR 34733 at commit [`fc448fc`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
SparkQA commented on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981351647 **[Test build #145699 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145699/testReport)** for PR 34732 at commit [`b33d254`](https://github.com

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34726: [SPARK-33875][SQL][FOLLOWUP] Handle the char/varchar column for `Describe column` command

2021-11-28 Thread GitBox
AmplabJenkins removed a comment on pull request #34726: URL: https://github.com/apache/spark/pull/34726#issuecomment-981350686 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145688/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
AmplabJenkins removed a comment on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981331159 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment

[GitHub] [spark] AmplabJenkins commented on pull request #34726: [SPARK-33875][SQL][FOLLOWUP] Handle the char/varchar column for `Describe column` command

2021-11-28 Thread GitBox
AmplabJenkins commented on pull request #34726: URL: https://github.com/apache/spark/pull/34726#issuecomment-981350686 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145688/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
AmplabJenkins commented on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981350687 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50167/ -- T

[GitHub] [spark] gengliangwang commented on pull request #33588: [SPARK-36346][SQL] Support TimestampNTZ type in Orc file source

2021-11-28 Thread GitBox
gengliangwang commented on pull request #33588: URL: https://github.com/apache/spark/pull/33588#issuecomment-981350638 @bersprockets good catch, thank you! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[GitHub] [spark] AngersZhuuuu commented on a change in pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
AngersZh commented on a change in pull request #34732: URL: https://github.com/apache/spark/pull/34732#discussion_r758087493 ## File path: python/pyspark/sql/session.py ## @@ -305,10 +305,7 @@ def __init__( ): jsparkSession = self._jvm.SparkSe

[GitHub] [spark] SparkQA commented on pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
SparkQA commented on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981348630 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50164/ -- This is an automated message from the Apache

[GitHub] [spark] AngersZhuuuu commented on a change in pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
AngersZh commented on a change in pull request #34732: URL: https://github.com/apache/spark/pull/34732#discussion_r758065251 ## File path: python/pyspark/sql/session.py ## @@ -305,10 +305,7 @@ def __init__( ): jsparkSession = self._jvm.SparkSe

[GitHub] [spark] AngersZhuuuu commented on a change in pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
AngersZh commented on a change in pull request #34732: URL: https://github.com/apache/spark/pull/34732#discussion_r758086614 ## File path: python/pyspark/sql/session.py ## @@ -305,10 +305,7 @@ def __init__( ): jsparkSession = self._jvm.SparkSe

[GitHub] [spark] Yikun edited a comment on pull request #34717: [SPARK-37465][PYTHON] Bump minimum pandas version to 1.0.0

2021-11-28 Thread GitBox
Yikun edited a comment on pull request #34717: URL: https://github.com/apache/spark/pull/34717#issuecomment-981220341 Sure, thanks for your suggestion, I'd like to update. and I added a simple test to install pandas v1.0.1 ~and run test on https://github.com/apache/spark/pull/34730 , wait

[GitHub] [spark] SparkQA commented on pull request #34715: [SPARK-37445][BUILD] Rename the maven hadoop profile to hadoop-3 and hadoop-2

2021-11-28 Thread GitBox
SparkQA commented on pull request #34715: URL: https://github.com/apache/spark/pull/34715#issuecomment-981343687 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50165/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA removed a comment on pull request #34726: [SPARK-33875][SQL][FOLLOWUP] Handle the char/varchar column for `Describe column` command

2021-11-28 Thread GitBox
SparkQA removed a comment on pull request #34726: URL: https://github.com/apache/spark/pull/34726#issuecomment-981232745 **[Test build #145688 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145688/testReport)** for PR 34726 at commit [`ef74a06`](https://gi

[GitHub] [spark] SparkQA commented on pull request #34726: [SPARK-33875][SQL][FOLLOWUP] Handle the char/varchar column for `Describe column` command

2021-11-28 Thread GitBox
SparkQA commented on pull request #34726: URL: https://github.com/apache/spark/pull/34726#issuecomment-981342304 **[Test build #145688 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145688/testReport)** for PR 34726 at commit [`ef74a06`](https://github.co

[GitHub] [spark] Yikun edited a comment on pull request #34717: [SPARK-37465][PYTHON] Bump minimum pandas version to 1.0.0

2021-11-28 Thread GitBox
Yikun edited a comment on pull request #34717: URL: https://github.com/apache/spark/pull/34717#issuecomment-981220341 Sure, thanks for your suggestion, I'd like to update. and I added a simple test to install pandas v1.0.1 ~and run test on https://github.com/apache/spark/pull/34730 , wait

[GitHub] [spark] dongjoon-hyun commented on pull request #33588: [SPARK-36346][SQL] Support TimestampNTZ type in Orc file source

2021-11-28 Thread GitBox
dongjoon-hyun commented on pull request #33588: URL: https://github.com/apache/spark/pull/33588#issuecomment-981335424 BTW, thank you, @bersprockets ! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to g

[GitHub] [spark] SparkQA commented on pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
SparkQA commented on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981332981 Kubernetes integration test unable to build dist. exiting with code: 1 URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50167/ -- This

[GitHub] [spark] dongjoon-hyun commented on pull request #34723: [MINOR][SQL] Optimize some Orc test code

2021-11-28 Thread GitBox
dongjoon-hyun commented on pull request #34723: URL: https://github.com/apache/spark/pull/34723#issuecomment-981332407 Let's proceed this after https://github.com/apache/spark/pull/34733 . -- This is an automated message from the Apache Git Service. To respond to the message, please log o

[GitHub] [spark] dongjoon-hyun commented on pull request #33588: [SPARK-36346][SQL] Support TimestampNTZ type in Orc file source

2021-11-28 Thread GitBox
dongjoon-hyun commented on pull request #33588: URL: https://github.com/apache/spark/pull/33588#issuecomment-981332147 Here is a follow-up PR. - https://github.com/apache/spark/pull/34733 -- This is an automated message from the Apache Git Service. To respond to the message, please log

[GitHub] [spark] dongjoon-hyun opened a new pull request #34733: [SPARK-36346][SQL][FOLLOWUP] Rename withAllOrcReaders to withAllNativeOrcReaders

2021-11-28 Thread GitBox
dongjoon-hyun opened a new pull request #34733: URL: https://github.com/apache/spark/pull/34733 … ### What changes were proposed in this pull request? ### Why are the changes needed? ### Does this PR introduce _any_ user-facing change?

[GitHub] [spark] SparkQA removed a comment on pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
SparkQA removed a comment on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981325968 **[Test build #145696 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145696/testReport)** for PR 34732 at commit [`d374536`](https://gi

[GitHub] [spark] AmplabJenkins commented on pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
AmplabJenkins commented on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981331159 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145696/ -- This

[GitHub] [spark] SparkQA commented on pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
SparkQA commented on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981331123 **[Test build #145696 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145696/testReport)** for PR 34732 at commit [`d374536`](https://github.co

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
AmplabJenkins removed a comment on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981330252 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50166/

[GitHub] [spark] SparkQA commented on pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
SparkQA commented on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981330241 Kubernetes integration test unable to build dist. exiting with code: 1 URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50166/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
AmplabJenkins commented on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981330252 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50166/ -- T

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
AmplabJenkins removed a comment on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981330133 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145697/ -

[GitHub] [spark] SparkQA removed a comment on pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
SparkQA removed a comment on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981327528 **[Test build #145697 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145697/testReport)** for PR 34732 at commit [`f6df6a8`](https://gi

[GitHub] [spark] SparkQA commented on pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
SparkQA commented on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981330110 **[Test build #145697 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145697/testReport)** for PR 34732 at commit [`f6df6a8`](https://github.co

[GitHub] [spark] AmplabJenkins commented on pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
AmplabJenkins commented on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981330133 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145697/ -- This

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
AmplabJenkins removed a comment on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981329560 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145694/ -

[GitHub] [spark] SparkQA removed a comment on pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
SparkQA removed a comment on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981324141 **[Test build #145694 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145694/testReport)** for PR 34732 at commit [`7be6862`](https://gi

[GitHub] [spark] AmplabJenkins commented on pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
AmplabJenkins commented on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981329560 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145694/ -- This

[GitHub] [spark] SparkQA commented on pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
SparkQA commented on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981329520 **[Test build #145694 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145694/testReport)** for PR 34732 at commit [`7be6862`](https://github.co

[GitHub] [spark] SparkQA commented on pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
SparkQA commented on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981325968 **[Test build #145696 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145696/testReport)** for PR 34732 at commit [`d374536`](https://github.com

[GitHub] [spark] HyukjinKwon commented on pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
HyukjinKwon commented on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981325158 Thanks for the followup! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [spark] AngersZhuuuu commented on a change in pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
AngersZh commented on a change in pull request #34732: URL: https://github.com/apache/spark/pull/34732#discussion_r758065251 ## File path: python/pyspark/sql/session.py ## @@ -305,10 +305,7 @@ def __init__( ): jsparkSession = self._jvm.SparkSe

[GitHub] [spark] HyukjinKwon commented on a change in pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
HyukjinKwon commented on a change in pull request #34732: URL: https://github.com/apache/spark/pull/34732#discussion_r758065143 ## File path: python/pyspark/sql/session.py ## @@ -305,10 +305,7 @@ def __init__( ): jsparkSession = self._jvm.SparkSes

[GitHub] [spark] HyukjinKwon commented on a change in pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
HyukjinKwon commented on a change in pull request #34732: URL: https://github.com/apache/spark/pull/34732#discussion_r758064643 ## File path: python/pyspark/sql/session.py ## @@ -305,10 +305,7 @@ def __init__( ): jsparkSession = self._jvm.SparkSes

[GitHub] [spark] SparkQA commented on pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
SparkQA commented on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981324141 **[Test build #145694 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145694/testReport)** for PR 34732 at commit [`7be6862`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #34715: [SPARK-37445][BUILD] Rename the maven hadoop profile to hadoop-3 and hadoop-2

2021-11-28 Thread GitBox
SparkQA commented on pull request #34715: URL: https://github.com/apache/spark/pull/34715#issuecomment-981324203 **[Test build #145695 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145695/testReport)** for PR 34715 at commit [`758b267`](https://github.com

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34731: [SPARK-37153][PYTHON] Inline type hints for python/pyspark/profiler.py

2021-11-28 Thread GitBox
AmplabJenkins removed a comment on pull request #34731: URL: https://github.com/apache/spark/pull/34731#issuecomment-981323452 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50163/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34720: [SPARK-37469][WebUI] unified shuffle read block time to shuffle read fetch wait time in StagePage

2021-11-28 Thread GitBox
AmplabJenkins removed a comment on pull request #34720: URL: https://github.com/apache/spark/pull/34720#issuecomment-981323453 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145690/ -

[GitHub] [spark] HyukjinKwon commented on a change in pull request #34732: [SPARK-37291][PYSPARK][FOLLOWUP] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
HyukjinKwon commented on a change in pull request #34732: URL: https://github.com/apache/spark/pull/34732#discussion_r758064015 ## File path: sql/core/src/main/scala/org/apache/spark/sql/SparkSession.scala ## @@ -97,11 +97,11 @@ class SparkSession private( * since that woul

[GitHub] [spark] AmplabJenkins commented on pull request #34720: [SPARK-37469][WebUI] unified shuffle read block time to shuffle read fetch wait time in StagePage

2021-11-28 Thread GitBox
AmplabJenkins commented on pull request #34720: URL: https://github.com/apache/spark/pull/34720#issuecomment-981323453 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145690/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #34731: [SPARK-37153][PYTHON] Inline type hints for python/pyspark/profiler.py

2021-11-28 Thread GitBox
AmplabJenkins commented on pull request #34731: URL: https://github.com/apache/spark/pull/34731#issuecomment-981323452 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50163/ -- T

[GitHub] [spark] dongjoon-hyun commented on a change in pull request #33588: [SPARK-36346][SQL] Support TimestampNTZ type in Orc file source

2021-11-28 Thread GitBox
dongjoon-hyun commented on a change in pull request #33588: URL: https://github.com/apache/spark/pull/33588#discussion_r758063667 ## File path: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/orc/OrcTest.scala ## @@ -143,6 +143,13 @@ abstract class OrcTest e

[GitHub] [spark] AngersZhuuuu commented on pull request #34732: [SPARK-37291][PYSPARK] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
AngersZh commented on pull request #34732: URL: https://github.com/apache/spark/pull/34732#issuecomment-981322758 ping @HyukjinKwon @dongjoon-hyun -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[GitHub] [spark] AngersZhuuuu opened a new pull request #34732: [SPARK-37291][PYSPARK] PySpark create SparkSession should pass initialSessionOptions

2021-11-28 Thread GitBox
AngersZh opened a new pull request #34732: URL: https://github.com/apache/spark/pull/34732 ### What changes were proposed in this pull request? In this pr, when create SparkSession, we pass initialSessionOptions to SparkSession, to keep same code path with scala code. ### Why

[GitHub] [spark] yaooqinn commented on pull request #34697: [SPARK-37452][SQL] Char and Varchar break backward compatibility between v3.1 and v2

2021-11-28 Thread GitBox
yaooqinn commented on pull request #34697: URL: https://github.com/apache/spark/pull/34697#issuecomment-981322426 any more concerns from the CCers? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go t

[GitHub] [spark] AngersZhuuuu commented on pull request #34715: [SPARK-37445][BUILD] Rename the maven hadoop profile to hadoop-3 and hadoop-2

2021-11-28 Thread GitBox
AngersZh commented on pull request #34715: URL: https://github.com/apache/spark/pull/34715#issuecomment-981320628 retest this please -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the speci

[GitHub] [spark] dongjoon-hyun commented on pull request #34679: [SPARK-37437][BUILD] Remove unused hive profile and related CI test

2021-11-28 Thread GitBox
dongjoon-hyun commented on pull request #34679: URL: https://github.com/apache/spark/pull/34679#issuecomment-981318255 +1, late LGTM. Thank you all. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [spark] dongjoon-hyun commented on pull request #34676: [SPARK-37434][BUILD] Add a new profile to auto disable unsupported UTs on MacOs using Apple Silicon

2021-11-28 Thread GitBox
dongjoon-hyun commented on pull request #34676: URL: https://github.com/apache/spark/pull/34676#issuecomment-981316245 Thank you for closing this PR, @LuciferYang . Ya, `leveldb` JNI library is severely outdated while `RocksDB` shows its progress, https://github.com/facebook/rocksdb/

[GitHub] [spark] SparkQA commented on pull request #34731: [SPARK-37153][PYTHON] Inline type hints for python/pyspark/profiler.py

2021-11-28 Thread GitBox
SparkQA commented on pull request #34731: URL: https://github.com/apache/spark/pull/34731#issuecomment-981313691 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50163/ -- This is an automated message from the A

[GitHub] [spark] SparkQA removed a comment on pull request #34720: [SPARK-37469][WebUI] unified shuffle read block time to shuffle read fetch wait time in StagePage

2021-11-28 Thread GitBox
SparkQA removed a comment on pull request #34720: URL: https://github.com/apache/spark/pull/34720#issuecomment-981255405 **[Test build #145690 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145690/testReport)** for PR 34720 at commit [`2f25efc`](https://gi

[GitHub] [spark] SparkQA commented on pull request #34720: [SPARK-37469][WebUI] unified shuffle read block time to shuffle read fetch wait time in StagePage

2021-11-28 Thread GitBox
SparkQA commented on pull request #34720: URL: https://github.com/apache/spark/pull/34720#issuecomment-981313387 **[Test build #145690 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145690/testReport)** for PR 34720 at commit [`2f25efc`](https://github.co

[GitHub] [spark] dongjoon-hyun commented on pull request #34620: [SPARK-37209][YARN][TESTS] Fix `YarnShuffleIntegrationSuite` releated UTs when using `hadoop-3.2` profile without `assembly/target/scal

2021-11-28 Thread GitBox
dongjoon-hyun commented on pull request #34620: URL: https://github.com/apache/spark/pull/34620#issuecomment-981311068 Thank you all. +1, late LGTM. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [spark] dongjoon-hyun closed pull request #34722: [SPARK-37319][K8S][FOLLOWUP] Set JAVA_HOME for Java 17 installed by apt-get

2021-11-28 Thread GitBox
dongjoon-hyun closed pull request #34722: URL: https://github.com/apache/spark/pull/34722 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: reviews-

[GitHub] [spark] HyukjinKwon closed pull request #34685: [SPARK-37443][PYTHON] Provide a profiler for Python/Pandas UDFs

2021-11-28 Thread GitBox
HyukjinKwon closed pull request #34685: URL: https://github.com/apache/spark/pull/34685 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: reviews-un

[GitHub] [spark] HyukjinKwon commented on pull request #34685: [SPARK-37443][PYTHON] Provide a profiler for Python/Pandas UDFs

2021-11-28 Thread GitBox
HyukjinKwon commented on pull request #34685: URL: https://github.com/apache/spark/pull/34685#issuecomment-981307601 Merged to master. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specifi

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34715: [SPARK-37445][BUILD] Rename the maven hadoop profile to hadoop-3 and hadoop-2

2021-11-28 Thread GitBox
AmplabJenkins removed a comment on pull request #34715: URL: https://github.com/apache/spark/pull/34715#issuecomment-981306552 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145691/ -

[GitHub] [spark] AmplabJenkins commented on pull request #34715: [SPARK-37445][BUILD] Rename the maven hadoop profile to hadoop-3 and hadoop-2

2021-11-28 Thread GitBox
AmplabJenkins commented on pull request #34715: URL: https://github.com/apache/spark/pull/34715#issuecomment-981306552 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145691/ -- This

[GitHub] [spark] SparkQA removed a comment on pull request #34715: [SPARK-37445][BUILD] Rename the maven hadoop profile to hadoop-3 and hadoop-2

2021-11-28 Thread GitBox
SparkQA removed a comment on pull request #34715: URL: https://github.com/apache/spark/pull/34715#issuecomment-981255501 **[Test build #145691 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145691/testReport)** for PR 34715 at commit [`758b267`](https://gi

[GitHub] [spark] SparkQA commented on pull request #34715: [SPARK-37445][BUILD] Rename the maven hadoop profile to hadoop-3 and hadoop-2

2021-11-28 Thread GitBox
SparkQA commented on pull request #34715: URL: https://github.com/apache/spark/pull/34715#issuecomment-981306063 **[Test build #145691 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145691/testReport)** for PR 34715 at commit [`758b267`](https://github.co

[GitHub] [spark] HeartSaVioR commented on pull request #34691: [SPARK-37447][SQL] Cache LogicalPlan.isStreaming() result in a lazy val

2021-11-28 Thread GitBox
HeartSaVioR commented on pull request #34691: URL: https://github.com/apache/spark/pull/34691#issuecomment-981305889 I can't imagine the case the logical plan somehow replaces the leaf nodes (sources) after other nodes are added on top of leaf nodes. If that is true, I guess this simply wo

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34730: [DNM] Trigger test for 'pandas==1.0.1'

2021-11-28 Thread GitBox
AmplabJenkins removed a comment on pull request #34730: URL: https://github.com/apache/spark/pull/34730#issuecomment-981299849 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34367: [SPARK-37099][SQL] Impl a rank-based filter to optimize top-k computation

2021-11-28 Thread GitBox
AmplabJenkins removed a comment on pull request #34367: URL: https://github.com/apache/spark/pull/34367#issuecomment-981299850 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50162/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34731: [SPARK-37153][PYTHON] Inline type hints for python/pyspark/profiler.py

2021-11-28 Thread GitBox
AmplabJenkins removed a comment on pull request #34731: URL: https://github.com/apache/spark/pull/34731#issuecomment-981299852 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145693/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34715: [SPARK-37445][BUILD] Rename the maven hadoop profile to hadoop-3 and hadoop-2

2021-11-28 Thread GitBox
AmplabJenkins removed a comment on pull request #34715: URL: https://github.com/apache/spark/pull/34715#issuecomment-981299851 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50161/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34720: [SPARK-37469][WebUI] unified shuffle read block time to shuffle read fetch wait time in StagePage

2021-11-28 Thread GitBox
AmplabJenkins removed a comment on pull request #34720: URL: https://github.com/apache/spark/pull/34720#issuecomment-981299853 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50160/

[GitHub] [spark] AmplabJenkins commented on pull request #34367: [SPARK-37099][SQL] Impl a rank-based filter to optimize top-k computation

2021-11-28 Thread GitBox
AmplabJenkins commented on pull request #34367: URL: https://github.com/apache/spark/pull/34367#issuecomment-981299850 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50162/ -- T

[GitHub] [spark] AmplabJenkins commented on pull request #34730: [DNM] Trigger test for 'pandas==1.0.1'

2021-11-28 Thread GitBox
AmplabJenkins commented on pull request #34730: URL: https://github.com/apache/spark/pull/34730#issuecomment-981299849 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To un

[GitHub] [spark] AmplabJenkins commented on pull request #34715: [SPARK-37445][BUILD] Rename the maven hadoop profile to hadoop-3 and hadoop-2

2021-11-28 Thread GitBox
AmplabJenkins commented on pull request #34715: URL: https://github.com/apache/spark/pull/34715#issuecomment-981299851 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50161/ -- T

[GitHub] [spark] AmplabJenkins commented on pull request #34720: [SPARK-37469][WebUI] unified shuffle read block time to shuffle read fetch wait time in StagePage

2021-11-28 Thread GitBox
AmplabJenkins commented on pull request #34720: URL: https://github.com/apache/spark/pull/34720#issuecomment-981299853 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50160/ -- T

[GitHub] [spark] AmplabJenkins commented on pull request #34731: [SPARK-37153][PYTHON] Inline type hints for python/pyspark/profiler.py

2021-11-28 Thread GitBox
AmplabJenkins commented on pull request #34731: URL: https://github.com/apache/spark/pull/34731#issuecomment-981299852 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145693/ -- This

[GitHub] [spark] SparkQA commented on pull request #34367: [SPARK-37099][SQL] Impl a rank-based filter to optimize top-k computation

2021-11-28 Thread GitBox
SparkQA commented on pull request #34367: URL: https://github.com/apache/spark/pull/34367#issuecomment-981299164 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50162/ -- This is an automated message from the A

[GitHub] [spark] SparkQA commented on pull request #34715: [SPARK-37445][BUILD] Rename the maven hadoop profile to hadoop-3 and hadoop-2

2021-11-28 Thread GitBox
SparkQA commented on pull request #34715: URL: https://github.com/apache/spark/pull/34715#issuecomment-981294029 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50161/ -- This is an automated message from the A

[GitHub] [spark] SparkQA commented on pull request #34720: [SPARK-37469][WebUI] unified shuffle read block time to shuffle read fetch wait time in StagePage

2021-11-28 Thread GitBox
SparkQA commented on pull request #34720: URL: https://github.com/apache/spark/pull/34720#issuecomment-981293088 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50160/ -- This is an automated message from the A

[GitHub] [spark] SparkQA commented on pull request #34731: [SPARK-37153][PYTHON] Inline type hints for python/pyspark/profiler.py

2021-11-28 Thread GitBox
SparkQA commented on pull request #34731: URL: https://github.com/apache/spark/pull/34731#issuecomment-981292258 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50163/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #34730: [DNM] Trigger test for 'pandas==1.0.1'

2021-11-28 Thread GitBox
SparkQA commented on pull request #34730: URL: https://github.com/apache/spark/pull/34730#issuecomment-981291980 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50159/ -- This is an automated message from the A

[GitHub] [spark] SparkQA removed a comment on pull request #34730: [DNM] Trigger test for 'pandas==1.0.1'

2021-11-28 Thread GitBox
SparkQA removed a comment on pull request #34730: URL: https://github.com/apache/spark/pull/34730#issuecomment-981232709 **[Test build #145687 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145687/testReport)** for PR 34730 at commit [`ca77e73`](https://gi

[GitHub] [spark] HyukjinKwon commented on pull request #34731: [SPARK-37153][PYTHON] Inline type hints for python/pyspark/profiler.py

2021-11-28 Thread GitBox
HyukjinKwon commented on pull request #34731: URL: https://github.com/apache/spark/pull/34731#issuecomment-981287828 Let's get https://github.com/apache/spark/pull/34685 done first. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to Gi

[GitHub] [spark] SparkQA commented on pull request #34730: [DNM] Trigger test for 'pandas==1.0.1'

2021-11-28 Thread GitBox
SparkQA commented on pull request #34730: URL: https://github.com/apache/spark/pull/34730#issuecomment-981287686 **[Test build #145687 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145687/testReport)** for PR 34730 at commit [`ca77e73`](https://github.co

[GitHub] [spark] SparkQA removed a comment on pull request #34731: [SPARK-37153][PYTHON] Inline type hints for python/pyspark/profiler.py

2021-11-28 Thread GitBox
SparkQA removed a comment on pull request #34731: URL: https://github.com/apache/spark/pull/34731#issuecomment-981279640 **[Test build #145693 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145693/testReport)** for PR 34731 at commit [`b947cc4`](https://gi

[GitHub] [spark] SparkQA commented on pull request #34731: [SPARK-37153][PYTHON] Inline type hints for python/pyspark/profiler.py

2021-11-28 Thread GitBox
SparkQA commented on pull request #34731: URL: https://github.com/apache/spark/pull/34731#issuecomment-981286377 **[Test build #145693 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145693/testReport)** for PR 34731 at commit [`b947cc4`](https://github.co

[GitHub] [spark] SparkQA commented on pull request #34367: [SPARK-37099][SQL] Impl a rank-based filter to optimize top-k computation

2021-11-28 Thread GitBox
SparkQA commented on pull request #34367: URL: https://github.com/apache/spark/pull/34367#issuecomment-981282705 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50162/ -- This is an automated message from the Apache

  1   2   3   >