[GitHub] [spark] AmplabJenkins removed a comment on pull request #29893: [SPARK-32976][SQL]Support column list in INSERT statement

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #29893:
URL: https://github.com/apache/spark/pull/29893#issuecomment-706913984


   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/129666/
   Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on pull request #29767: [SPARK-32896][SS] Add DataStreamWriter.table API

2020-10-12 Thread GitBox


cloud-fan commented on pull request #29767:
URL: https://github.com/apache/spark/pull/29767#issuecomment-706917712


   Just update the title is good enough, it's only about the commit message



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] zero323 commented on pull request #30009: [SPARK-32907][ML] adaptively blockify instances - LinearSVC

2020-10-12 Thread GitBox


zero323 commented on pull request #30009:
URL: https://github.com/apache/spark/pull/30009#issuecomment-706917827


   
   > @zero323 I send a new PR here, thanks for reviewing. I tried to verify 
consistency of annotations locally, but the following cmd failed:
   > 
   > ```
   > mypy --no-incremental --config python/mypy.ini python/pyspark
   > python/pyspark/ml/linalg/__init__.pyi:25: error: misplaced type annotation
   > ```
   > 
   > I installed `mypy` by `sudo apt install mypy` in ubuntu 18.04,
   > I am not very similar to `mypy`, do I need to configure it somewhere?
   
   No additional configuration should be required, but the version from Ubuntu 
errors is pretty old, and at first glance it doesn't support error codes 
(`[import]` part). 
   
   Personally I'd recommend either 
[venv](https://docs.python.org/3/library/venv.html) or miniconda, but if you 
want quick fix, installing pip and making user install should do the trick
   
   ```
   sudo apt purge mypy
   sudo apt install python3-pip
   pip install mypy
   ```
   
   I've checked things on my side (mypy 0.790, current stable), for both master 
and this PR, and things look good.
   
   
   
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #29965: [SPARK-33016][SQL] Potential SQLMetrics missed which might cause WEB UI display issue while AQE is on.

2020-10-12 Thread GitBox


SparkQA commented on pull request #29965:
URL: https://github.com/apache/spark/pull/29965#issuecomment-706919018


   **[Test build #129664 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129664/testReport)**
 for PR 29965 at commit 
[`77923c5`](https://github.com/apache/spark/commit/77923c5542ba762646c8a56fe97a1a2e3f553fab).
* This patch **fails due to an unknown error code, -9**.
* This patch merges cleanly.
* This patch adds no public classes.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #30010: [SPARK-33117][BUILD] Update zstd-jni to 1.4.5-6

2020-10-12 Thread GitBox


SparkQA commented on pull request #30010:
URL: https://github.com/apache/spark/pull/30010#issuecomment-706919019


   **[Test build #129665 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129665/testReport)**
 for PR 30010 at commit 
[`aa7004e`](https://github.com/apache/spark/commit/aa7004e44a948ad86f6756e6ca3085e75462ee47).
* This patch **fails due to an unknown error code, -9**.
* This patch merges cleanly.
* This patch adds no public classes.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #30011: [WIP][SPARK-32281][SQL] Spark keep SORTED spec in metastore

2020-10-12 Thread GitBox


SparkQA commented on pull request #30011:
URL: https://github.com/apache/spark/pull/30011#issuecomment-706919020


   **[Test build #129669 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129669/testReport)**
 for PR 30011 at commit 
[`9364dd9`](https://github.com/apache/spark/commit/9364dd9699f2a756c191389a6923be70c6e22759).
* This patch **fails due to an unknown error code, -9**.
* This patch merges cleanly.
* This patch adds the following public classes _(experimental)_:
 * `case class HiveMetaBucketSpec(`



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #29983: [SPARK-13860][SQL] Change statistical aggregate function to return null instead of Double.NaN when divideByZero

2020-10-12 Thread GitBox


SparkQA commented on pull request #29983:
URL: https://github.com/apache/spark/pull/29983#issuecomment-706919011







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #29965: [SPARK-33016][SQL] Potential SQLMetrics missed which might cause WEB UI display issue while AQE is on.

2020-10-12 Thread GitBox


AmplabJenkins commented on pull request #29965:
URL: https://github.com/apache/spark/pull/29965#issuecomment-706919215







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] dongjoon-hyun opened a new pull request #30012: [SPARK-XXX][INFRA] Rebalance GitHub Action jobs

2020-10-12 Thread GitBox


dongjoon-hyun opened a new pull request #30012:
URL: https://github.com/apache/spark/pull/30012


   ### What changes were proposed in this pull request?
   
   This PR aims to rebalance GitHub Action jobs.
   
   ### Why are the changes needed?
   
   ![Screen Shot 2020-10-12 at 12 04 41 
AM](https://user-images.githubusercontent.com/9700541/95715351-8bdcfd00-0c1e-11eb-9b62-1744c10bc37c.png)
   
   ### Does this PR introduce _any_ user-facing change?
   
   No.
   
   ### How was this patch tested?
   
   Check the GitHub Action job runtime in this PR.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #30011: [WIP][SPARK-32281][SQL] Spark keep SORTED spec in metastore

2020-10-12 Thread GitBox


AmplabJenkins commented on pull request #30011:
URL: https://github.com/apache/spark/pull/30011#issuecomment-706919168







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #29983: [SPARK-13860][SQL] Change statistical aggregate function to return null instead of Double.NaN when divideByZero

2020-10-12 Thread GitBox


AmplabJenkins commented on pull request #29983:
URL: https://github.com/apache/spark/pull/29983#issuecomment-706919166







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-12 Thread GitBox


SparkQA commented on pull request #29843:
URL: https://github.com/apache/spark/pull/29843#issuecomment-706919022


   **[Test build #129647 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129647/testReport)**
 for PR 29843 at commit 
[`d8af755`](https://github.com/apache/spark/commit/d8af755bc0148541311f6829c78af74d44e9e7c2).
* This patch **fails due to an unknown error code, -9**.
* This patch merges cleanly.
* This patch adds no public classes.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #30010: [SPARK-33117][BUILD] Update zstd-jni to 1.4.5-6

2020-10-12 Thread GitBox


AmplabJenkins commented on pull request #30010:
URL: https://github.com/apache/spark/pull/30010#issuecomment-706919427







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #29999: [SPARK-33045][SQL] Support build-in function like_all and fix StackOverflowError issue.

2020-10-12 Thread GitBox


SparkQA commented on pull request #2:
URL: https://github.com/apache/spark/pull/2#issuecomment-706919007







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #29999: [SPARK-33045][SQL] Support build-in function like_all and fix StackOverflowError issue.

2020-10-12 Thread GitBox


AmplabJenkins commented on pull request #2:
URL: https://github.com/apache/spark/pull/2#issuecomment-706919391







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] beliefer commented on pull request #29999: [SPARK-33045][SQL] Support build-in function like_all and fix StackOverflowError issue.

2020-10-12 Thread GitBox


beliefer commented on pull request #2:
URL: https://github.com/apache/spark/pull/2#issuecomment-706919752


   retest this please



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #29965: [SPARK-33016][SQL] Potential SQLMetrics missed which might cause WEB UI display issue while AQE is on.

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #29965:
URL: https://github.com/apache/spark/pull/29965#issuecomment-706919215


   Merged build finished. Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #29999: [SPARK-33045][SQL] Support build-in function like_all and fix StackOverflowError issue.

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #2:
URL: https://github.com/apache/spark/pull/2#issuecomment-706919391


   Merged build finished. Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-12 Thread GitBox


AmplabJenkins commented on pull request #29843:
URL: https://github.com/apache/spark/pull/29843#issuecomment-706919743







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on pull request #30011: [WIP][SPARK-32281][SQL] Spark keep SORTED spec in metastore

2020-10-12 Thread GitBox


SparkQA removed a comment on pull request #30011:
URL: https://github.com/apache/spark/pull/30011#issuecomment-706909487


   **[Test build #129669 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129669/testReport)**
 for PR 30011 at commit 
[`9364dd9`](https://github.com/apache/spark/commit/9364dd9699f2a756c191389a6923be70c6e22759).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on pull request #30010: [SPARK-33117][BUILD] Update zstd-jni to 1.4.5-6

2020-10-12 Thread GitBox


SparkQA removed a comment on pull request #30010:
URL: https://github.com/apache/spark/pull/30010#issuecomment-706880332


   **[Test build #129665 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129665/testReport)**
 for PR 30010 at commit 
[`aa7004e`](https://github.com/apache/spark/commit/aa7004e44a948ad86f6756e6ca3085e75462ee47).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AngersZhuuuu commented on pull request #30011: [WIP][SPARK-32281][SQL] Spark keep SORTED spec in metastore

2020-10-12 Thread GitBox


AngersZh commented on pull request #30011:
URL: https://github.com/apache/spark/pull/30011#issuecomment-706919701


   retest this please



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #30010: [SPARK-33117][BUILD] Update zstd-jni to 1.4.5-6

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #30010:
URL: https://github.com/apache/spark/pull/30010#issuecomment-706919427


   Merged build finished. Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-12 Thread GitBox


SparkQA removed a comment on pull request #29843:
URL: https://github.com/apache/spark/pull/29843#issuecomment-706807152


   **[Test build #129647 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129647/testReport)**
 for PR 29843 at commit 
[`d8af755`](https://github.com/apache/spark/commit/d8af755bc0148541311f6829c78af74d44e9e7c2).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on pull request #29983: [SPARK-13860][SQL] Change statistical aggregate function to return null instead of Double.NaN when divideByZero

2020-10-12 Thread GitBox


SparkQA removed a comment on pull request #29983:
URL: https://github.com/apache/spark/pull/29983#issuecomment-706833295







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #30011: [WIP][SPARK-32281][SQL] Spark keep SORTED spec in metastore

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #30011:
URL: https://github.com/apache/spark/pull/30011#issuecomment-706919168


   Merged build finished. Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on pull request #29965: [SPARK-33016][SQL] Potential SQLMetrics missed which might cause WEB UI display issue while AQE is on.

2020-10-12 Thread GitBox


SparkQA removed a comment on pull request #29965:
URL: https://github.com/apache/spark/pull/29965#issuecomment-706867654


   **[Test build #129664 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129664/testReport)**
 for PR 29965 at commit 
[`77923c5`](https://github.com/apache/spark/commit/77923c5542ba762646c8a56fe97a1a2e3f553fab).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #29999: [SPARK-33045][SQL] Support build-in function like_all and fix StackOverflowError issue.

2020-10-12 Thread GitBox


AmplabJenkins commented on pull request #2:
URL: https://github.com/apache/spark/pull/2#issuecomment-706919702







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #29983: [SPARK-13860][SQL] Change statistical aggregate function to return null instead of Double.NaN when divideByZero

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #29983:
URL: https://github.com/apache/spark/pull/29983#issuecomment-706919166


   Merged build finished. Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on pull request #29999: [SPARK-33045][SQL] Support build-in function like_all and fix StackOverflowError issue.

2020-10-12 Thread GitBox


SparkQA removed a comment on pull request #2:
URL: https://github.com/apache/spark/pull/2#issuecomment-706841427







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #29999: [SPARK-33045][SQL] Support build-in function like_all and fix StackOverflowError issue.

2020-10-12 Thread GitBox


AmplabJenkins commented on pull request #2:
URL: https://github.com/apache/spark/pull/2#issuecomment-706919945







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #29843:
URL: https://github.com/apache/spark/pull/29843#issuecomment-706919743


   Merged build finished. Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #30011: [WIP][SPARK-32281][SQL] Spark keep SORTED spec in metastore

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #30011:
URL: https://github.com/apache/spark/pull/30011#issuecomment-706919179


   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/129669/
   Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #29965: [SPARK-33016][SQL] Potential SQLMetrics missed which might cause WEB UI display issue while AQE is on.

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #29965:
URL: https://github.com/apache/spark/pull/29965#issuecomment-706919223


   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/129664/
   Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #29983: [SPARK-13860][SQL] Change statistical aggregate function to return null instead of Double.NaN when divideByZero

2020-10-12 Thread GitBox


AmplabJenkins commented on pull request #29983:
URL: https://github.com/apache/spark/pull/29983#issuecomment-706920156







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #30010: [SPARK-33117][BUILD] Update zstd-jni to 1.4.5-6

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #30010:
URL: https://github.com/apache/spark/pull/30010#issuecomment-706919438


   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/129665/
   Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #29983: [SPARK-13860][SQL] Change statistical aggregate function to return null instead of Double.NaN when divideByZero

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #29983:
URL: https://github.com/apache/spark/pull/29983#issuecomment-706919178







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #29999: [SPARK-33045][SQL] Support build-in function like_all and fix StackOverflowError issue.

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #2:
URL: https://github.com/apache/spark/pull/2#issuecomment-706919408







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #29983: [SPARK-13860][SQL] Change statistical aggregate function to return null instead of Double.NaN when divideByZero

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #29983:
URL: https://github.com/apache/spark/pull/29983#issuecomment-706920181


   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/129658/
   Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #29843:
URL: https://github.com/apache/spark/pull/29843#issuecomment-706919758


   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/129647/
   Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #30011: [WIP][SPARK-32281][SQL] Spark keep SORTED spec in metastore

2020-10-12 Thread GitBox


SparkQA commented on pull request #30011:
URL: https://github.com/apache/spark/pull/30011#issuecomment-706922911


   **[Test build #129671 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129671/testReport)**
 for PR 30011 at commit 
[`9364dd9`](https://github.com/apache/spark/commit/9364dd9699f2a756c191389a6923be70c6e22759).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #30012: [SPARK-XXX][INFRA] Rebalance GitHub Action jobs

2020-10-12 Thread GitBox


SparkQA commented on pull request #30012:
URL: https://github.com/apache/spark/pull/30012#issuecomment-706922856


   **[Test build #129670 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129670/testReport)**
 for PR 30012 at commit 
[`083b2ed`](https://github.com/apache/spark/commit/083b2ed7a1bafe3d92bbd3c24c727dd4d8d16799).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #29999: [SPARK-33045][SQL] Support build-in function like_all and fix StackOverflowError issue.

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #2:
URL: https://github.com/apache/spark/pull/2#issuecomment-706919955


   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/129657/
   Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #29893: [SPARK-32976][SQL]Support column list in INSERT statement

2020-10-12 Thread GitBox


SparkQA commented on pull request #29893:
URL: https://github.com/apache/spark/pull/29893#issuecomment-706923120


   Kubernetes integration test status success
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34270/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #29893: [SPARK-32976][SQL]Support column list in INSERT statement

2020-10-12 Thread GitBox


SparkQA commented on pull request #29893:
URL: https://github.com/apache/spark/pull/29893#issuecomment-706923002


   **[Test build #129673 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129673/testReport)**
 for PR 29893 at commit 
[`f7e8d18`](https://github.com/apache/spark/commit/f7e8d186320aa805d679942937b12b3a5141fb89).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #27735: [SPARK-30985][k8s] Support propagating SPARK_CONF_DIR files to driver and executor pods.

2020-10-12 Thread GitBox


SparkQA commented on pull request #27735:
URL: https://github.com/apache/spark/pull/27735#issuecomment-706923272


   Kubernetes integration test status failure
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34271/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #27735: [SPARK-30985][k8s] Support propagating SPARK_CONF_DIR files to driver and executor pods.

2020-10-12 Thread GitBox


AmplabJenkins commented on pull request #27735:
URL: https://github.com/apache/spark/pull/27735#issuecomment-706923296







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #29893: [SPARK-32976][SQL]Support column list in INSERT statement

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #29893:
URL: https://github.com/apache/spark/pull/29893#issuecomment-706923142







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #29999: [SPARK-33045][SQL] Support build-in function like_all and fix StackOverflowError issue.

2020-10-12 Thread GitBox


SparkQA commented on pull request #2:
URL: https://github.com/apache/spark/pull/2#issuecomment-706922984


   **[Test build #129672 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129672/testReport)**
 for PR 2 at commit 
[`3e41cff`](https://github.com/apache/spark/commit/3e41cffb800e8e3f5a485021706f38a4fc73e07c).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #29893: [SPARK-32976][SQL]Support column list in INSERT statement

2020-10-12 Thread GitBox


AmplabJenkins commented on pull request #29893:
URL: https://github.com/apache/spark/pull/29893#issuecomment-706923142







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #27735: [SPARK-30985][k8s] Support propagating SPARK_CONF_DIR files to driver and executor pods.

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #27735:
URL: https://github.com/apache/spark/pull/27735#issuecomment-706923296


   Merged build finished. Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #27735: [SPARK-30985][k8s] Support propagating SPARK_CONF_DIR files to driver and executor pods.

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #27735:
URL: https://github.com/apache/spark/pull/27735#issuecomment-706923312


   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/34271/
   Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] zero323 commented on pull request #30002: [SPARK-33086][FOLLOW-UP] Remove unused Optional import from pyspark.resource.profile stub

2020-10-12 Thread GitBox


zero323 commented on pull request #30002:
URL: https://github.com/apache/spark/pull/30002#issuecomment-706924177


   > It's okay since the PR is already open but let's avoid fixing only a 
couple of in a separate PR in general. They don't affect the main codes and can 
be fixed together when you touch these codes later.
   
   Noted. I couldn't identify why it didn't trigger error in CI and thought it 
is better to fix it now, before it starts failing build at some point in the 
future.
   
   Thanks.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] leanken commented on pull request #29983: [SPARK-13860][SQL] Change statistical aggregate function to return null instead of Double.NaN when divideByZero

2020-10-12 Thread GitBox


leanken commented on pull request #29983:
URL: https://github.com/apache/spark/pull/29983#issuecomment-706927266


   retest this please



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #29983: [SPARK-13860][SQL] Change statistical aggregate function to return null instead of Double.NaN when divideByZero

2020-10-12 Thread GitBox


SparkQA commented on pull request #29983:
URL: https://github.com/apache/spark/pull/29983#issuecomment-706929650


   Kubernetes integration test starting
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34272/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HeartSaVioR commented on pull request #29767: [SPARK-32896][SS] Add DataStreamWriter.saveAsTable API

2020-10-12 Thread GitBox


HeartSaVioR commented on pull request #29767:
URL: https://github.com/apache/spark/pull/29767#issuecomment-706931429


   Ah OK. Thanks for the guidance. I've updated the PR title and description as 
well, as the usage is a bit different from before.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #29983: [SPARK-13860][SQL] Change statistical aggregate function to return null instead of Double.NaN when divideByZero

2020-10-12 Thread GitBox


SparkQA commented on pull request #29983:
URL: https://github.com/apache/spark/pull/29983#issuecomment-706931586


   **[Test build #129674 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129674/testReport)**
 for PR 29983 at commit 
[`5095df4`](https://github.com/apache/spark/commit/5095df42051f18a5b752a85dd054db419cebb4f0).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] dongjoon-hyun closed pull request #30010: [SPARK-33117][BUILD] Update zstd-jni to 1.4.5-6

2020-10-12 Thread GitBox


dongjoon-hyun closed pull request #30010:
URL: https://github.com/apache/spark/pull/30010


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] dongjoon-hyun commented on pull request #30010: [SPARK-33117][BUILD] Update zstd-jni to 1.4.5-6

2020-10-12 Thread GitBox


dongjoon-hyun commented on pull request #30010:
URL: https://github.com/apache/spark/pull/30010#issuecomment-706932227


   GitHub Action passed. Merged to master.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #30011: [WIP][SPARK-32281][SQL] Spark keep SORTED spec in metastore

2020-10-12 Thread GitBox


SparkQA commented on pull request #30011:
URL: https://github.com/apache/spark/pull/30011#issuecomment-706933052


   Kubernetes integration test starting
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34273/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #29965: [SPARK-33016][SQL] Potential SQLMetrics missed which might cause WEB UI display issue while AQE is on.

2020-10-12 Thread GitBox


SparkQA commented on pull request #29965:
URL: https://github.com/apache/spark/pull/29965#issuecomment-706935871


   **[Test build #129675 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129675/testReport)**
 for PR 29965 at commit 
[`88313df`](https://github.com/apache/spark/commit/88313df3efa687a33ee5912cdfdf68015a68d548).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on a change in pull request #29983: [SPARK-13860][SQL] Change statistical aggregate function to return null instead of Double.NaN when divideByZero

2020-10-12 Thread GitBox


cloud-fan commented on a change in pull request #29983:
URL: https://github.com/apache/spark/pull/29983#discussion_r503094201



##
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/CentralMomentAgg.scala
##
@@ -62,6 +63,9 @@ abstract class CentralMomentAgg(child: Expression)
   protected val m3 = AttributeReference("m3", DoubleType, nullable = false)()
   protected val m4 = AttributeReference("m4", DoubleType, nullable = false)()
 
+  protected val divideByZeroEvalResult: Expression =
+if (SQLConf.get.legacyStatisticalAggregate) Double.NaN else 
Literal.create(null, DoubleType)

Review comment:
   can we move the flag to constructor parameter? e.g.
   ```
   abstract class CentralMomentAgg(child: Expression, nullOnDivideByZero: 
Boolean) {
 ...
 protected def divideByZeroEvalResult: Expression = if (nullOnDivideByZero) 
...
   }
   ```





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] dongjoon-hyun commented on pull request #29767: [SPARK-32896][SS] Add DataStreamWriter.saveAsTable API

2020-10-12 Thread GitBox


dongjoon-hyun commented on pull request #29767:
URL: https://github.com/apache/spark/pull/29767#issuecomment-706937317


   Sorry for missing that.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon commented on pull request #30012: [SPARK-XXX][INFRA] Rebalance GitHub Action jobs

2020-10-12 Thread GitBox


HyukjinKwon commented on pull request #30012:
URL: https://github.com/apache/spark/pull/30012#issuecomment-706937559


   Nice, it should be good to go as long as the elapsed time in the jobs look 
even.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon commented on a change in pull request #30012: [SPARK-XXX][INFRA] Rebalance GitHub Action jobs

2020-10-12 Thread GitBox


HyukjinKwon commented on a change in pull request #30012:
URL: https://github.com/apache/spark/pull/30012#discussion_r503097144



##
File path: .github/workflows/build_and_test.yml
##
@@ -42,9 +42,11 @@ jobs:
 mllib-local, mllib,
 yarn, mesos, kubernetes, hadoop-cloud, spark-ganglia-lgpl
   - >-
-pyspark-sql, pyspark-mllib, pyspark-resource
+pyspark-sql, pyspark-resource
   - >-
-pyspark-core, pyspark-streaming, pyspark-ml
+pyspark-core, pyspark-streaming
+  - >-
+pyspark-ml, pyspark-mllib

Review comment:
   @dongjoon-hyun, BTW do you have a GitHub Action workflow example of " 
The total execution time of two Python GitHub Action jobs is over 3 hours."? I 
just took a quick look and seems it's usually 1 hour 30 mins. For example, it 
has a change in core which probably runs most of tests: 
https://github.com/apache/spark/runs/1236699345





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] ScrapCodes commented on pull request #27735: [SPARK-30985][k8s] Support propagating SPARK_CONF_DIR files to driver and executor pods.

2020-10-12 Thread GitBox


ScrapCodes commented on pull request #27735:
URL: https://github.com/apache/spark/pull/27735#issuecomment-706940103


   Jenkins, retest this please.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon removed a comment on pull request #30012: [SPARK-XXX][INFRA] Rebalance GitHub Action jobs

2020-10-12 Thread GitBox


HyukjinKwon removed a comment on pull request #30012:
URL: https://github.com/apache/spark/pull/30012#issuecomment-706937559


   Nice, it should be good to go as long as the elapsed time in the jobs look 
even.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #30011: [WIP][SPARK-32281][SQL] Spark keep SORTED spec in metastore

2020-10-12 Thread GitBox


AmplabJenkins commented on pull request #30011:
URL: https://github.com/apache/spark/pull/30011#issuecomment-706942331







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #30011: [WIP][SPARK-32281][SQL] Spark keep SORTED spec in metastore

2020-10-12 Thread GitBox


SparkQA commented on pull request #30011:
URL: https://github.com/apache/spark/pull/30011#issuecomment-706942310


   Kubernetes integration test status success
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34273/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #29983: [SPARK-13860][SQL] Change statistical aggregate function to return null instead of Double.NaN when divideByZero

2020-10-12 Thread GitBox


SparkQA commented on pull request #29983:
URL: https://github.com/apache/spark/pull/29983#issuecomment-706942525


   Kubernetes integration test status success
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34272/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #29983: [SPARK-13860][SQL] Change statistical aggregate function to return null instead of Double.NaN when divideByZero

2020-10-12 Thread GitBox


AmplabJenkins commented on pull request #29983:
URL: https://github.com/apache/spark/pull/29983#issuecomment-706942553







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #27735: [SPARK-30985][k8s] Support propagating SPARK_CONF_DIR files to driver and executor pods.

2020-10-12 Thread GitBox


SparkQA commented on pull request #27735:
URL: https://github.com/apache/spark/pull/27735#issuecomment-706944027


   **[Test build #129676 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129676/testReport)**
 for PR 27735 at commit 
[`20fe56f`](https://github.com/apache/spark/commit/20fe56fbe80b3e2714dcd4fcae7ebfceb95e5443).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] Victsm commented on a change in pull request #29855: [SPARK-32915][CORE] Network-layer and shuffle RPC layer changes to support push shuffle blocks

2020-10-12 Thread GitBox


Victsm commented on a change in pull request #29855:
URL: https://github.com/apache/spark/pull/29855#discussion_r503101982



##
File path: 
common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/ErrorHandler.java
##
@@ -0,0 +1,92 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.spark.network.shuffle;
+
+import java.net.ConnectException;
+
+/**
+ * Plugs into {@link RetryingBlockFetcher} to further control when an 
exception should be retried
+ * and logged.
+ * Note: {@link RetryingBlockFetcher} will delegate the exception to this 
handler only when
+ * - remaining retries < max retries
+ * - exception is an IOException
+ */
+
+public interface ErrorHandler {
+
+  boolean shouldRetryError(Throwable t);
+
+  default boolean shouldLogError(Throwable t) {
+return true;
+  }
+
+  /**
+   * A no-op error handler instance.
+   */
+  ErrorHandler NOOP_ERROR_HANDLER = t -> true;
+
+  /**
+   * The error handler for pushing shuffle blocks to remote shuffle services.
+   */
+  class BlockPushErrorHandler implements ErrorHandler {
+/**
+ * String constant used for generating exception messages indicating a 
block to be merged
+ * arrives too late on the server side, and also for later checking such 
exceptions on the
+ * client side. When we get a block push failure because of the block 
arrives too late, we
+ * will not retry pushing the block nor log the exception on the client 
side.
+ */
+public static final String TOO_LATE_MESSAGE_SUFFIX =
+"received after merged shuffle is finalized";
+
+/**
+ * String constant used for generating exception messages indicating the 
server couldn't
+ * append a block after all available attempts due to collision with other 
blocks belonging

Review comment:
   Yes, we are aware of the distinction between mapId and mapPartitionId 
introduced in 3.0.
   We have internally discussed the implication of this change, and it should 
be minimum.
   SPARK-32923 was created so that we can make push-based shuffle supporting 
indeterminate stage retries as well.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #29999: [SPARK-33045][SQL] Support build-in function like_all and fix StackOverflowError issue.

2020-10-12 Thread GitBox


SparkQA commented on pull request #2:
URL: https://github.com/apache/spark/pull/2#issuecomment-706946167


   Kubernetes integration test starting
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34276/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] gemelen commented on pull request #29995: [SPARK-33080][BUILD] Replace fatal warnings snippet

2020-10-12 Thread GitBox


gemelen commented on pull request #29995:
URL: https://github.com/apache/spark/pull/29995#issuecomment-706946825


   cc @dongjoon-hyun 



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] maropu closed pull request #29975: [SPARK-33092][SQL] Support subexpression elimination in ProjectExec

2020-10-12 Thread GitBox


maropu closed pull request #29975:
URL: https://github.com/apache/spark/pull/29975


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] maropu commented on pull request #29975: [SPARK-33092][SQL] Support subexpression elimination in ProjectExec

2020-10-12 Thread GitBox


maropu commented on pull request #29975:
URL: https://github.com/apache/spark/pull/29975#issuecomment-706947101


   Thanks! Merged to master.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #30011: [WIP][SPARK-32281][SQL] Spark keep SORTED spec in metastore

2020-10-12 Thread GitBox


SparkQA commented on pull request #30011:
URL: https://github.com/apache/spark/pull/30011#issuecomment-706947970


   Kubernetes integration test starting
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34275/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] Victsm commented on a change in pull request #29855: [SPARK-32915][CORE] Network-layer and shuffle RPC layer changes to support push shuffle blocks

2020-10-12 Thread GitBox


Victsm commented on a change in pull request #29855:
URL: https://github.com/apache/spark/pull/29855#discussion_r503105917



##
File path: 
common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/OneForOneBlockPusher.java
##
@@ -0,0 +1,126 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.spark.network.shuffle;
+
+import java.nio.ByteBuffer;
+import java.util.Arrays;
+import java.util.Map;
+
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+
+import org.apache.spark.network.buffer.ManagedBuffer;
+import org.apache.spark.network.buffer.NioManagedBuffer;
+import org.apache.spark.network.client.RpcResponseCallback;
+import org.apache.spark.network.client.TransportClient;
+import org.apache.spark.network.shuffle.protocol.PushBlockStream;
+
+/**
+ * Similar to {@link OneForOneBlockFetcher}, but for pushing blocks to remote 
shuffle service to
+ * be merged instead of for fetching them from remote shuffle services. This 
is used by
+ * ShuffleWriter when the block push process is initiated. The supplied 
BlockFetchingListener
+ * is used to handle the success or failure in pushing each blocks.
+ */
+public class OneForOneBlockPusher {
+  private static final Logger logger = 
LoggerFactory.getLogger(OneForOneBlockPusher.class);
+  private static final ErrorHandler PUSH_ERROR_HANDLER = new 
ErrorHandler.BlockPushErrorHandler();
+
+  private final TransportClient client;
+  private final String appId;
+  private final String[] blockIds;
+  private final BlockFetchingListener listener;
+  private final Map buffers;
+
+  public OneForOneBlockPusher(
+  TransportClient client,
+  String appId,
+  String[] blockIds,
+  BlockFetchingListener listener,
+  Map buffers) {
+this.client = client;
+this.appId = appId;
+this.blockIds = blockIds;
+this.listener = listener;
+this.buffers = buffers;
+  }
+
+  private class BlockPushCallback implements RpcResponseCallback {
+
+private int index;
+private String blockId;
+
+BlockPushCallback(int index, String blockId) {
+  this.index = index;
+  this.blockId = blockId;
+}
+
+@Override
+public void onSuccess(ByteBuffer response) {
+  // On receipt of a successful block push
+  listener.onBlockFetchSuccess(blockId, new 
NioManagedBuffer(ByteBuffer.allocate(0)));
+}
+
+@Override
+public void onFailure(Throwable e) {
+  // Since block push is best effort, i.e., if we encountered a block push 
failure that's not
+  // retriable or exceeding the max retires, we should not fail all 
remaining block pushes.
+  // The best effort nature makes block push tolerable of a partial 
completion. Thus, we only
+  // fail the block that's actually failed. Not that, on the 
RetryingBlockFetcher side, once
+  // retry is initiated, it would still invalidate the previous active 
retry listener, and
+  // retry all outstanding blocks. We are preventing forwarding 
unnecessary block push failures
+  // to the parent listener of the retry listener. The only exceptions 
would be if the block
+  // push failure is due to block arriving on the server side after merge 
finalization, or the
+  // client fails to establish connection to the server side. In both 
cases, we would fail all
+  // remaining blocks.
+  if (PUSH_ERROR_HANDLER.shouldRetryError(e)) {
+String[] targetBlockId = Arrays.copyOfRange(blockIds, index, index + 
1);
+failRemainingBlocks(targetBlockId, e);
+  } else {
+String[] targetBlockId = Arrays.copyOfRange(blockIds, index, 
blockIds.length);

Review comment:
   Each time we call `failRemainingBlocks`, if the block transfer is 
retriable, then `RetryingBlockFetchListener` will initiate the retry.
   As part of that, it will retry all remaining block transfers and also change 
`currentListener`, so it would ignore any callback invocation for blocks from 
the previous attempt.
   For example, once we invoke `failRemainingBlocks(Array(1, 2, 3, 4), e)`, 
block 1, 2, 3, 4 will be retired if e is retriable.
   If later on, `failRemainingBlocks(Array(2, 3, 4), e)` is invoked, it means 
the transfer of block 2 durin

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29983: [SPARK-13860][SQL] Change statistical aggregate function to return null instead of Double.NaN when divideByZero

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #29983:
URL: https://github.com/apache/spark/pull/29983#issuecomment-706942553







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #30011: [WIP][SPARK-32281][SQL] Spark keep SORTED spec in metastore

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #30011:
URL: https://github.com/apache/spark/pull/30011#issuecomment-706942331







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] Victsm commented on a change in pull request #29855: [SPARK-32915][CORE] Network-layer and shuffle RPC layer changes to support push shuffle blocks

2020-10-12 Thread GitBox


Victsm commented on a change in pull request #29855:
URL: https://github.com/apache/spark/pull/29855#discussion_r503108862



##
File path: 
common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/ExternalBlockHandler.java
##
@@ -89,6 +110,21 @@ public void receive(TransportClient client, ByteBuffer 
message, RpcResponseCallb
 handleMessage(msgObj, client, callback);
   }
 
+  @Override
+  public StreamCallbackWithID receiveStream(

Review comment:
   This is from the API introduced in SPARK-6237.
   That API was used for transferring large RDD partition blocks (>2GB), and we 
reused it for transferring shuffle partition blocks.
   Prefer to keep this API's name generic in case there are other use cases 
that could reuse this as well.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #30012: [SPARK-XXX][INFRA] Rebalance GitHub Action jobs

2020-10-12 Thread GitBox


SparkQA commented on pull request #30012:
URL: https://github.com/apache/spark/pull/30012#issuecomment-706952849


   Kubernetes integration test starting
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34274/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #27735: [SPARK-30985][k8s] Support propagating SPARK_CONF_DIR files to driver and executor pods.

2020-10-12 Thread GitBox


SparkQA commented on pull request #27735:
URL: https://github.com/apache/spark/pull/27735#issuecomment-706952994


   **[Test build #129676 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129676/testReport)**
 for PR 27735 at commit 
[`20fe56f`](https://github.com/apache/spark/commit/20fe56fbe80b3e2714dcd4fcae7ebfceb95e5443).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #27735: [SPARK-30985][k8s] Support propagating SPARK_CONF_DIR files to driver and executor pods.

2020-10-12 Thread GitBox


AmplabJenkins commented on pull request #27735:
URL: https://github.com/apache/spark/pull/27735#issuecomment-706953214







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #27735: [SPARK-30985][k8s] Support propagating SPARK_CONF_DIR files to driver and executor pods.

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #27735:
URL: https://github.com/apache/spark/pull/27735#issuecomment-706953214







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on pull request #27735: [SPARK-30985][k8s] Support propagating SPARK_CONF_DIR files to driver and executor pods.

2020-10-12 Thread GitBox


SparkQA removed a comment on pull request #27735:
URL: https://github.com/apache/spark/pull/27735#issuecomment-706944027


   **[Test build #129676 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129676/testReport)**
 for PR 27735 at commit 
[`20fe56f`](https://github.com/apache/spark/commit/20fe56fbe80b3e2714dcd4fcae7ebfceb95e5443).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] Victsm commented on a change in pull request #29855: [SPARK-32915][CORE] Network-layer and shuffle RPC layer changes to support push shuffle blocks

2020-10-12 Thread GitBox


Victsm commented on a change in pull request #29855:
URL: https://github.com/apache/spark/pull/29855#discussion_r503114602



##
File path: 
common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/ExternalBlockStoreClient.java
##
@@ -116,6 +121,42 @@ public void fetchBlocks(
 }
   }
 
+  @Override
+  public void pushBlocks(
+  String host,
+  int port,
+  String[] blockIds,
+  ManagedBuffer[] buffers,
+  BlockFetchingListener listener) {
+checkInit();
+assert blockIds.length == buffers.length : "Number of block ids and 
buffers do not match.";
+
+Map buffersWithId = new HashMap<>();
+for (int i = 0; i < blockIds.length; i++) {
+  buffersWithId.put(blockIds[i], buffers[i]);
+}
+logger.debug("Push {} shuffle blocks to {}:{}", blockIds.length, host, 
port);
+try {
+  RetryingBlockFetcher.BlockFetchStarter blockPushStarter =
+  (blockIds1, listener1) -> {
+TransportClient client = clientFactory.createClient(host, port);
+new OneForOneBlockPusher(client, appId, blockIds1, listener1, 
buffersWithId).start();
+  };
+  int maxRetries = conf.maxIORetries();
+  if (maxRetries > 0) {
+new RetryingBlockFetcher(
+  conf, blockPushStarter, blockIds, listener, 
PUSH_ERROR_HANDLER).start();
+  } else {
+blockPushStarter.createAndStart(blockIds, listener);

Review comment:
   This follows the way it is done in the `fetchBlocks` API.
   Should we remove it?





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #30011: [WIP][SPARK-32281][SQL] Spark keep SORTED spec in metastore

2020-10-12 Thread GitBox


AmplabJenkins commented on pull request #30011:
URL: https://github.com/apache/spark/pull/30011#issuecomment-706958594







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #30011: [WIP][SPARK-32281][SQL] Spark keep SORTED spec in metastore

2020-10-12 Thread GitBox


SparkQA commented on pull request #30011:
URL: https://github.com/apache/spark/pull/30011#issuecomment-706958572


   Kubernetes integration test status success
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34275/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #30011: [WIP][SPARK-32281][SQL] Spark keep SORTED spec in metastore

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #30011:
URL: https://github.com/apache/spark/pull/30011#issuecomment-706958594







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #29999: [SPARK-33045][SQL] Support build-in function like_all and fix StackOverflowError issue.

2020-10-12 Thread GitBox


SparkQA commented on pull request #2:
URL: https://github.com/apache/spark/pull/2#issuecomment-706960711


   Kubernetes integration test status success
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34276/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #29999: [SPARK-33045][SQL] Support build-in function like_all and fix StackOverflowError issue.

2020-10-12 Thread GitBox


AmplabJenkins commented on pull request #2:
URL: https://github.com/apache/spark/pull/2#issuecomment-706960737







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #29999: [SPARK-33045][SQL] Support build-in function like_all and fix StackOverflowError issue.

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #2:
URL: https://github.com/apache/spark/pull/2#issuecomment-706960737







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] zhengruifeng opened a new pull request #30013: [SPARK-32455][ML][Follow-Up] LogisticRegressionModel prediction optimization

2020-10-12 Thread GitBox


zhengruifeng opened a new pull request #30013:
URL: https://github.com/apache/spark/pull/30013


   ### What changes were proposed in this pull request?
   use `lazy array` instead of `var` for auxiliary variables in binary lor
   
   ### Why are the changes needed?
   In https://github.com/apache/spark/pull/29255, I made a mistake:
   the `private var _threshold` and `_rawThreshold`  are initialized by defaut 
values of `threshold`, that is beacuse: 
   1, param `threshold` is set default value at first;
   2, `_threshold` and `_rawThreshold` are initialized based on the default 
value;
   3, param `threshold` is updated by the value from estimator, by `.set` method
   ```
 if (map.contains(param) && to.hasParam(param.name)) {
   to.set(param.name, map(param))
 }
   ```
   
   We can update `_threshold` and `_rawThreshold` in `setThreshold` and 
`setThresholds`, but we can not update them in `set` so their values are 
computed based on default values.
   
   
   
   ### Does this PR introduce _any_ user-facing change?
   No
   
   
   ### How was this patch tested?
   test in repl
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #30012: [SPARK-XXX][INFRA] Rebalance GitHub Action jobs

2020-10-12 Thread GitBox


AmplabJenkins commented on pull request #30012:
URL: https://github.com/apache/spark/pull/30012#issuecomment-706963597







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #30012: [SPARK-XXX][INFRA] Rebalance GitHub Action jobs

2020-10-12 Thread GitBox


SparkQA commented on pull request #30012:
URL: https://github.com/apache/spark/pull/30012#issuecomment-706963566


   Kubernetes integration test status success
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34274/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #30012: [SPARK-XXX][INFRA] Rebalance GitHub Action jobs

2020-10-12 Thread GitBox


AmplabJenkins removed a comment on pull request #30012:
URL: https://github.com/apache/spark/pull/30012#issuecomment-706963597







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] zhengruifeng commented on pull request #30013: [SPARK-32455][ML][Follow-Up] LogisticRegressionModel prediction optimization

2020-10-12 Thread GitBox


zhengruifeng commented on pull request #30013:
URL: https://github.com/apache/spark/pull/30013#issuecomment-706965136


   test in commit 27eab00:
   
   ```
   import scala.util.Random
   import org.apache.spark.ml.linalg._
   import org.apache.spark.ml.classification._
   import org.apache.spark.ml.regression._
   import org.apache.spark.sql.functions._
   import org.apache.spark.storage.StorageLevel
   
   
   
   val df = spark.read.option("numFeatures", 
"2000").format("libsvm").load("/data1/Datasets/epsilon/epsilon_normalized.t").withColumn("aftcensor",
 (col("label")+1)/2).withColumn("aftlabel", 
(col("label")+2)/2).withColumn("label", (col("label")+1)/2).limit(100)
   df.persist(StorageLevel.MEMORY_AND_DISK)
   df.count
   
   val vec = df.select("features").head.getAs[Vector](0)
   
   val lor = new LogisticRegression().setMaxIter(1).setThreshold(0.1)
   
   val lorm = lor.fit(df)
   
   lorm.getThreshold
   
   lorm.predict(vec)
   
   ```
   
   results:
   // master
   scala> val lorm = lor.fit(df)
   20/10/12 15:47:23 WARN LogisticRegressionModel: **_threshold=0.5, 
_rawThreshold=0.0**
   lorm: org.apache.spark.ml.classification.LogisticRegressionModel = 
LogisticRegressionModel: uid=logreg_4c79066a563d, numClasses=2, numFeatures=2000
   
   scala> lorm.getThreshold
   res9: Double = 0.1
   
   scala> lorm.predict(vec)
   20/10/12 15:47:29 WARN LogisticRegressionModel: **_threshold=0.5, 
_rawThreshold=0.0**
   res10: Double = 0.0
   
   The `_threshold` and `_rawThreshold` here are incorrect.
   
   
   // this PR
   scala> lorm.predict(vec)
   20/10/12 16:01:09 WARN LogisticRegressionModel: _threshold=0.1, 
_rawThreshold=-2.197224577336219
   res3: Double = 0.0
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] zhongyi-peng commented on pull request #16485: [SPARK-19099] correct the wrong time display in history server web UI

2020-10-12 Thread GitBox


zhongyi-peng commented on pull request #16485:
URL: https://github.com/apache/spark/pull/16485#issuecomment-706965528


   suppose to be?so it is not changable?



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



  1   2   3   4   5   6   7   8   9   >