[GitHub] [spark] SparkQA commented on pull request #31190: [SPARK-34118][CORE][SQL][3.0] Replaces filter and check for emptiness with exists or forall

2021-01-14 Thread GitBox


SparkQA commented on pull request #31190:
URL: https://github.com/apache/spark/pull/31190#issuecomment-760720166


   Kubernetes integration test status success
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/38672/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] Ngone51 closed pull request #31053: [SPARK-32165][SQL] Ensure Spark only initiates SharedState once across SparkSessions

2021-01-14 Thread GitBox


Ngone51 closed pull request #31053:
URL: https://github.com/apache/spark/pull/31053


   






[GitHub] [spark] sunchao commented on a change in pull request #31107: [SPARK-34052][SQL] A cached view should become invalid after the source table is dropped

2021-01-14 Thread GitBox


sunchao commented on a change in pull request #31107:
URL: https://github.com/apache/spark/pull/31107#discussion_r557982546



##
File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/v2Commands.scala
##
@@ -833,6 +833,7 @@ case class CacheTable(
 case class CacheTableAsSelect(
 tempViewName: String,
 plan: LogicalPlan,
+originalText: Option[String],

Review comment:
   `AstBuilder` is the only place I see that constructs a `CacheTableAsSelect`, so yes, we can probably remove the `Option`.








[GitHub] [spark] sunchao commented on a change in pull request #31107: [SPARK-34052][SQL] A cached view should become invalid after the source table is dropped

2021-01-14 Thread GitBox


sunchao commented on a change in pull request #31107:
URL: https://github.com/apache/spark/pull/31107#discussion_r557980220



##
File path: sql/core/src/main/scala/org/apache/spark/sql/internal/CatalogImpl.scala
##
@@ -399,7 +399,7 @@ class CatalogImpl(sparkSession: SparkSession) extends Catalog {
   try {
 val plan = sparkSession.sessionState.executePlan(viewDef)
 sparkSession.sharedState.cacheManager.uncacheQuery(
-  sparkSession, plan.analyzed, cascade = false)
+  sparkSession, plan.analyzed, cascade = true)

Review comment:
   Oops, I forgot to update this. I will update it and add test cases.








[GitHub] [spark] sunchao commented on a change in pull request #31107: [SPARK-34052][SQL] A cached view should become invalid after the source table is dropped

2021-01-14 Thread GitBox


sunchao commented on a change in pull request #31107:
URL: https://github.com/apache/spark/pull/31107#discussion_r557979466



##
File path: sql/core/src/main/scala/org/apache/spark/sql/execution/command/ddl.scala
##
@@ -238,7 +238,7 @@ case class DropTableCommand(
 if (isTempView || catalog.tableExists(tableName)) {
   try {
 sparkSession.sharedState.cacheManager.uncacheQuery(
-  sparkSession.table(tableName), cascade = !isTempView)
+  sparkSession.table(tableName), cascade = !(isTempView && conf.storeAnalyzedPlanForView))

Review comment:
   Good point. Will update.
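
   For illustration only (this is not part of the PR): a minimal stand-alone sketch that tabulates the two `cascade` expressions from the diff above. The object and method names are hypothetical, and nothing here calls Spark; only the two boolean expressions are taken from the diff.

```scala
// Hypothetical stand-alone sketch; not Spark code.
object CascadeSketch {
  // Cascade flag as written on the removed line of the diff.
  def cascadeBefore(isTempView: Boolean): Boolean = !isTempView

  // Cascade flag as written on the added line of the diff.
  def cascadeAfter(isTempView: Boolean, storeAnalyzedPlanForView: Boolean): Boolean =
    !(isTempView && storeAnalyzedPlanForView)

  def main(args: Array[String]): Unit = {
    // Print the value of both expressions for every input combination.
    for (isTempView <- Seq(false, true); stored <- Seq(false, true)) {
      println(
        s"isTempView=$isTempView storeAnalyzedPlanForView=$stored " +
          s"before=${cascadeBefore(isTempView)} after=${cascadeAfter(isTempView, stored)}")
    }
  }
}
```

   The two expressions differ only when `isTempView` is true and `storeAnalyzedPlanForView` is false: the old expression yields `cascade = false`, the new one `cascade = true`.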








[GitHub] [spark] SparkQA commented on pull request #30974: [SPARK-33939][SQL] Make Column.named UnresolvedExtractValue use UnresolvedAlias to assign name

2021-01-14 Thread GitBox


SparkQA commented on pull request #30974:
URL: https://github.com/apache/spark/pull/30974#issuecomment-760717657


   **[Test build #134093 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134093/testReport)** for PR 30974 at commit [`65fe331`](https://github.com/apache/spark/commit/65fe331d0ec609232a2b5a92622753531181ac66).






[GitHub] [spark] SparkQA commented on pull request #31186: [SPARK-33790][CORE][3.1] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2021-01-14 Thread GitBox


SparkQA commented on pull request #31186:
URL: https://github.com/apache/spark/pull/31186#issuecomment-760717442


   **[Test build #134091 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134091/testReport)** for PR 31186 at commit [`41c2277`](https://github.com/apache/spark/commit/41c22773f7b4ffe7805bd443f35940013b976146).






[GitHub] [spark] SparkQA commented on pull request #31160: [SPARK-34080][ML][PYTHON] Add UnivariateFeatureSelector

2021-01-14 Thread GitBox


SparkQA commented on pull request #31160:
URL: https://github.com/apache/spark/pull/31160#issuecomment-760717455


   **[Test build #134092 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134092/testReport)** for PR 31160 at commit [`1a9d26b`](https://github.com/apache/spark/commit/1a9d26bd99437c54ce6fd229ed141c897972a82e).






[GitHub] [spark] SparkQA commented on pull request #31185: [SPARK-34111][BUILD][test-maven] Deconflict the jars jakarta.servlet-api-4.0.3.jar and javax.servlet-api-3.1.0.jar

2021-01-14 Thread GitBox


SparkQA commented on pull request #31185:
URL: https://github.com/apache/spark/pull/31185#issuecomment-760713933


   Kubernetes integration test starting
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/38673/
   






[GitHub] [spark] AmplabJenkins removed a comment on pull request #31160: [SPARK-34080][ML][PYTHON] Add UnivariateFeatureSelector

2021-01-14 Thread GitBox


AmplabJenkins removed a comment on pull request #31160:
URL: https://github.com/apache/spark/pull/31160#issuecomment-760713222


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/38668/
   






[GitHub] [spark] AmplabJenkins removed a comment on pull request #31186: [SPARK-33790][CORE][3.1] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2021-01-14 Thread GitBox


AmplabJenkins removed a comment on pull request #31186:
URL: https://github.com/apache/spark/pull/31186#issuecomment-760713219


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134075/
   






[GitHub] [spark] AmplabJenkins removed a comment on pull request #31187: [SPARK-33790][CORE][3.0] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2021-01-14 Thread GitBox


AmplabJenkins removed a comment on pull request #31187:
URL: https://github.com/apache/spark/pull/31187#issuecomment-760713231


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/38667/
   






[GitHub] [spark] AmplabJenkins removed a comment on pull request #31142: [SPARK-33346][CORE][SQL][MLLIB][DSTREAM][K8S] Change the never changed 'var' to 'val'

2021-01-14 Thread GitBox


AmplabJenkins removed a comment on pull request #31142:
URL: https://github.com/apache/spark/pull/31142#issuecomment-760713232


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134077/
   






[GitHub] [spark] AmplabJenkins removed a comment on pull request #29737: [SPARK-32864][SQL] Support ORC forced positional evolution

2021-01-14 Thread GitBox


AmplabJenkins removed a comment on pull request #29737:
URL: https://github.com/apache/spark/pull/29737#issuecomment-760713228


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134070/
   






[GitHub] [spark] AmplabJenkins removed a comment on pull request #31192: [SPARK-34118][CORE][SQL][2.4] Replaces filter and check for emptiness with exists or forall

2021-01-14 Thread GitBox


AmplabJenkins removed a comment on pull request #31192:
URL: https://github.com/apache/spark/pull/31192#issuecomment-760713223


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/38671/
   






[GitHub] [spark] AmplabJenkins commented on pull request #31142: [SPARK-33346][CORE][SQL][MLLIB][DSTREAM][K8S] Change the never changed 'var' to 'val'

2021-01-14 Thread GitBox


AmplabJenkins commented on pull request #31142:
URL: https://github.com/apache/spark/pull/31142#issuecomment-760713232


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134077/
   






[GitHub] [spark] AmplabJenkins removed a comment on pull request #31189: [SPARK-34122][SQL] Remove duplicated branches in CaseWhen

2021-01-14 Thread GitBox


AmplabJenkins removed a comment on pull request #31189:
URL: https://github.com/apache/spark/pull/31189#issuecomment-760713218


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/38665/
   






[GitHub] [spark] AmplabJenkins commented on pull request #31160: [SPARK-34080][ML][PYTHON] Add UnivariateFeatureSelector

2021-01-14 Thread GitBox


AmplabJenkins commented on pull request #31160:
URL: https://github.com/apache/spark/pull/31160#issuecomment-760713222


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/38668/
   






[GitHub] [spark] AmplabJenkins commented on pull request #31192: [SPARK-34118][CORE][SQL][2.4] Replaces filter and check for emptiness with exists or forall

2021-01-14 Thread GitBox


AmplabJenkins commented on pull request #31192:
URL: https://github.com/apache/spark/pull/31192#issuecomment-760713223


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/38671/
   






[GitHub] [spark] AmplabJenkins commented on pull request #31185: [SPARK-34111][BUILD][test-maven] Deconflict the jars jakarta.servlet-api-4.0.3.jar and javax.servlet-api-3.1.0.jar

2021-01-14 Thread GitBox


AmplabJenkins commented on pull request #31185:
URL: https://github.com/apache/spark/pull/31185#issuecomment-760713224


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134076/
   






[GitHub] [spark] AmplabJenkins commented on pull request #31188: [SPARK-34118][CORE][SQL][3.1] Replaces filter and check for emptiness with exists or forall

2021-01-14 Thread GitBox


AmplabJenkins commented on pull request #31188:
URL: https://github.com/apache/spark/pull/31188#issuecomment-760713220


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/38666/
   






[GitHub] [spark] AmplabJenkins commented on pull request #31187: [SPARK-33790][CORE][3.0] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2021-01-14 Thread GitBox


AmplabJenkins commented on pull request #31187:
URL: https://github.com/apache/spark/pull/31187#issuecomment-760713231


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/38667/
   






[GitHub] [spark] AmplabJenkins removed a comment on pull request #31188: [SPARK-34118][CORE][SQL][3.1] Replaces filter and check for emptiness with exists or forall

2021-01-14 Thread GitBox


AmplabJenkins removed a comment on pull request #31188:
URL: https://github.com/apache/spark/pull/31188#issuecomment-760713220


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/38666/
   






[GitHub] [spark] AmplabJenkins removed a comment on pull request #31185: [SPARK-34111][BUILD][test-maven] Deconflict the jars jakarta.servlet-api-4.0.3.jar and javax.servlet-api-3.1.0.jar

2021-01-14 Thread GitBox


AmplabJenkins removed a comment on pull request #31185:
URL: https://github.com/apache/spark/pull/31185#issuecomment-760713224


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134076/
   






[GitHub] [spark] AmplabJenkins removed a comment on pull request #31193: [SPARK-34117][SQL] Disable LeftSemi/LeftAnti push down over Aggregate

2021-01-14 Thread GitBox


AmplabJenkins removed a comment on pull request #31193:
URL: https://github.com/apache/spark/pull/31193#issuecomment-760690501










[GitHub] [spark] AmplabJenkins commented on pull request #31189: [SPARK-34122][SQL] Remove duplicated branches in CaseWhen

2021-01-14 Thread GitBox


AmplabJenkins commented on pull request #31189:
URL: https://github.com/apache/spark/pull/31189#issuecomment-760713218


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/38665/
   






[GitHub] [spark] AmplabJenkins commented on pull request #31193: [SPARK-34117][SQL] Disable LeftSemi/LeftAnti push down over Aggregate

2021-01-14 Thread GitBox


AmplabJenkins commented on pull request #31193:
URL: https://github.com/apache/spark/pull/31193#issuecomment-760713230


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/38670/
   






[GitHub] [spark] AmplabJenkins commented on pull request #29737: [SPARK-32864][SQL] Support ORC forced positional evolution

2021-01-14 Thread GitBox


AmplabJenkins commented on pull request #29737:
URL: https://github.com/apache/spark/pull/29737#issuecomment-760713228


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134070/
   






[GitHub] [spark] AmplabJenkins commented on pull request #31186: [SPARK-33790][CORE][3.1] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2021-01-14 Thread GitBox


AmplabJenkins commented on pull request #31186:
URL: https://github.com/apache/spark/pull/31186#issuecomment-760713219


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134075/
   






[GitHub] [spark] SparkQA commented on pull request #31185: [SPARK-34111][BUILD][test-maven] Deconflict the jars jakarta.servlet-api-4.0.3.jar and javax.servlet-api-3.1.0.jar

2021-01-14 Thread GitBox


SparkQA commented on pull request #31185:
URL: https://github.com/apache/spark/pull/31185#issuecomment-760713017


   Kubernetes integration test starting
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/38676/
   






[GitHub] [spark] SparkQA commented on pull request #31193: [SPARK-34117][SQL] Disable LeftSemi/LeftAnti push down over Aggregate

2021-01-14 Thread GitBox


SparkQA commented on pull request #31193:
URL: https://github.com/apache/spark/pull/31193#issuecomment-760712164


   Kubernetes integration test status failure
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/38670/
   






[GitHub] [spark] yaooqinn commented on pull request #31079: [SPARK-34037][SQL] Remove unnecessary upcasting for Avg & Sum which handle by themself internally

2021-01-14 Thread GitBox


yaooqinn commented on pull request #31079:
URL: https://github.com/apache/spark/pull/31079#issuecomment-760711076


   Updated, thanks.






[GitHub] [spark] SparkQA commented on pull request #31079: [SPARK-34037][SQL] Remove unnecessary upcasting for Avg & Sum which handle by themself internally

2021-01-14 Thread GitBox


SparkQA commented on pull request #31079:
URL: https://github.com/apache/spark/pull/31079#issuecomment-760710848


   Kubernetes integration test starting
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/38675/
   






[GitHub] [spark] SparkQA commented on pull request #31193: [SPARK-34117][SQL] Disable LeftSemi/LeftAnti push down over Aggregate

2021-01-14 Thread GitBox


SparkQA commented on pull request #31193:
URL: https://github.com/apache/spark/pull/31193#issuecomment-760710298


   Kubernetes integration test starting
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/38670/
   






[GitHub] [spark] SparkQA removed a comment on pull request #31142: [SPARK-33346][CORE][SQL][MLLIB][DSTREAM][K8S] Change the never changed 'var' to 'val'

2021-01-14 Thread GitBox


SparkQA removed a comment on pull request #31142:
URL: https://github.com/apache/spark/pull/31142#issuecomment-760643095


   **[Test build #134077 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134077/testReport)** for PR 31142 at commit [`c599a05`](https://github.com/apache/spark/commit/c599a054b0ae30ce75a359331ebc9bde366ceac2).






[GitHub] [spark] SparkQA commented on pull request #31190: [SPARK-34118][CORE][SQL][3.0] Replaces filter and check for emptiness with exists or forall

2021-01-14 Thread GitBox


SparkQA commented on pull request #31190:
URL: https://github.com/apache/spark/pull/31190#issuecomment-760709735


   Kubernetes integration test starting
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/38672/
   






[GitHub] [spark] SparkQA commented on pull request #31142: [SPARK-33346][CORE][SQL][MLLIB][DSTREAM][K8S] Change the never changed 'var' to 'val'

2021-01-14 Thread GitBox


SparkQA commented on pull request #31142:
URL: https://github.com/apache/spark/pull/31142#issuecomment-760709225


   **[Test build #134077 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134077/testReport)** for PR 31142 at commit [`c599a05`](https://github.com/apache/spark/commit/c599a054b0ae30ce75a359331ebc9bde366ceac2).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.






[GitHub] [spark] cloud-fan commented on pull request #31079: [SPARK-34037][SQL] ResolveAggregateFunctions pushes duplicated sort order into aggregate because of unnecessary casting

2021-01-14 Thread GitBox


cloud-fan commented on pull request #31079:
URL: https://github.com/apache/spark/pull/31079#issuecomment-760705381


   @yaooqinn can we update the PR title and description? Now it's a simple 
improvement to avoid unnecessary casts for sum/avg.






[GitHub] [spark] HeartSaVioR commented on pull request #31186: [SPARK-33790][CORE][3.1] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2021-01-14 Thread GitBox


HeartSaVioR commented on pull request #31186:
URL: https://github.com/apache/spark/pull/31186#issuecomment-760704537


   retest this, please






[GitHub] [spark] cloud-fan commented on a change in pull request #31119: [SPARK-34064][SQL] Cancel the running broadcast sub-jobs when SQL statement is cancelled

2021-01-14 Thread GitBox


cloud-fan commented on a change in pull request #31119:
URL: https://github.com/apache/spark/pull/31119#discussion_r557936826



##
File path: 
sql/core/src/main/scala/org/apache/spark/sql/execution/exchange/BroadcastExchangeExec.scala
##
@@ -74,7 +74,10 @@ case class BroadcastExchangeExec(
 child: SparkPlan) extends BroadcastExchangeLike {
   import BroadcastExchangeExec._
 
-  override val runId: UUID = UUID.randomUUID
+  // Cancelling a SQL statement from Spark ThriftServer needs to cancel
+  // its related broadcast sub-jobs. So set the run id to job group id if exists.
+  override val runId: UUID = Option(sparkContext.getLocalProperty(SparkContext.SPARK_JOB_GROUP_ID))

Review comment:
   @LantaoJin what do you think?








[GitHub] [spark] cloud-fan commented on a change in pull request #31119: [SPARK-34064][SQL] Cancel the running broadcast sub-jobs when SQL statement is cancelled

2021-01-14 Thread GitBox


cloud-fan commented on a change in pull request #31119:
URL: https://github.com/apache/spark/pull/31119#discussion_r557936604



##
File path: 
sql/core/src/main/scala/org/apache/spark/sql/execution/exchange/BroadcastExchangeExec.scala
##
@@ -74,7 +74,10 @@ case class BroadcastExchangeExec(
 child: SparkPlan) extends BroadcastExchangeLike {
   import BroadcastExchangeExec._
 
-  override val runId: UUID = UUID.randomUUID
+  // Cancelling a SQL statement from Spark ThriftServer needs to cancel
+  // its related broadcast sub-jobs. So set the run id to job group id if exists.
+  override val runId: UUID = Option(sparkContext.getLocalProperty(SparkContext.SPARK_JOB_GROUP_ID))

Review comment:
   On second thought, I think this is risky. It's possible that in a 
non-STS environment, users set the job group id manually and run some 
long-running jobs. If we capture the job group id here in the broadcast 
exchange, then when the broadcast times out it will cancel the whole job 
group, which may unexpectedly kill the user's other long-running jobs.
   
   I think we need to revisit STS's SQL statement cancellation feature. We 
should use the SQL execution ID to find all the jobs of a SQL query, and 
assign a unique job group id to them.
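   
   To make the concern concrete, here is a minimal sketch in plain Scala with 
no Spark dependency. `Job`, `ToyScheduler`, `cancelGroup` and the group names 
are hypothetical and only mimic group-wide cancellation; they are not Spark 
APIs and not the code in this PR. The sketch compares reusing a user-set job 
group id with giving the SQL execution its own group id.

```scala
// Toy model only: every name here is illustrative, not a Spark API.
final case class Job(id: Int, groupId: String, description: String)

final class ToyScheduler {
  private var running: Vector[Job] = Vector.empty
  private var nextId: Int = 0

  // Register a running job under the given group id.
  def submit(groupId: String, description: String): Job = {
    val job = Job(nextId, groupId, description)
    nextId += 1
    running = running :+ job
    job
  }

  // Cancel every running job in the group and return the cancelled jobs.
  def cancelGroup(groupId: String): Vector[Job] = {
    val (cancelled, kept) = running.partition(_.groupId == groupId)
    running = kept
    cancelled
  }

  def runningJobs: Vector[Job] = running
}

object JobGroupDemo {
  // Scenario 1: the broadcast sub-job inherits the user's manually set group id.
  val shared = new ToyScheduler
  shared.submit("user-group", "long-running ETL job")
  shared.submit("user-group", "broadcast sub-job")
  // A broadcast timeout cancels by group id and takes the ETL job with it.
  val cancelledShared: Vector[Job] = shared.cancelGroup("user-group")

  // Scenario 2: the SQL execution gets its own (assumed unique) group id.
  val isolated = new ToyScheduler
  isolated.submit("user-group", "long-running ETL job")
  isolated.submit("sql-execution-42", "broadcast sub-job")
  // Only the jobs that belong to that SQL execution are cancelled.
  val cancelledIsolated: Vector[Job] = isolated.cancelGroup("sql-execution-42")
}
```

   In the first scenario both jobs are cancelled; in the second, the 
unrelated long-running job keeps running.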








[GitHub] [spark] SparkQA commented on pull request #31192: [SPARK-34118][CORE][SQL][2.4] Replaces filter and check for emptiness with exists or forall

2021-01-14 Thread GitBox


SparkQA commented on pull request #31192:
URL: https://github.com/apache/spark/pull/31192#issuecomment-760703741


   Kubernetes integration test status success
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/38671/
   






[GitHub] [spark] LuciferYang commented on pull request #30746: [SPARK-33766][BUILD] Upgrade Jackson to 2.11.4

2021-01-14 Thread GitBox


LuciferYang commented on pull request #30746:
URL: https://github.com/apache/spark/pull/30746#issuecomment-760703656


   Got it, I'll file a new JIRA and try to fix it first :)






[GitHub] [spark] SparkQA removed a comment on pull request #31185: [SPARK-34111][BUILD][test-maven] Deconflict the jars jakarta.servlet-api-4.0.3.jar and javax.servlet-api-3.1.0.jar

2021-01-14 Thread GitBox


SparkQA removed a comment on pull request #31185:
URL: https://github.com/apache/spark/pull/31185#issuecomment-760641904


   **[Test build #134076 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134076/testReport)**
 for PR 31185 at commit 
[`f8f12a0`](https://github.com/apache/spark/commit/f8f12a0734ac2708d41ea68a8721b85541c6d4ca).






[GitHub] [spark] SparkQA commented on pull request #31185: [SPARK-34111][BUILD][test-maven] Deconflict the jars jakarta.servlet-api-4.0.3.jar and javax.servlet-api-3.1.0.jar

2021-01-14 Thread GitBox


SparkQA commented on pull request #31185:
URL: https://github.com/apache/spark/pull/31185#issuecomment-760702444


   **[Test build #134076 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134076/testReport)**
 for PR 31185 at commit 
[`f8f12a0`](https://github.com/apache/spark/commit/f8f12a0734ac2708d41ea68a8721b85541c6d4ca).
* This patch **fails PySpark unit tests**.
* This patch merges cleanly.
* This patch adds no public classes.






[GitHub] [spark] viirya commented on a change in pull request #31107: [SPARK-34052][SQL] A cached view should become invalid after the source table is dropped

2021-01-14 Thread GitBox


viirya commented on a change in pull request #31107:
URL: https://github.com/apache/spark/pull/31107#discussion_r557932490



##
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/v2Commands.scala
##
@@ -833,6 +833,7 @@ case class CacheTable(
 case class CacheTableAsSelect(
 tempViewName: String,
 plan: LogicalPlan,
+originalText: Option[String],

Review comment:
   Is there any source of `CacheTableAsSelect` other than `AstBuilder`? 
Don't we always have this `originalText`?








[GitHub] [spark] SparkQA removed a comment on pull request #31186: [SPARK-33790][CORE][3.1] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2021-01-14 Thread GitBox


SparkQA removed a comment on pull request #31186:
URL: https://github.com/apache/spark/pull/31186#issuecomment-760641870


   **[Test build #134075 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134075/testReport)**
 for PR 31186 at commit 
[`41c2277`](https://github.com/apache/spark/commit/41c22773f7b4ffe7805bd443f35940013b976146).






[GitHub] [spark] SparkQA commented on pull request #31186: [SPARK-33790][CORE][3.1] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2021-01-14 Thread GitBox


SparkQA commented on pull request #31186:
URL: https://github.com/apache/spark/pull/31186#issuecomment-760701574


   **[Test build #134075 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134075/testReport)**
 for PR 31186 at commit 
[`41c2277`](https://github.com/apache/spark/commit/41c22773f7b4ffe7805bd443f35940013b976146).
* This patch **fails PySpark unit tests**.
* This patch merges cleanly.
* This patch adds no public classes.






[GitHub] [spark] SparkQA removed a comment on pull request #29737: [SPARK-32864][SQL] Support ORC forced positional evolution

2021-01-14 Thread GitBox


SparkQA removed a comment on pull request #29737:
URL: https://github.com/apache/spark/pull/29737#issuecomment-760607924


   **[Test build #134070 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134070/testReport)**
 for PR 29737 at commit 
[`51f503c`](https://github.com/apache/spark/commit/51f503c5738a714d6ea77467ac8f7dfba231d989).






[GitHub] [spark] SparkQA commented on pull request #29737: [SPARK-32864][SQL] Support ORC forced positional evolution

2021-01-14 Thread GitBox


SparkQA commented on pull request #29737:
URL: https://github.com/apache/spark/pull/29737#issuecomment-760700894


   **[Test build #134070 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134070/testReport)**
 for PR 29737 at commit 
[`51f503c`](https://github.com/apache/spark/commit/51f503c5738a714d6ea77467ac8f7dfba231d989).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.






[GitHub] [spark] SparkQA commented on pull request #31160: [SPARK-34080][ML][PYTHON] Add UnivariateFeatureSelector

2021-01-14 Thread GitBox


SparkQA commented on pull request #31160:
URL: https://github.com/apache/spark/pull/31160#issuecomment-760699987


   Kubernetes integration test status success
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/38668/
   






[GitHub] [spark] cloud-fan commented on pull request #31193: [SPARK-34117][SQL] Disable LeftSemi/LeftAnti push down over Aggregate

2021-01-14 Thread GitBox


cloud-fan commented on pull request #31193:
URL: https://github.com/apache/spark/pull/31193#issuecomment-760699379


   Why do we need to disable it completely?






[GitHub] [spark] SparkQA commented on pull request #31192: [SPARK-34118][CORE][SQL][2.4] Replaces filter and check for emptiness with exists or forall

2021-01-14 Thread GitBox


SparkQA commented on pull request #31192:
URL: https://github.com/apache/spark/pull/31192#issuecomment-760698666


   Kubernetes integration test starting
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/38671/
   






[GitHub] [spark] SparkQA commented on pull request #31187: [SPARK-33790][CORE][3.0] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2021-01-14 Thread GitBox


SparkQA commented on pull request #31187:
URL: https://github.com/apache/spark/pull/31187#issuecomment-760698590


   Kubernetes integration test status success
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/38667/
   






[GitHub] [spark] SparkQA commented on pull request #31188: [SPARK-34118][CORE][SQL][3.1] Replaces filter and check for emptiness with exists or forall

2021-01-14 Thread GitBox


SparkQA commented on pull request #31188:
URL: https://github.com/apache/spark/pull/31188#issuecomment-760697233


   Kubernetes integration test status success
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/38666/
   






[GitHub] [spark] SparkQA commented on pull request #31185: [SPARK-34111][BUILD][test-maven] Deconflict the jars jakarta.servlet-api-4.0.3.jar and javax.servlet-api-3.1.0.jar

2021-01-14 Thread GitBox


SparkQA commented on pull request #31185:
URL: https://github.com/apache/spark/pull/31185#issuecomment-760695785


   **[Test build #134090 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134090/testReport)**
 for PR 31185 at commit 
[`f8f12a0`](https://github.com/apache/spark/commit/f8f12a0734ac2708d41ea68a8721b85541c6d4ca).






[GitHub] [spark] yaooqinn commented on a change in pull request #30508: [SPARK-33618][CORE] Use hadoop-client instead of hadoop-client-api to make hadoop-aws work

2021-01-14 Thread GitBox


yaooqinn commented on a change in pull request #30508:
URL: https://github.com/apache/spark/pull/30508#discussion_r557919211



##
File path: dev/deps/spark-deps-hadoop-3.2-hive-2.3
##
@@ -116,14 +142,30 @@ jline/2.14.6//jline-2.14.6.jar
 joda-time/2.10.5//joda-time-2.10.5.jar
 jodd-core/3.5.2//jodd-core-3.5.2.jar
 jpam/1.1//jpam-1.1.jar
+json-smart/2.3//json-smart-2.3.jar
 json/1.8//json-1.8.jar
 json4s-ast_2.12/3.7.0-M5//json4s-ast_2.12-3.7.0-M5.jar
 json4s-core_2.12/3.7.0-M5//json4s-core_2.12-3.7.0-M5.jar
 json4s-jackson_2.12/3.7.0-M5//json4s-jackson_2.12-3.7.0-M5.jar
 json4s-scalap_2.12/3.7.0-M5//json4s-scalap_2.12-3.7.0-M5.jar
+jsp-api/2.1//jsp-api-2.1.jar
 jsr305/3.0.0//jsr305-3.0.0.jar
 jta/1.1//jta-1.1.jar
 jul-to-slf4j/1.7.30//jul-to-slf4j-1.7.30.jar
+kerb-admin/1.0.1//kerb-admin-1.0.1.jar
+kerb-client/1.0.1//kerb-client-1.0.1.jar
+kerb-common/1.0.1//kerb-common-1.0.1.jar
+kerb-core/1.0.1//kerb-core-1.0.1.jar
+kerb-crypto/1.0.1//kerb-crypto-1.0.1.jar
+kerb-identity/1.0.1//kerb-identity-1.0.1.jar
+kerb-server/1.0.1//kerb-server-1.0.1.jar
+kerb-simplekdc/1.0.1//kerb-simplekdc-1.0.1.jar

Review comment:
   It looks like the original PR does not handle any transitive artifact 
exclusions at all 😸








[GitHub] [spark] SparkQA commented on pull request #31189: [SPARK-34122][SQL] Remove duplicated branches in CaseWhen

2021-01-14 Thread GitBox


SparkQA commented on pull request #31189:
URL: https://github.com/apache/spark/pull/31189#issuecomment-760694192


   Kubernetes integration test status success
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/38665/
   






[GitHub] [spark] ulysses-you commented on pull request #30974: [SPARK-33939][SQL] Make Column.named UnresolvedExtractValue use UnresolvedAlias to assign name

2021-01-14 Thread GitBox


ulysses-you commented on pull request #30974:
URL: https://github.com/apache/spark/pull/30974#issuecomment-760693554


   It seems there is another issue: `Literal(null, NullType)` will be converted 
to another data type during analysis, so `Null` becomes `CAST(Null AS xxx)`. 
   
https://github.com/apache/spark/blob/00d43b1f829fb5f79f0355afbbacc804162648e5/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/literals.scala#L390-L392
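   
   A minimal sketch of the effect, as a toy model in plain Scala. The 
`DataType`, `NullType`, `IntegerType` and `Literal` definitions below are 
simplified stand-ins written for illustration, not Catalyst's real classes, 
and the `sql` rendering is an assumption based on the lines linked above. It 
shows how retyping a null literal changes the SQL text it renders to, and 
hence any name derived from that text.

```scala
// Toy stand-ins for illustration; these are NOT the Catalyst classes.
sealed trait DataType { def sql: String }
case object NullType extends DataType { val sql: String = "NULL" }
case object IntegerType extends DataType { val sql: String = "INT" }

final case class Literal(value: Any, dataType: DataType) {
  // Assumed rendering rule: an untyped null prints as NULL,
  // while a typed null prints as an explicit cast.
  def sql: String = (value, dataType) match {
    case (null, NullType) => "NULL"
    case (null, other)    => s"CAST(NULL AS ${other.sql})"
    case (v, _)           => v.toString
  }
}

object NullLiteralDemo {
  // What the user wrote: an untyped null literal.
  val before: Literal = Literal(null, NullType)
  // What it may look like once analysis has assigned a concrete type.
  val after: Literal = before.copy(dataType = IntegerType)

  val nameBefore: String = before.sql
  val nameAfter: String = after.sql
}
```

   Under these assumptions `nameBefore` is `NULL` and `nameAfter` is 
`CAST(NULL AS INT)`, so a name computed after analysis differs from one 
computed before it.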
   






[GitHub] [spark] HyukjinKwon commented on pull request #31185: [SPARK-34111][BUILD][test-maven] Deconflict the jars jakarta.servlet-api-4.0.3.jar and javax.servlet-api-3.1.0.jar

2021-01-14 Thread GitBox


HyukjinKwon commented on pull request #31185:
URL: https://github.com/apache/spark/pull/31185#issuecomment-760692189


   retest this please






[GitHub] [spark] HyukjinKwon commented on pull request #30746: [SPARK-33766][BUILD] Upgrade Jackson to 2.11.4

2021-01-14 Thread GitBox


HyukjinKwon commented on pull request #30746:
URL: https://github.com/apache/spark/pull/30746#issuecomment-760691842


   Yes, we should. Not sure if we'll have more 2.4.x releases though: 
https://spark.apache.org/versioning-policy.html
   
   > The last minor release within a major a release will typically be 
maintained for longer as an “LTS” release. For example, 2.4.0 was released in 
November 2018, but will likely see releases for more than 18 months, beyond May 
2020.
   
   






[GitHub] [spark] AmplabJenkins removed a comment on pull request #31187: [SPARK-33790][CORE][3.0] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2021-01-14 Thread GitBox


AmplabJenkins removed a comment on pull request #31187:
URL: https://github.com/apache/spark/pull/31187#issuecomment-760690502


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134081/
   






[GitHub] [spark] AmplabJenkins removed a comment on pull request #31190: [SPARK-34118][CORE][SQL][3.0] Replaces filter and check for emptiness with exists or forall

2021-01-14 Thread GitBox


AmplabJenkins removed a comment on pull request #31190:
URL: https://github.com/apache/spark/pull/31190#issuecomment-760690500










[GitHub] [spark] AmplabJenkins commented on pull request #31187: [SPARK-33790][CORE][3.0] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2021-01-14 Thread GitBox


AmplabJenkins commented on pull request #31187:
URL: https://github.com/apache/spark/pull/31187#issuecomment-760690502


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134081/
   






[GitHub] [spark] AmplabJenkins commented on pull request #31193: [SPARK-34117][SQL] Disable LeftSemi/LeftAnti push down over Aggregate

2021-01-14 Thread GitBox


AmplabJenkins commented on pull request #31193:
URL: https://github.com/apache/spark/pull/31193#issuecomment-760690501


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134085/
   






[GitHub] [spark] AmplabJenkins commented on pull request #31190: [SPARK-34118][CORE][SQL][3.0] Replaces filter and check for emptiness with exists or forall

2021-01-14 Thread GitBox


AmplabJenkins commented on pull request #31190:
URL: https://github.com/apache/spark/pull/31190#issuecomment-760690500










[GitHub] [spark] HyukjinKwon commented on a change in pull request #30508: [SPARK-33618][CORE] Use hadoop-client instead of hadoop-client-api to make hadoop-aws work

2021-01-14 Thread GitBox


HyukjinKwon commented on a change in pull request #30508:
URL: https://github.com/apache/spark/pull/30508#discussion_r557909634



##
File path: dev/deps/spark-deps-hadoop-3.2-hive-2.3
##
@@ -116,14 +142,30 @@ jline/2.14.6//jline-2.14.6.jar
 joda-time/2.10.5//joda-time-2.10.5.jar
 jodd-core/3.5.2//jodd-core-3.5.2.jar
 jpam/1.1//jpam-1.1.jar
+json-smart/2.3//json-smart-2.3.jar
 json/1.8//json-1.8.jar
 json4s-ast_2.12/3.7.0-M5//json4s-ast_2.12-3.7.0-M5.jar
 json4s-core_2.12/3.7.0-M5//json4s-core_2.12-3.7.0-M5.jar
 json4s-jackson_2.12/3.7.0-M5//json4s-jackson_2.12-3.7.0-M5.jar
 json4s-scalap_2.12/3.7.0-M5//json4s-scalap_2.12-3.7.0-M5.jar
+jsp-api/2.1//jsp-api-2.1.jar
 jsr305/3.0.0//jsr305-3.0.0.jar
 jta/1.1//jta-1.1.jar
 jul-to-slf4j/1.7.30//jul-to-slf4j-1.7.30.jar
+kerb-admin/1.0.1//kerb-admin-1.0.1.jar
+kerb-client/1.0.1//kerb-client-1.0.1.jar
+kerb-common/1.0.1//kerb-common-1.0.1.jar
+kerb-core/1.0.1//kerb-core-1.0.1.jar
+kerb-crypto/1.0.1//kerb-crypto-1.0.1.jar
+kerb-identity/1.0.1//kerb-identity-1.0.1.jar
+kerb-server/1.0.1//kerb-server-1.0.1.jar
+kerb-simplekdc/1.0.1//kerb-simplekdc-1.0.1.jar

Review comment:
   This is actually a revert. It was added in 
https://github.com/apache/spark/commit/ce7ba2e98e0a3b038e881c271b5905058c43155b#diff-e45e1eee8dcfd7eaf8a013cec02b67806da3edeabe0f195ac6b4402f67d4b6dcR146








[GitHub] [spark] SparkQA commented on pull request #31079: [SPARK-34037][SQL] ResolveAggregateFunctions pushes duplicated sort order into aggregate because of unnecessary casting

2021-01-14 Thread GitBox


SparkQA commented on pull request #31079:
URL: https://github.com/apache/spark/pull/31079#issuecomment-760690112


   **[Test build #134089 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134089/testReport)**
 for PR 31079 at commit 
[`e891f5e`](https://github.com/apache/spark/commit/e891f5e0640ae78491d76826b43ae6b2edd20e64).






[GitHub] [spark] SparkQA commented on pull request #31190: [SPARK-34118][CORE][SQL][3.0] Replaces filter and check for emptiness with exists or forall

2021-01-14 Thread GitBox


SparkQA commented on pull request #31190:
URL: https://github.com/apache/spark/pull/31190#issuecomment-760690020


   **[Test build #134087 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134087/testReport)** for PR 31190 at commit [`4ac192b`](https://github.com/apache/spark/commit/4ac192bd30d9bc0d54330789d9b3b0694203b8df).






[GitHub] [spark] SparkQA commented on pull request #31192: [SPARK-34118][CORE][SQL][2.4] Replaces filter and check for emptiness with exists or forall

2021-01-14 Thread GitBox


SparkQA commented on pull request #31192:
URL: https://github.com/apache/spark/pull/31192#issuecomment-760690017


   **[Test build #134086 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134086/testReport)** for PR 31192 at commit [`e309061`](https://github.com/apache/spark/commit/e309061820be1746a82914b3b45dc579b969e7b8).






[GitHub] [spark] SparkQA commented on pull request #31142: [SPARK-33346][CORE][SQL][MLLIB][DSTREAM][K8S] Change the never changed 'var' to 'val'

2021-01-14 Thread GitBox


SparkQA commented on pull request #31142:
URL: https://github.com/apache/spark/pull/31142#issuecomment-760690046


   **[Test build #134088 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134088/testReport)** for PR 31142 at commit [`c599a05`](https://github.com/apache/spark/commit/c599a054b0ae30ce75a359331ebc9bde366ceac2).






[GitHub] [spark] SparkQA commented on pull request #31187: [SPARK-33790][CORE][3.0] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2021-01-14 Thread GitBox


SparkQA commented on pull request #31187:
URL: https://github.com/apache/spark/pull/31187#issuecomment-760689350


   Kubernetes integration test starting
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/38667/
   






[GitHub] [spark] yaooqinn commented on pull request #31185: [SPARK-34111][BUILD][test-maven] Deconflict the jars jakarta.servlet-api-4.0.3.jar and javax.servlet-api-3.1.0.jar

2021-01-14 Thread GitBox


yaooqinn commented on pull request #31185:
URL: https://github.com/apache/spark/pull/31185#issuecomment-760689228


   ```
   - decommission workers ensure that shuffle output is regenerated even with shuffle service *** FAILED *** (20 seconds, 783 milliseconds)
   [info]   5 did not equal 4 Expected 4 tasks but got List(0:0:0:0-SUCCESS, 0:0:1:0-FAILED, 0:0:1:1-SUCCESS, 0:1:0:0-SUCCESS, 1:0:0:0-SUCCESS) (DecommissionWorkerSuite.scala:190)
   [info]   org.scalatest.exceptions.TestFailedException:
   [info]   at org.scalatest.Assertions.newAssertionFailedException(Assertions.scala:472)
   [info]   at org.scalatest.Assertions.newAssertionFailedException$(Assertions.scala:471)
   [info]   at org.scalatest.Assertions$.newAssertionFailedException(Assertions.scala:1231)
   [info]   at org.scalatest.Assertions$AssertionsHelper.macroAssert(Assertions.scala:1295)
   [info]   at org.apache.spark.deploy.DecommissionWorkerSuite.$anonfun$new$6(DecommissionWorkerSuite.scala:190)
   ```
   
   This seems to be just a flaky test and is not related to this change.






[GitHub] [spark] AmplabJenkins removed a comment on pull request #31053: [SPARK-32165][SQL] Ensure Spark only initiates SharedState once across SparkSessions

2021-01-14 Thread GitBox


AmplabJenkins removed a comment on pull request #31053:
URL: https://github.com/apache/spark/pull/31053#issuecomment-760688678


   
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/38669/
   






[GitHub] [spark] AmplabJenkins commented on pull request #31191: [SPARK-34123][Web UI] optimize spark history summary page loading

2021-01-14 Thread GitBox


AmplabJenkins commented on pull request #31191:
URL: https://github.com/apache/spark/pull/31191#issuecomment-760688769


   Can one of the admins verify this patch?






[GitHub] [spark] AmplabJenkins commented on pull request #31053: [SPARK-32165][SQL] Ensure Spark only initiates SharedState once across SparkSessions

2021-01-14 Thread GitBox


AmplabJenkins commented on pull request #31053:
URL: https://github.com/apache/spark/pull/31053#issuecomment-760688678


   
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/38669/
   






[GitHub] [spark] SparkQA commented on pull request #31053: [SPARK-32165][SQL] Ensure Spark only initiates SharedState once across SparkSessions

2021-01-14 Thread GitBox


SparkQA commented on pull request #31053:
URL: https://github.com/apache/spark/pull/31053#issuecomment-760688662


   Kubernetes integration test status failure
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/38669/
   






[GitHub] [spark] AmplabJenkins removed a comment on pull request #31185: [SPARK-34111][BUILD][test-maven] Deconflict the jars jakarta.servlet-api-4.0.3.jar and javax.servlet-api-3.1.0.jar

2021-01-14 Thread GitBox


AmplabJenkins removed a comment on pull request #31185:
URL: https://github.com/apache/spark/pull/31185#issuecomment-760688348


   
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/38662/
   






[GitHub] [spark] AmplabJenkins removed a comment on pull request #31160: [SPARK-34080][ML][PYTHON] Add UnivariateFeatureSelector

2021-01-14 Thread GitBox


AmplabJenkins removed a comment on pull request #31160:
URL: https://github.com/apache/spark/pull/31160#issuecomment-760688346


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134082/
   






[GitHub] [spark] AmplabJenkins removed a comment on pull request #31186: [SPARK-33790][CORE][3.1] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2021-01-14 Thread GitBox


AmplabJenkins removed a comment on pull request #31186:
URL: https://github.com/apache/spark/pull/31186#issuecomment-760688349


   
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/38661/
   






[GitHub] [spark] AmplabJenkins removed a comment on pull request #30974: [SPARK-33939][SQL] Make Column.named UnresolvedExtractValue use UnresolvedAlias to assign name

2021-01-14 Thread GitBox


AmplabJenkins removed a comment on pull request #30974:
URL: https://github.com/apache/spark/pull/30974#issuecomment-760688345


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134072/
   






[GitHub] [spark] AmplabJenkins removed a comment on pull request #31142: [SPARK-33346][CORE][SQL][MLLIB][DSTREAM][K8S] Change the never changed 'var' to 'val'

2021-01-14 Thread GitBox


AmplabJenkins removed a comment on pull request #31142:
URL: https://github.com/apache/spark/pull/31142#issuecomment-760688347


   
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/38663/
   






[GitHub] [spark] AmplabJenkins commented on pull request #31160: [SPARK-34080][ML][PYTHON] Add UnivariateFeatureSelector

2021-01-14 Thread GitBox


AmplabJenkins commented on pull request #31160:
URL: https://github.com/apache/spark/pull/31160#issuecomment-760688346


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134082/
   






[GitHub] [spark] AmplabJenkins commented on pull request #31142: [SPARK-33346][CORE][SQL][MLLIB][DSTREAM][K8S] Change the never changed 'var' to 'val'

2021-01-14 Thread GitBox


AmplabJenkins commented on pull request #31142:
URL: https://github.com/apache/spark/pull/31142#issuecomment-760688347


   
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/38663/
   






[GitHub] [spark] AmplabJenkins commented on pull request #31186: [SPARK-33790][CORE][3.1] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2021-01-14 Thread GitBox


AmplabJenkins commented on pull request #31186:
URL: https://github.com/apache/spark/pull/31186#issuecomment-760688349


   
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/38661/
   






[GitHub] [spark] AmplabJenkins commented on pull request #31185: [SPARK-34111][BUILD][test-maven] Deconflict the jars jakarta.servlet-api-4.0.3.jar and javax.servlet-api-3.1.0.jar

2021-01-14 Thread GitBox


AmplabJenkins commented on pull request #31185:
URL: https://github.com/apache/spark/pull/31185#issuecomment-760688348


   
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/38662/
   






[GitHub] [spark] AmplabJenkins commented on pull request #30974: [SPARK-33939][SQL] Make Column.named UnresolvedExtractValue use UnresolvedAlias to assign name

2021-01-14 Thread GitBox


AmplabJenkins commented on pull request #30974:
URL: https://github.com/apache/spark/pull/30974#issuecomment-760688345


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134072/
   






[GitHub] [spark] SparkQA commented on pull request #31053: [SPARK-32165][SQL] Ensure Spark only initiates SharedState once across SparkSessions

2021-01-14 Thread GitBox


SparkQA commented on pull request #31053:
URL: https://github.com/apache/spark/pull/31053#issuecomment-760686545


   Kubernetes integration test starting
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/38669/
   






[GitHub] [spark] LuciferYang commented on pull request #30746: [SPARK-33766][BUILD] Upgrade Jackson to 2.11.4

2021-01-14 Thread GitBox


LuciferYang commented on pull request #30746:
URL: https://github.com/apache/spark/pull/30746#issuecomment-760686077


   ```
   FasterXML jackson-databind 2.x before 2.9.10.8 mishandles the interaction between serialization gadgets and typing, related to oadd.org.apache.commons.dbcp.cpdsadapter.DriverAdapterCPDS.
   ```
   
   
[CVE-2020-36179](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-36179)
   
   I found that Spark 2.4 still uses Jackson 2.6.7:
   
   
https://github.com/apache/spark/blob/63e93a5c38a83669ccc58a5b45d5cff0b296fcc9/pom.xml#L161-L163
   
   Should we upgrade it, and if so, which version should we use in Spark 2.4?
   
   cc @wangyum @dongjoon-hyun @HyukjinKwon 






[GitHub] [spark] wangyum opened a new pull request #31193: [SPARK-34117][SQL] Disable LeftSemi/LeftAnti push down over Aggregate

2021-01-14 Thread GitBox


wangyum opened a new pull request #31193:
URL: https://github.com/apache/spark/pull/31193


   ### What changes were proposed in this pull request?
   
   This PR disables LeftSemi/LeftAnti push down over Aggregate.
   
   
   ### Why are the changes needed?
   
   LeftSemi/LeftAnti push down over Aggregate may hurt performance. For example:
   ```sql
   SELECT i_item_sk ss_item_sk
   FROM item,
     (SELECT DISTINCT
        iss.i_brand_id brand_id,
        iss.i_class_id class_id,
        iss.i_category_id category_id
      FROM store_sales, item iss, date_dim d1
      WHERE ss_item_sk = iss.i_item_sk
        AND ss_sold_date_sk = d1.d_date_sk
        AND d1.d_year BETWEEN 1999 AND 1999 + 2
      INTERSECT
      SELECT DISTINCT
        ics.i_brand_id,
        ics.i_class_id,
        ics.i_category_id
      FROM catalog_sales, item ics, date_dim d2
      WHERE cs_item_sk = ics.i_item_sk
        AND cs_sold_date_sk = d2.d_date_sk
        AND d2.d_year BETWEEN 1999 AND 1999 + 2
      INTERSECT
      SELECT DISTINCT
        iws.i_brand_id,
        iws.i_class_id,
        iws.i_category_id
      FROM web_sales, item iws, date_dim d3
      WHERE ws_item_sk = iws.i_item_sk
        AND ws_sold_date_sk = d3.d_date_sk
        AND d3.d_year BETWEEN 1999 AND 1999 + 2) x
   WHERE i_brand_id = brand_id
     AND i_class_id = class_id
     AND i_category_id = category_id;
   ```
   
   This query is rewritten from [q14b](https://github.com/apache/spark/blob/a78d6ce376edf2a8836e01f47b9dff5371058d4c/sql/core/src/test/resources/tpcds/q14b.sql#L2-L32).
   
   
   CBO enabled | CBO disabled
   -- | --
   The statistics are more accurate, so it will be converted to broadcast join, and then push down aggregate. | It will not push down aggregate.
   ![image](https://issues.apache.org/jira/secure/attachment/13018801/current.jpg) | ![image](https://issues.apache.org/jira/secure/attachment/13018802/disable_pushdown.jpg)
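
   To make the trade-off concrete, here is a minimal sketch in plain Scala collections, not Spark code. It assumes the semi join condition only compares the grouping columns, which is the case for an `INTERSECT` of `DISTINCT` projections like the one above. The names `SemiJoinPushdown`, `leftSemi`, `distinctThenSemi` and `semiThenDistinct` are hypothetical and only serve the illustration. Both orderings return the same rows; they differ in how many rows the join has to process, so which one is cheaper depends on data sizes and the chosen join strategy.

   ```scala
   object SemiJoinPushdown {
     // A (brand_id, class_id, category_id) triple, mirroring the columns in the query.
     type Key = (Int, Int, Int)

     // LEFT SEMI JOIN on equality of all columns:
     // keep each left row that has at least one match on the right.
     def leftSemi(left: Seq[Key], right: Seq[Key]): Seq[Key] = {
       val rightKeys = right.toSet
       left.filter(rightKeys.contains)
     }

     // Without push down: aggregate (DISTINCT) first, then semi join the reduced input.
     def distinctThenSemi(left: Seq[Key], right: Seq[Key]): Seq[Key] =
       leftSemi(left.distinct, right)

     // With push down: the semi join runs below the aggregate and sees every input row.
     def semiThenDistinct(left: Seq[Key], right: Seq[Key]): Seq[Key] =
       leftSemi(left, right).distinct
   }
   ```

   In this toy model the join in `semiThenDistinct` scans all of `left`, duplicates included, while `distinctThenSemi` joins only the deduplicated rows.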
   
   
   
   ### Does this PR introduce _any_ user-facing change?
   
   No.
   
   
   ### How was this patch tested?
   
   Unit test.
   






[GitHub] [spark] MaxGekk commented on pull request #31066: [SPARK-34027][SQL] Refresh cache in `ALTER TABLE .. RECOVER PARTITIONS`

2021-01-14 Thread GitBox


MaxGekk commented on pull request #31066:
URL: https://github.com/apache/spark/pull/31066#issuecomment-760685706


   @HyukjinKwon @cloud-fan Please take a look at this PR one more time.






[GitHub] [spark] SparkQA removed a comment on pull request #31160: [SPARK-34080][ML][PYTHON] Add UnivariateFeatureSelector

2021-01-14 Thread GitBox


SparkQA removed a comment on pull request #31160:
URL: https://github.com/apache/spark/pull/31160#issuecomment-760661019


   **[Test build #134082 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134082/testReport)** for PR 31160 at commit [`d8601e7`](https://github.com/apache/spark/commit/d8601e7cdf06904e7d5450e87292da8c5cb37036).






[GitHub] [spark] SparkQA commented on pull request #31160: [SPARK-34080][ML][PYTHON] Add UnivariateFeatureSelector

2021-01-14 Thread GitBox


SparkQA commented on pull request #31160:
URL: https://github.com/apache/spark/pull/31160#issuecomment-760685496


   Kubernetes integration test starting
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/38668/
   






[GitHub] [spark] SparkQA commented on pull request #31160: [SPARK-34080][ML][PYTHON] Add UnivariateFeatureSelector

2021-01-14 Thread GitBox


SparkQA commented on pull request #31160:
URL: https://github.com/apache/spark/pull/31160#issuecomment-760685337


   **[Test build #134082 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134082/testReport)** for PR 31160 at commit [`d8601e7`](https://github.com/apache/spark/commit/d8601e7cdf06904e7d5450e87292da8c5cb37036).
* This patch **fails PySpark unit tests**.
* This patch merges cleanly.
* This patch adds the following public classes _(experimental)_:
 * `class _UnivariateFeatureSelectorParams(HasFeaturesCol, HasOutputCol, HasLabelCol):`






[GitHub] [spark] SparkQA commented on pull request #31188: [SPARK-34118][CORE][SQL][3.1] Replaces filter and check for emptiness with exists or forall

2021-01-14 Thread GitBox


SparkQA commented on pull request #31188:
URL: https://github.com/apache/spark/pull/31188#issuecomment-760684233


   Kubernetes integration test starting
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/38666/
   






[GitHub] [spark] LuciferYang commented on a change in pull request #31190: [SPARK-34118][CORE][SQL][3.0] Replaces filter and check for emptiness with exists or forall

2021-01-14 Thread GitBox


LuciferYang commented on a change in pull request #31190:
URL: https://github.com/apache/spark/pull/31190#discussion_r557896600



##
File path: 
sql/core/src/test/scala/org/apache/spark/sql/streaming/StreamingQueryListenerSuite.scala
##
@@ -48,8 +48,7 @@ class StreamingQueryListenerSuite extends StreamTest with BeforeAndAfter {
 spark.streams.active.foreach(_.stop())
 assert(spark.streams.active.isEmpty)
 // Skip check default `StreamingQueryStatusListener` which is for streaming UI.
-assert(spark.streams.listListeners()
-  .filterNot(_.isInstanceOf[StreamingQueryStatusListener]).isEmpty)
+assert(spark.streams.listListeners().forall(_.isInstanceOf[StreamingQueryStatusListener]))

Review comment:
   Addressed in 4ac192b: added a newline.
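
   For reference, the hunk above relies on the identity `xs.filterNot(p).isEmpty == xs.forall(p)`, and the PR title also covers the companion rewrite `xs.filter(p).nonEmpty == xs.exists(p)`. Below is a minimal sketch of both; `Listener`, `StatusListener`, `CustomListener` and `ForallVsFilter` are hypothetical stand-ins, not Spark classes.

   ```scala
   // Hypothetical stand-ins for the listener types in the diff above.
   trait Listener
   final class StatusListener extends Listener // plays the role of StreamingQueryStatusListener
   final class CustomListener extends Listener // any user-registered listener

   object ForallVsFilter {
     // Original style: builds an intermediate collection, then checks it for emptiness.
     def onlyDefaultViaFilter(ls: Seq[Listener]): Boolean =
       ls.filterNot(_.isInstanceOf[StatusListener]).isEmpty

     // Rewritten style: no intermediate collection, stops at the first counterexample.
     def onlyDefaultViaForall(ls: Seq[Listener]): Boolean =
       ls.forall(_.isInstanceOf[StatusListener])

     // Companion rewrite: filter(p).nonEmpty becomes exists(p).
     def hasCustomViaFilter(ls: Seq[Listener]): Boolean =
       ls.filter(_.isInstanceOf[CustomListener]).nonEmpty

     def hasCustomViaExists(ls: Seq[Listener]): Boolean =
       ls.exists(_.isInstanceOf[CustomListener])
   }
   ```

   Note that `forall` is true for an empty collection, which matches `filterNot(p).isEmpty` on empty input, so the rewrite keeps the assertion's behavior when no listeners are registered.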








[GitHub] [spark] SparkQA commented on pull request #31189: [SPARK-34122][SQL] Remove duplicated branches in CaseWhen

2021-01-14 Thread GitBox


SparkQA commented on pull request #31189:
URL: https://github.com/apache/spark/pull/31189#issuecomment-760682298


   Kubernetes integration test starting
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/38665/
   






[GitHub] [spark] yaooqinn commented on a change in pull request #30508: [SPARK-33618][CORE] Use hadoop-client instead of hadoop-client-api to make hadoop-aws work

2021-01-14 Thread GitBox


yaooqinn commented on a change in pull request #30508:
URL: https://github.com/apache/spark/pull/30508#discussion_r557892419



##
File path: dev/deps/spark-deps-hadoop-3.2-hive-2.3
##
@@ -116,14 +142,30 @@ jline/2.14.6//jline-2.14.6.jar
 joda-time/2.10.5//joda-time-2.10.5.jar
 jodd-core/3.5.2//jodd-core-3.5.2.jar
 jpam/1.1//jpam-1.1.jar
+json-smart/2.3//json-smart-2.3.jar
 json/1.8//json-1.8.jar
 json4s-ast_2.12/3.7.0-M5//json4s-ast_2.12-3.7.0-M5.jar
 json4s-core_2.12/3.7.0-M5//json4s-core_2.12-3.7.0-M5.jar
 json4s-jackson_2.12/3.7.0-M5//json4s-jackson_2.12-3.7.0-M5.jar
 json4s-scalap_2.12/3.7.0-M5//json4s-scalap_2.12-3.7.0-M5.jar
+jsp-api/2.1//jsp-api-2.1.jar
 jsr305/3.0.0//jsr305-3.0.0.jar
 jta/1.1//jta-1.1.jar
 jul-to-slf4j/1.7.30//jul-to-slf4j-1.7.30.jar
+kerb-admin/1.0.1//kerb-admin-1.0.1.jar
+kerb-client/1.0.1//kerb-client-1.0.1.jar
+kerb-common/1.0.1//kerb-common-1.0.1.jar
+kerb-core/1.0.1//kerb-core-1.0.1.jar
+kerb-crypto/1.0.1//kerb-crypto-1.0.1.jar
+kerb-identity/1.0.1//kerb-identity-1.0.1.jar
+kerb-server/1.0.1//kerb-server-1.0.1.jar
+kerb-simplekdc/1.0.1//kerb-simplekdc-1.0.1.jar

Review comment:
   Just out of curiosity, does Spark ever play the role of a KDC at runtime?








[GitHub] [spark] LuciferYang commented on a change in pull request #31190: [SPARK-34118][CORE][SQL][3.0] Replaces filter and check for emptiness with exists or forall

2021-01-14 Thread GitBox


LuciferYang commented on a change in pull request #31190:
URL: https://github.com/apache/spark/pull/31190#discussion_r557892183



##
File path: 
sql/core/src/test/scala/org/apache/spark/sql/streaming/StreamingQueryListenerSuite.scala
##
@@ -48,8 +48,7 @@ class StreamingQueryListenerSuite extends StreamTest with BeforeAndAfter {
 spark.streams.active.foreach(_.stop())
 assert(spark.streams.active.isEmpty)
 // Skip check default `StreamingQueryStatusListener` which is for streaming UI.
-assert(spark.streams.listListeners()
-  .filterNot(_.isInstanceOf[StreamingQueryStatusListener]).isEmpty)
+assert(spark.streams.listListeners().forall(_.isInstanceOf[StreamingQueryStatusListener]))

Review comment:
   ok







