[GitHub] [spark] dongjoon-hyun commented on issue #24597: [SPARK-27698][SQL] Add new method for getting pushed down filters in Parquet file reader

2019-05-19 Thread GitBox
dongjoon-hyun commented on issue #24597: [SPARK-27698][SQL] Add new method for 
getting pushed down filters in Parquet file reader
URL: https://github.com/apache/spark/pull/24597#issuecomment-493848222
 
 
   Thank you for pinging me, @gengliangwang. Shall we wait for one day?
Currently, after SPARK-27699, a `HiveOrcFilterSuite` failure is reported in
the `Hadoop 3.2` profile. The fix is under testing and will be merged tomorrow.
@wangyum and @HyukjinKwon are working on that.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] dongjoon-hyun edited a comment on issue #24597: [SPARK-27698][SQL] Add new method for getting pushed down filters in Parquet file reader

2019-05-19 Thread GitBox
dongjoon-hyun edited a comment on issue #24597: [SPARK-27698][SQL] Add new 
method for getting pushed down filters in Parquet file reader
URL: https://github.com/apache/spark/pull/24597#issuecomment-493848222
 
 
   Thank you for pinging me, @gengliangwang. Shall we wait for one day?
Currently, after SPARK-27699, a `HiveOrcFilterSuite` failure is reported in
the `Hadoop 3.2` profile. The fix is under testing and will be merged tomorrow.
@wangyum and @HyukjinKwon are actively working on that.





[GitHub] [spark] SparkQA commented on issue #23992: [SPARK-27074][SQL] Hive 3.1 metastore support HiveClientImpl.runHive

2019-05-19 Thread GitBox
SparkQA commented on issue #23992: [SPARK-27074][SQL] Hive 3.1 metastore 
support HiveClientImpl.runHive
URL: https://github.com/apache/spark/pull/23992#issuecomment-493847713
 
 
   **[Test build #105550 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/105550/testReport)** for PR 23992 at commit [`7cf2875`](https://github.com/apache/spark/commit/7cf2875aa71094bf2db6f7d24991533a91c6ad29).





[GitHub] [spark] AmplabJenkins commented on issue #23992: [SPARK-27074][SQL] Hive 3.1 metastore support HiveClientImpl.runHive

2019-05-19 Thread GitBox
AmplabJenkins commented on issue #23992: [SPARK-27074][SQL] Hive 3.1 metastore 
support HiveClientImpl.runHive
URL: https://github.com/apache/spark/pull/23992#issuecomment-493847373
 
 
   Merged build finished. Test PASSed.





[GitHub] [spark] AmplabJenkins commented on issue #23992: [SPARK-27074][SQL] Hive 3.1 metastore support HiveClientImpl.runHive

2019-05-19 Thread GitBox
AmplabJenkins commented on issue #23992: [SPARK-27074][SQL] Hive 3.1 metastore 
support HiveClientImpl.runHive
URL: https://github.com/apache/spark/pull/23992#issuecomment-493847375
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/10804/
   Test PASSed.





[GitHub] [spark] AmplabJenkins removed a comment on issue #23992: [SPARK-27074][SQL] Hive 3.1 metastore support HiveClientImpl.runHive

2019-05-19 Thread GitBox
AmplabJenkins removed a comment on issue #23992: [SPARK-27074][SQL] Hive 3.1 
metastore support HiveClientImpl.runHive
URL: https://github.com/apache/spark/pull/23992#issuecomment-493847375
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/10804/
   Test PASSed.





[GitHub] [spark] AmplabJenkins removed a comment on issue #23992: [SPARK-27074][SQL] Hive 3.1 metastore support HiveClientImpl.runHive

2019-05-19 Thread GitBox
AmplabJenkins removed a comment on issue #23992: [SPARK-27074][SQL] Hive 3.1 
metastore support HiveClientImpl.runHive
URL: https://github.com/apache/spark/pull/23992#issuecomment-493847373
 
 
   Merged build finished. Test PASSed.





[GitHub] [spark] dongjoon-hyun commented on issue #23992: [SPARK-27074][SQL] Hive 3.1 metastore support HiveClientImpl.runHive

2019-05-19 Thread GitBox
dongjoon-hyun commented on issue #23992: [SPARK-27074][SQL] Hive 3.1 metastore 
support HiveClientImpl.runHive
URL: https://github.com/apache/spark/pull/23992#issuecomment-493847124
 
 
   Retest this please.





[GitHub] [spark] SparkQA commented on issue #24626: [SPARK-27747][SQL] add a logical plan link in the physical plan

2019-05-19 Thread GitBox
SparkQA commented on issue #24626: [SPARK-27747][SQL] add a logical plan link 
in the physical plan
URL: https://github.com/apache/spark/pull/24626#issuecomment-493846374
 
 
   **[Test build #105549 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/105549/testReport)** for PR 24626 at commit [`b380f1d`](https://github.com/apache/spark/commit/b380f1dc39f0e4c7701b1a46603b0b62d1f94f96).





[GitHub] [spark] AmplabJenkins commented on issue #24626: [SPARK-27747][SQL] add a logical plan link in the physical plan

2019-05-19 Thread GitBox
AmplabJenkins commented on issue #24626: [SPARK-27747][SQL] add a logical plan 
link in the physical plan
URL: https://github.com/apache/spark/pull/24626#issuecomment-493846055
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/10803/
   Test PASSed.





[GitHub] [spark] AmplabJenkins removed a comment on issue #24626: [SPARK-27747][SQL] add a logical plan link in the physical plan

2019-05-19 Thread GitBox
AmplabJenkins removed a comment on issue #24626: [SPARK-27747][SQL] add a 
logical plan link in the physical plan
URL: https://github.com/apache/spark/pull/24626#issuecomment-493846049
 
 
   Merged build finished. Test PASSed.





[GitHub] [spark] AmplabJenkins removed a comment on issue #24626: [SPARK-27747][SQL] add a logical plan link in the physical plan

2019-05-19 Thread GitBox
AmplabJenkins removed a comment on issue #24626: [SPARK-27747][SQL] add a 
logical plan link in the physical plan
URL: https://github.com/apache/spark/pull/24626#issuecomment-493846055
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/10803/
   Test PASSed.





[GitHub] [spark] maryannxue commented on issue #24626: [SPARK-27747][SQL] add a logical plan link in the physical plan

2019-05-19 Thread GitBox
maryannxue commented on issue #24626: [SPARK-27747][SQL] add a logical plan 
link in the physical plan
URL: https://github.com/apache/spark/pull/24626#issuecomment-493846041
 
 
   LGTM, pending Jenkins.





[GitHub] [spark] AmplabJenkins commented on issue #24626: [SPARK-27747][SQL] add a logical plan link in the physical plan

2019-05-19 Thread GitBox
AmplabJenkins commented on issue #24626: [SPARK-27747][SQL] add a logical plan 
link in the physical plan
URL: https://github.com/apache/spark/pull/24626#issuecomment-493846049
 
 
   Merged build finished. Test PASSed.





[GitHub] [spark] dongjoon-hyun commented on a change in pull request #24601: [SPARK-27702][K8S] Allow using some alternatives for service accounts

2019-05-19 Thread GitBox
dongjoon-hyun commented on a change in pull request #24601: [SPARK-27702][K8S] 
Allow using some alternatives for service accounts
URL: https://github.com/apache/spark/pull/24601#discussion_r285434004
 
 

 ##
 File path: resource-managers/kubernetes/core/src/main/scala/org/apache/spark/scheduler/cluster/k8s/KubernetesClusterManager.scala
 ##
 @@ -50,10 +50,21 @@ private[spark] class KubernetesClusterManager extends ExternalClusterManager wit
   require(sc.conf.get(KUBERNETES_DRIVER_POD_NAME).isDefined,
 "If the application is deployed using spark-submit in cluster mode, the driver pod name " +
   "must be provided.")
+  val serviceAccountToken =
+Some(new File(Config.KUBERNETES_SERVICE_ACCOUNT_TOKEN_PATH)).filter(_.exists)
+  val serviceAccountCaCrt =
+Some(new File(Config.KUBERNETES_SERVICE_ACCOUNT_CA_CRT_PATH)).filter(_.exists)
+  val apiServerUri = if (serviceAccountToken.nonEmpty || serviceAccountCaCrt.nonEmpty) {
+KUBERNETES_MASTER_INTERNAL_URL
+  } else {
+val masterUri = KubernetesUtils.parseMasterUrl(masterURL)
+logWarning(s"Service account has been disabled. Using master: $masterUri")
+masterUri
+  }
   (KUBERNETES_AUTH_DRIVER_MOUNTED_CONF_PREFIX,
-KUBERNETES_MASTER_INTERNAL_URL,
-Some(new File(Config.KUBERNETES_SERVICE_ACCOUNT_TOKEN_PATH)),
-Some(new File(Config.KUBERNETES_SERVICE_ACCOUNT_CA_CRT_PATH)))
+apiServerUri,
+serviceAccountToken,
+serviceAccountCaCrt)
 
 Review comment:
   @Udbhav30. Could you add a test case to your PR?
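
   For reference, the branch such a test case would need to cover can be sketched in isolation. This is only a minimal sketch, not the PR's code: `ApiServerSelection`, `existing`, `chooseApiServerUri`, and the `InternalUrl` value are hypothetical stand-ins for the names in the diff above, and the master URL is passed in already parsed.

```scala
import java.io.File

object ApiServerSelection {
  // Hypothetical stand-in for KUBERNETES_MASTER_INTERNAL_URL; the value is an assumption.
  val InternalUrl = "https://kubernetes.default.svc"

  // Keep the file only if it exists, mirroring Some(new File(path)).filter(_.exists).
  def existing(path: String): Option[File] =
    Some(new File(path)).filter(_.exists)

  // Use the in-cluster URL when either service account file is mounted;
  // otherwise fall back to the master URL supplied by the caller.
  def chooseApiServerUri(
      token: Option[File],
      caCrt: Option[File],
      parsedMasterUrl: String): String =
    if (token.nonEmpty || caCrt.nonEmpty) InternalUrl else parsedMasterUrl
}
```

   A test could then create a temporary file to stand in for the mounted token and check both branches, without needing a real cluster.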





[GitHub] [spark] SparkQA commented on issue #24626: [SPARK-27747][SQL] add a logical plan link in the physical plan

2019-05-19 Thread GitBox
SparkQA commented on issue #24626: [SPARK-27747][SQL] add a logical plan link 
in the physical plan
URL: https://github.com/apache/spark/pull/24626#issuecomment-493845094
 
 
   **[Test build #105548 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/105548/testReport)** for PR 24626 at commit [`b033f55`](https://github.com/apache/spark/commit/b033f55560c7f2fc26b898a21cad444d619fab92).





[GitHub] [spark] AmplabJenkins removed a comment on issue #24626: [SPARK-27747][SQL] add a logical plan link in the physical plan

2019-05-19 Thread GitBox
AmplabJenkins removed a comment on issue #24626: [SPARK-27747][SQL] add a 
logical plan link in the physical plan
URL: https://github.com/apache/spark/pull/24626#issuecomment-493844771
 
 
   Merged build finished. Test PASSed.





[GitHub] [spark] AmplabJenkins commented on issue #24626: [SPARK-27747][SQL] add a logical plan link in the physical plan

2019-05-19 Thread GitBox
AmplabJenkins commented on issue #24626: [SPARK-27747][SQL] add a logical plan 
link in the physical plan
URL: https://github.com/apache/spark/pull/24626#issuecomment-493844775
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/10802/
   Test PASSed.





[GitHub] [spark] AmplabJenkins commented on issue #24626: [SPARK-27747][SQL] add a logical plan link in the physical plan

2019-05-19 Thread GitBox
AmplabJenkins commented on issue #24626: [SPARK-27747][SQL] add a logical plan 
link in the physical plan
URL: https://github.com/apache/spark/pull/24626#issuecomment-493844771
 
 
   Merged build finished. Test PASSed.





[GitHub] [spark] AmplabJenkins removed a comment on issue #24626: [SPARK-27747][SQL] add a logical plan link in the physical plan

2019-05-19 Thread GitBox
AmplabJenkins removed a comment on issue #24626: [SPARK-27747][SQL] add a 
logical plan link in the physical plan
URL: https://github.com/apache/spark/pull/24626#issuecomment-493844775
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/10802/
   Test PASSed.





[GitHub] [spark] dongjoon-hyun closed pull request #24596: [SPARK-27694][SQL] Support auto-updating table statistics for data source CTAS command

2019-05-19 Thread GitBox
dongjoon-hyun closed pull request #24596: [SPARK-27694][SQL] Support 
auto-updating table statistics for data source CTAS command
URL: https://github.com/apache/spark/pull/24596
 
 
   





[GitHub] [spark] SparkQA commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
SparkQA commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 
'hive-thriftserver' in modules to test for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493843871
 
 
   **[Test build #105547 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/105547/testReport)** for PR 24644 at commit [`92abff8`](https://github.com/apache/spark/commit/92abff88ea626812c2b6e161122bc4d0766be4a7).





[GitHub] [spark] AmplabJenkins removed a comment on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
AmplabJenkins removed a comment on issue #24644: 
[SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test 
for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493843572
 
 
   Merged build finished. Test PASSed.





[GitHub] [spark] AmplabJenkins removed a comment on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
AmplabJenkins removed a comment on issue #24644: 
[SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test 
for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493843576
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/10801/
   Test PASSed.





[GitHub] [spark] AmplabJenkins commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
AmplabJenkins commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] 
Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493843576
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/10801/
   Test PASSed.





[GitHub] [spark] AmplabJenkins commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
AmplabJenkins commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] 
Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493843572
 
 
   Merged build finished. Test PASSed.





[GitHub] [spark] cloud-fan commented on a change in pull request #24233: [SPARK-26356][SQL] remove SaveMode from data source v2

2019-05-19 Thread GitBox
cloud-fan commented on a change in pull request #24233: [SPARK-26356][SQL] 
remove SaveMode from data source v2
URL: https://github.com/apache/spark/pull/24233#discussion_r285431892
 
 

 ##
 File path: sql/core/src/test/scala/org/apache/spark/sql/sources/v2/DataSourceV2Suite.scala
 ##
 @@ -219,14 +219,14 @@ class DataSourceV2Suite extends QueryTest with SharedSQLContext {
 assert(spark.read.format(cls.getName).option("path", path).load().collect().isEmpty)
 
 spark.range(10).select('id as 'i, -'id as 'j).write.format(cls.getName)
-  .option("path", path).save()
+  .option("path", path).mode("append").save()
 
 Review comment:
   Before my PR, we wrote the files twice: once without a save mode and once
with append mode.

   Now I have switched the order so that the second write doesn't specify a
save mode, which proves that the default mode is append.
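
   Why the order matters can be illustrated with a toy model. This is only a sketch under stated assumptions: `ToySink`, `Mode`, and `SaveModeOrderDemo` are hypothetical and are not Spark APIs. They model a sink whose default mode is append, as the comment above describes.

```scala
import scala.collection.mutable.ArrayBuffer

// Hypothetical in-memory sink; not a Spark API. It only models the
// "no mode specified means append" behavior described in the comment.
sealed trait Mode
case object Append extends Mode
case object Overwrite extends Mode

final class ToySink {
  private val rows = ArrayBuffer.empty[Int]

  // When no mode is given, fall back to Append.
  def write(data: Seq[Int], mode: Option[Mode] = None): Unit =
    mode.getOrElse(Append) match {
      case Append =>
        rows ++= data
      case Overwrite =>
        rows.clear()
        rows ++= data
    }

  def read: Seq[Int] = rows.toList
}

object SaveModeOrderDemo {
  def run(): Seq[Int] = {
    val sink = new ToySink
    sink.write(0 until 10, Some(Append)) // first write: explicit append
    sink.write(0 until 10)               // second write: no mode specified
    sink.read
  }
}
```

   With the mode-less write second, seeing both batches afterwards shows that the default appended. If the mode-less write came first, writing into an empty path would look the same under several modes, so it would prove nothing about the default.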





[GitHub] [spark] HyukjinKwon commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
HyukjinKwon commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 
'hive-thriftserver' in modules to test for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493843148
 
 
   retest this please





[GitHub] [spark] cloud-fan commented on a change in pull request #24594: [SPARK-27693][SQL] Add default catalog property

2019-05-19 Thread GitBox
cloud-fan commented on a change in pull request #24594: [SPARK-27693][SQL] Add 
default catalog property
URL: https://github.com/apache/spark/pull/24594#discussion_r285431382
 
 

 ##
 File path: sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala
 ##
 @@ -1767,6 +1767,11 @@ object SQLConf {
 "with String")
 .booleanConf
 .createWithDefault(false)
+
+  val DEFAULT_V2_CATALOG = buildConf("spark.sql.default.catalog")
+  .doc("Name of the default v2 catalog, used when an catalog is not identified in queries")
 
 Review comment:
   As we discussed in the DS v2 meeting, we should clearly point out the
places where this default catalog is used. View/Function resolution
definitely doesn't use this default catalog for now.





[GitHub] [spark] cloud-fan commented on a change in pull request #24594: [SPARK-27693][SQL] Add default catalog property

2019-05-19 Thread GitBox
cloud-fan commented on a change in pull request #24594: [SPARK-27693][SQL] Add 
default catalog property
URL: https://github.com/apache/spark/pull/24594#discussion_r285431212
 
 

 ##
 File path: sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala
 ##
 @@ -1767,6 +1767,11 @@ object SQLConf {
 "with String")
 .booleanConf
 .createWithDefault(false)
+
+  val DEFAULT_V2_CATALOG = buildConf("spark.sql.default.catalog")
 
 Review comment:
   nit: `spark.sql.catalog.default` is more consistent with other SQL config 
names: `spark.sql.componentName.featureName`.
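
   To make the naming pattern concrete, here is a small sketch that splits a key by that shape. `ConfigKeyShape` and `componentAndFeature` are hypothetical helpers, not part of `SQLConf`, and the sketch only handles four-segment keys.

```scala
object ConfigKeyShape {
  // Splits a key of the form spark.sql.<componentName>.<featureName>.
  // Keys with any other number of segments are not handled by this sketch.
  def componentAndFeature(key: String): Option[(String, String)] =
    key.split('.') match {
      case Array("spark", "sql", component, feature) => Some((component, feature))
      case _ => None
    }
}
```

   Read this way, `spark.sql.catalog.default` yields the component `catalog` with the feature `default`, whereas `spark.sql.default.catalog` would read as a component named `default`.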





[GitHub] [spark] viirya commented on a change in pull request #24637: [SPARK-27707][SQL] Prune unnecessary nested fields from Generate to address performance issue in explode

2019-05-19 Thread GitBox
viirya commented on a change in pull request #24637: [SPARK-27707][SQL] Prune 
unnecessary nested fields from Generate to address performance issue in explode
URL: https://github.com/apache/spark/pull/24637#discussion_r285430439
 
 

 ##
 File path: sql/core/src/test/scala/org/apache/spark/sql/execution/benchmark/MiscBenchmark.scala
 ##
 @@ -130,6 +130,18 @@ object MiscBenchmark extends SqlBasedBenchmark {
 df.selectExpr("*", "explode(arr) as arr_col")
   .select("col", "arr_col.*").count
   }
+
+  codegenBenchmark("generate big nested struct array", M) {
+import spark.implicits._
+val df = spark.sparkContext.parallelize(Seq(("1",
+  Array.fill(M)({
+val i = math.random
+(i.toString, (i + 1).toString, (i + 2).toString, (i + 3).toString)
+  }.toDF("col", "arr")
+  .selectExpr("col", "struct(col, arr) as st")
+  .selectExpr("col", "st.col as col1", "explode(st.arr) as arr_col")
+df.collect()
+  }
 
 Review comment:
   Yes, I posted the benchmark result for the added case. Because I think all
the benchmark results should be run on an r3.xlarge machine, I plan to post
them later.





[GitHub] [spark] viirya commented on a change in pull request #24637: [SPARK-27707][SQL] Prune unnecessary nested fields from Generate to address performance issue in explode

2019-05-19 Thread GitBox
viirya commented on a change in pull request #24637: [SPARK-27707][SQL] Prune 
unnecessary nested fields from Generate to address performance issue in explode
URL: https://github.com/apache/spark/pull/24637#discussion_r285430503
 
 

 ##
 File path: sql/core/src/test/scala/org/apache/spark/sql/execution/benchmark/MiscBenchmark.scala
 ##
 @@ -130,6 +130,18 @@ object MiscBenchmark extends SqlBasedBenchmark {
 df.selectExpr("*", "explode(arr) as arr_col")
   .select("col", "arr_col.*").count
   }
+
+  codegenBenchmark("generate big nested struct array", M) {
+    import spark.implicits._
+    val df = spark.sparkContext.parallelize(Seq(("1",
+      Array.fill(M)({
+        val i = math.random
+        (i.toString, (i + 1).toString, (i + 2).toString, (i + 3).toString)
+      })))).toDF("col", "arr")
+      .selectExpr("col", "struct(col, arr) as st")
+      .selectExpr("col", "st.col as col1", "explode(st.arr) as arr_col")
+    df.collect()
+  }
 
 Review comment:
   Will update `MiscBenchmark-results` soon.





[GitHub] [spark] AmplabJenkins removed a comment on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
AmplabJenkins removed a comment on issue #24644: 
[SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test 
for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493840306
 
 
   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/105545/
   Test FAILed.





[GitHub] [spark] SparkQA removed a comment on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
SparkQA removed a comment on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] 
Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493827618
 
 
   **[Test build #105545 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/105545/testReport)**
 for PR 24644 at commit 
[`92abff8`](https://github.com/apache/spark/commit/92abff88ea626812c2b6e161122bc4d0766be4a7).





[GitHub] [spark] AmplabJenkins removed a comment on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
AmplabJenkins removed a comment on issue #24644: 
[SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test 
for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493840303
 
 
   Merged build finished. Test FAILed.





[GitHub] [spark] dongjoon-hyun commented on a change in pull request #24637: [SPARK-27707][SQL] Prune unnecessary nested fields from Generate to address performance issue in explode

2019-05-19 Thread GitBox
dongjoon-hyun commented on a change in pull request #24637: [SPARK-27707][SQL] 
Prune unnecessary nested fields from Generate to address performance issue in 
explode
URL: https://github.com/apache/spark/pull/24637#discussion_r285429699
 
 

 ##
 File path: 
sql/core/src/test/scala/org/apache/spark/sql/execution/benchmark/MiscBenchmark.scala
 ##
 @@ -130,6 +130,18 @@ object MiscBenchmark extends SqlBasedBenchmark {
 df.selectExpr("*", "explode(arr) as arr_col")
   .select("col", "arr_col.*").count
   }
+
+  codegenBenchmark("generate big nested struct array", M) {
+    import spark.implicits._
+    val df = spark.sparkContext.parallelize(Seq(("1",
+      Array.fill(M)({
+        val i = math.random
+        (i.toString, (i + 1).toString, (i + 2).toString, (i + 3).toString)
+      })))).toDF("col", "arr")
+      .selectExpr("col", "struct(col, arr) as st")
+      .selectExpr("col", "st.col as col1", "explode(st.arr) as arr_col")
+    df.collect()
+  }
 
 Review comment:
   This one, `sql/core/benchmarks/MiscBenchmark-results.txt`~





[GitHub] [spark] dongjoon-hyun commented on a change in pull request #24637: [SPARK-27707][SQL] Prune unnecessary nested fields from Generate to address performance issue in explode

2019-05-19 Thread GitBox
dongjoon-hyun commented on a change in pull request #24637: [SPARK-27707][SQL] 
Prune unnecessary nested fields from Generate to address performance issue in 
explode
URL: https://github.com/apache/spark/pull/24637#discussion_r285429711
 
 

 ##
 File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/NestedColumnAliasing.scala
 ##
 @@ -32,9 +32,7 @@ object NestedColumnAliasing {
 
   def unapply(plan: LogicalPlan)
     : Option[(Map[GetStructField, Alias], Map[ExprId, Seq[Alias]])] = plan match {
-    case Project(projectList, child)
-        if SQLConf.get.nestedSchemaPruningEnabled && canProjectPushThrough(child) =>
-      getAliasSubMap(projectList)
+    case Project(projectList, child) => getAliasSubMap(projectList)
 
 Review comment:
   Thanks!





[GitHub] [spark] AmplabJenkins commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
AmplabJenkins commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] 
Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493840306
 
 
   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/105545/
   Test FAILed.





[GitHub] [spark] AmplabJenkins commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
AmplabJenkins commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] 
Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493840303
 
 
   Merged build finished. Test FAILed.





[GitHub] [spark] dongjoon-hyun commented on a change in pull request #24637: [SPARK-27707][SQL] Prune unnecessary nested fields from Generate to address performance issue in explode

2019-05-19 Thread GitBox
dongjoon-hyun commented on a change in pull request #24637: [SPARK-27707][SQL] 
Prune unnecessary nested fields from Generate to address performance issue in 
explode
URL: https://github.com/apache/spark/pull/24637#discussion_r285429576
 
 

 ##
 File path: 
sql/core/src/test/scala/org/apache/spark/sql/execution/benchmark/MiscBenchmark.scala
 ##
 @@ -130,6 +130,18 @@ object MiscBenchmark extends SqlBasedBenchmark {
 df.selectExpr("*", "explode(arr) as arr_col")
   .select("col", "arr_col.*").count
   }
+
+  codegenBenchmark("generate big nested struct array", M) {
+    import spark.implicits._
+    val df = spark.sparkContext.parallelize(Seq(("1",
+      Array.fill(M)({
+        val i = math.random
+        (i.toString, (i + 1).toString, (i + 2).toString, (i + 3).toString)
+      })))).toDF("col", "arr")
+      .selectExpr("col", "struct(col, arr) as st")
+      .selectExpr("col", "st.col as col1", "explode(st.arr) as arr_col")
+    df.collect()
+  }
 
 Review comment:
   Please add the benchmark result to this PR.





[GitHub] [spark] SparkQA commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
SparkQA commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 
'hive-thriftserver' in modules to test for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493840155
 
 
   **[Test build #105545 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/105545/testReport)**
 for PR 24644 at commit 
[`92abff8`](https://github.com/apache/spark/commit/92abff88ea626812c2b6e161122bc4d0766be4a7).
* This patch **fails Spark unit tests**.
* This patch merges cleanly.
* This patch adds no public classes.





[GitHub] [spark] viirya commented on a change in pull request #24637: [SPARK-27707][SQL] Prune unnecessary nested fields from Generate to address performance issue in explode

2019-05-19 Thread GitBox
viirya commented on a change in pull request #24637: [SPARK-27707][SQL] Prune 
unnecessary nested fields from Generate to address performance issue in explode
URL: https://github.com/apache/spark/pull/24637#discussion_r285429526
 
 

 ##
 File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/NestedColumnAliasing.scala
 ##
 @@ -32,9 +32,7 @@ object NestedColumnAliasing {
 
   def unapply(plan: LogicalPlan)
     : Option[(Map[GetStructField, Alias], Map[ExprId, Seq[Alias]])] = plan match {
-    case Project(projectList, child)
-        if SQLConf.get.nestedSchemaPruningEnabled && canProjectPushThrough(child) =>
-      getAliasSubMap(projectList)
+    case Project(projectList, child) => getAliasSubMap(projectList)
 
 Review comment:
   I see. In that case, I need to make a small change to prevent it. I will 
change it later.





[GitHub] [spark] dongjoon-hyun commented on a change in pull request #24637: [SPARK-27707][SQL] Prune unnecessary nested fields from Generate to address performance issue in explode

2019-05-19 Thread GitBox
dongjoon-hyun commented on a change in pull request #24637: [SPARK-27707][SQL] 
Prune unnecessary nested fields from Generate to address performance issue in 
explode
URL: https://github.com/apache/spark/pull/24637#discussion_r285429248
 
 

 ##
 File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/NestedColumnAliasing.scala
 ##
 @@ -32,9 +32,7 @@ object NestedColumnAliasing {
 
   def unapply(plan: LogicalPlan)
     : Option[(Map[GetStructField, Alias], Map[ExprId, Seq[Alias]])] = plan match {
-    case Project(projectList, child)
-        if SQLConf.get.nestedSchemaPruningEnabled && canProjectPushThrough(child) =>
-      getAliasSubMap(projectList)
+    case Project(projectList, child) => getAliasSubMap(projectList)
 
 Review comment:
   @viirya . Sorry, but this is a regression for all the existing code paths. We 
should avoid invoking `getAliasSubMap`. 
https://github.com/apache/spark/pull/24637/files#diff-a636a87d8843eeccca90140be91d4fafR635
 doesn't prevent the `getAliasSubMap` invocation inside `unapply`, does it?
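
The concern can be illustrated with a minimal sketch, written in Python rather than Scala. All names here (`get_alias_sub_map`, `unapply_guarded`, `unapply_unguarded`, the tuple-based plan) are hypothetical stand-ins, not Spark APIs, and the sketch models only a single boolean flag rather than the real `SQLConf` check or `canProjectPushThrough`. It shows that when the guard lives inside the extractor, the costly helper is never called while the feature is off, whereas the unguarded version calls it for every `Project`.

```python
# Hypothetical stand-ins; none of these names are Spark APIs.
calls = {"count": 0}


def get_alias_sub_map(project_list):
    """Pretend-expensive analysis; counts how often it runs."""
    calls["count"] += 1
    return {name: "_gen_alias_%d" % i for i, name in enumerate(project_list)}


def unapply_guarded(plan, pruning_enabled):
    """Check the flag first, so disabled pruning never reaches the helper."""
    kind, project_list = plan
    if kind == "Project" and pruning_enabled:
        return get_alias_sub_map(project_list)
    return None


def unapply_unguarded(plan):
    """No flag check: every Project plan pays for the analysis."""
    kind, project_list = plan
    if kind == "Project":
        return get_alias_sub_map(project_list)
    return None


plan = ("Project", ["a.b", "a.c"])

guarded_result = unapply_guarded(plan, pruning_enabled=False)
calls_after_guarded = calls["count"]

unguarded_result = unapply_unguarded(plan)
calls_after_unguarded = calls["count"]

print(guarded_result, calls_after_guarded)
print(unguarded_result, calls_after_unguarded)
```

A check placed only at the call site does not help here, because the helper has already run by the time the extractor returns.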





[GitHub] [spark] jzhuge commented on issue #24594: [SPARK-27693][SQL] Add default catalog property

2019-05-19 Thread GitBox
jzhuge commented on issue #24594: [SPARK-27693][SQL] Add default catalog 
property
URL: https://github.com/apache/spark/pull/24594#issuecomment-493839634
 
 
   Thanks @dongjoon-hyun !





[GitHub] [spark] dongjoon-hyun commented on a change in pull request #24637: [SPARK-27707][SQL] Prune unnecessary nested fields from Generate to address performance issue in explode

2019-05-19 Thread GitBox
dongjoon-hyun commented on a change in pull request #24637: [SPARK-27707][SQL] 
Prune unnecessary nested fields from Generate to address performance issue in 
explode
URL: https://github.com/apache/spark/pull/24637#discussion_r285429248
 
 

 ##
 File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/NestedColumnAliasing.scala
 ##
 @@ -32,9 +32,7 @@ object NestedColumnAliasing {
 
   def unapply(plan: LogicalPlan)
     : Option[(Map[GetStructField, Alias], Map[ExprId, Seq[Alias]])] = plan match {
-    case Project(projectList, child)
-        if SQLConf.get.nestedSchemaPruningEnabled && canProjectPushThrough(child) =>
-      getAliasSubMap(projectList)
+    case Project(projectList, child) => getAliasSubMap(projectList)
 
 Review comment:
   @viirya . Sorry, but this is a regression for all the existing code paths. We 
should avoid invoking `getAliasSubMap`. 
https://github.com/apache/spark/pull/24637/files#diff-a636a87d8843eeccca90140be91d4fafR635
 doesn't prevent the `getAliasSubMap` invocation inside `unapply`.





[GitHub] [spark] dongjoon-hyun commented on issue #24637: [SPARK-27707][SQL] Prune unnecessary nested fields from Generate to address performance issue in explode

2019-05-19 Thread GitBox
dongjoon-hyun commented on issue #24637: [SPARK-27707][SQL] Prune unnecessary 
nested fields from Generate to address performance issue in explode
URL: https://github.com/apache/spark/pull/24637#issuecomment-493838695
 
 
   cc @dbtsai 





[GitHub] [spark] HyukjinKwon commented on a change in pull request #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
HyukjinKwon commented on a change in pull request #24644: 
[SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test 
for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#discussion_r285428395
 
 

 ##
 File path: dev/sparktestsupport/modules.py
 ##
 @@ -568,6 +558,15 @@ def __hash__(self):
 ]
 )
 
+# TODO: Skip hive-thriftserver module for hadoop-3.2. remove this once hadoop-3.2 support it
+if os.environ.get("AMPLAB_JENKINS"):
+    hadoop_version = os.environ.get("AMPLAB_JENKINS_BUILD_PROFILE", "hadoop2.7")
+else:
+    hadoop_version = os.environ.get("HADOOP_PROFILE", "hadoop2.7")
+if hadoop_version == "hadoop3.2":
+    print("[info] Skip unsupported module:", "hive-thriftserver")
+    all_modules = [m for m in all_modules if m.name != "hive-thriftserver"]
 
 Review comment:
   yea I just moved.
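
For reference, the selection logic in the hunk above can be exercised in isolation. The following is a minimal sketch, not the code in `modules.py`: the `Module` namedtuple and the `select_modules` function are hypothetical, and the environment is passed in as a plain dict (instead of reading `os.environ`) so that the behavior can be checked without touching the real environment.

```python
from collections import namedtuple

# Hypothetical stand-in for the Module class in dev/sparktestsupport/modules.py.
Module = namedtuple("Module", ["name"])


def select_modules(all_modules, environ):
    """Drop hive-thriftserver when the resolved build profile is hadoop3.2."""
    # On Jenkins the profile comes from AMPLAB_JENKINS_BUILD_PROFILE,
    # otherwise from HADOOP_PROFILE; both default to hadoop2.7.
    if environ.get("AMPLAB_JENKINS"):
        hadoop_version = environ.get("AMPLAB_JENKINS_BUILD_PROFILE", "hadoop2.7")
    else:
        hadoop_version = environ.get("HADOOP_PROFILE", "hadoop2.7")
    if hadoop_version == "hadoop3.2":
        return [m for m in all_modules if m.name != "hive-thriftserver"]
    return list(all_modules)


modules = [Module("core"), Module("hive"), Module("hive-thriftserver")]

names_hadoop32 = [
    m.name for m in select_modules(modules, {"HADOOP_PROFILE": "hadoop3.2"})
]
names_jenkins32 = [
    m.name
    for m in select_modules(
        modules,
        {"AMPLAB_JENKINS": "1", "AMPLAB_JENKINS_BUILD_PROFILE": "hadoop3.2"},
    )
]
names_default = [m.name for m in select_modules(modules, {})]

print(names_hadoop32)
print(names_jenkins32)
print(names_default)
```

Note that when `AMPLAB_JENKINS` is set, `HADOOP_PROFILE` is ignored in this sketch, which mirrors the branch structure of the hunk.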





[GitHub] [spark] dongjoon-hyun commented on issue #24631: [MINOR][CORE] Avoid hardcoded configs

2019-05-19 Thread GitBox
dongjoon-hyun commented on issue #24631: [MINOR][CORE] Avoid hardcoded configs
URL: https://github.com/apache/spark/pull/24631#issuecomment-493838177
 
 
   @wenxuanguan . Thank you for improving the PR. This PR now seems to need a 
JIRA issue because it touches many files. Could you file an Apache Spark JIRA 
issue? Please enumerate in the JIRA the configs that you replaced. Also, 
please use that JIRA ID instead of `[MINOR]`. 
   





[GitHub] [spark] cxzl25 commented on a change in pull request #24497: [SPARK-27630][CORE]Stage retry causes totalRunningTasks calculation to be negative

2019-05-19 Thread GitBox
cxzl25 commented on a change in pull request #24497: [SPARK-27630][CORE]Stage 
retry causes totalRunningTasks calculation to be negative
URL: https://github.com/apache/spark/pull/24497#discussion_r285428024
 
 

 ##
 File path: core/src/main/scala/org/apache/spark/ExecutorAllocationManager.scala
 ##
 @@ -646,10 +646,9 @@ private[spark] class ExecutorAllocationManager(
   private[spark] class ExecutorAllocationListener extends SparkListener {
 
     private val stageIdToNumTasks = new mutable.HashMap[Int, Int]
-    // Number of running tasks per stage including speculative tasks.
-    // Should be 0 when no stages are active.
-    private val stageIdToNumRunningTask = new mutable.HashMap[Int, Int]
     private val stageIdToTaskIndices = new mutable.HashMap[Int, mutable.HashSet[Int]]
+    private val liveTaskIds = new mutable.HashSet[Long]
 
 Review comment:
   The task ID is not actually repeated in practice.
   The same task ID appears only in some test cases, where the tasks are of 
different types.
   ```scala
   createTaskInfo(0, 0, "executor-1")
   createTaskInfo(0, 0, "executor-2", true)
   ```
   
   
https://github.com/apache/spark/blob/cd4a284030e5142bdb405ff5b71735ac8cee2dde/core/src/test/scala/org/apache/spark/ExecutorAllocationManagerSuite.scala#L260-L267
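
As a minimal sketch of why a set of live task IDs is harder to drive negative than a running counter, assuming task IDs are unique as stated above: the classes, method names, and the event ordering below (a reset followed by a late task-end event) are hypothetical and chosen only for illustration. They are written in Python rather than Scala and are not taken from `ExecutorAllocationManager`.

```python
class CounterTracker:
    """Tracks running tasks with a plain integer (hypothetical)."""

    def __init__(self):
        self.running = 0

    def on_task_start(self, task_id):
        self.running += 1

    def on_task_end(self, task_id):
        # Decrements even if this task was already forgotten by a reset.
        self.running -= 1

    def on_reset(self):
        self.running = 0


class LiveSetTracker:
    """Tracks running tasks as a set of unique task IDs (hypothetical)."""

    def __init__(self):
        self.live_task_ids = set()

    @property
    def running(self):
        return len(self.live_task_ids)

    def on_task_start(self, task_id):
        self.live_task_ids.add(task_id)

    def on_task_end(self, task_id):
        # discard() is a no-op for IDs that are not in the set.
        self.live_task_ids.discard(task_id)

    def on_reset(self):
        self.live_task_ids.clear()


def replay(tracker):
    """Assumed ordering: start, reset, then a late end for the same task."""
    tracker.on_task_start(7)
    tracker.on_reset()
    tracker.on_task_end(7)
    return tracker.running


counter_result = replay(CounterTracker())
set_result = replay(LiveSetTracker())
print(counter_result, set_result)
```

The set-based count cannot go below zero because removing an unknown ID changes nothing, which is the property the sketch is meant to show.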
   
   
   





[GitHub] [spark] AmplabJenkins removed a comment on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
AmplabJenkins removed a comment on issue #24644: 
[SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test 
for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493837335
 
 
   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/105544/
   Test FAILed.





[GitHub] [spark] AmplabJenkins removed a comment on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
AmplabJenkins removed a comment on issue #24644: 
[SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test 
for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493837330
 
 
   Merged build finished. Test FAILed.





[GitHub] [spark] dongjoon-hyun commented on issue #24335: [SPARK-27425][SQL] Add count_if functions

2019-05-19 Thread GitBox
dongjoon-hyun commented on issue #24335: [SPARK-27425][SQL] Add count_if 
functions
URL: https://github.com/apache/spark/pull/24335#issuecomment-493837370
 
 
   Hi, @cryeo . Did you raise the questions with the community as @HyukjinKwon 
recommended? I'm just wondering whether a decision was made. If we are not going 
to proceed with this, we had better close this PR and the JIRA issue.





[GitHub] [spark] AmplabJenkins commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
AmplabJenkins commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] 
Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493837335
 
 
   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/105544/
   Test FAILed.





[GitHub] [spark] AmplabJenkins commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
AmplabJenkins commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] 
Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493837330
 
 
   Merged build finished. Test FAILed.





[GitHub] [spark] SparkQA commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
SparkQA commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 
'hive-thriftserver' in modules to test for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493837230
 
 
   **[Test build #105544 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/105544/testReport)**
 for PR 24644 at commit 
[`92abff8`](https://github.com/apache/spark/commit/92abff88ea626812c2b6e161122bc4d0766be4a7).
* This patch **fails Spark unit tests**.
* This patch merges cleanly.
* This patch adds no public classes.





[GitHub] [spark] SparkQA removed a comment on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
SparkQA removed a comment on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] 
Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493826550
 
 
   **[Test build #105544 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/105544/testReport)**
 for PR 24644 at commit 
[`92abff8`](https://github.com/apache/spark/commit/92abff88ea626812c2b6e161122bc4d0766be4a7).





[GitHub] [spark] dongjoon-hyun closed pull request #24594: [SPARK-27693][SQL] Add default catalog property

2019-05-19 Thread GitBox
dongjoon-hyun closed pull request #24594: [SPARK-27693][SQL] Add default 
catalog property
URL: https://github.com/apache/spark/pull/24594
 
 
   





[GitHub] [spark] SparkQA commented on issue #24594: [SPARK-27693][SQL] Add default catalog property

2019-05-19 Thread GitBox
SparkQA commented on issue #24594: [SPARK-27693][SQL] Add default catalog 
property
URL: https://github.com/apache/spark/pull/24594#issuecomment-493835187
 
 
   **[Test build #105546 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/105546/testReport)**
 for PR 24594 at commit 
[`03cf48b`](https://github.com/apache/spark/commit/03cf48ba8c9e13fb55cfe409b477b3a70a5a1760).





[GitHub] [spark] AmplabJenkins removed a comment on issue #24594: [SPARK-27693][SQL] Add default catalog property

2019-05-19 Thread GitBox
AmplabJenkins removed a comment on issue #24594: [SPARK-27693][SQL] Add default 
catalog property
URL: https://github.com/apache/spark/pull/24594#issuecomment-493834930
 
 
   Merged build finished. Test PASSed.





[GitHub] [spark] AmplabJenkins removed a comment on issue #24594: [SPARK-27693][SQL] Add default catalog property

2019-05-19 Thread GitBox
AmplabJenkins removed a comment on issue #24594: [SPARK-27693][SQL] Add default 
catalog property
URL: https://github.com/apache/spark/pull/24594#issuecomment-493834935
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/10800/
   Test PASSed.





[GitHub] [spark] AmplabJenkins commented on issue #24594: [SPARK-27693][SQL] Add default catalog property

2019-05-19 Thread GitBox
AmplabJenkins commented on issue #24594: [SPARK-27693][SQL] Add default catalog 
property
URL: https://github.com/apache/spark/pull/24594#issuecomment-493834935
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/10800/
   Test PASSed.





[GitHub] [spark] AmplabJenkins commented on issue #24594: [SPARK-27693][SQL] Add default catalog property

2019-05-19 Thread GitBox
AmplabJenkins commented on issue #24594: [SPARK-27693][SQL] Add default catalog 
property
URL: https://github.com/apache/spark/pull/24594#issuecomment-493834930
 
 
   Merged build finished. Test PASSed.





[GitHub] [spark] dongjoon-hyun commented on issue #24594: [SPARK-27693][SQL] Add default catalog property

2019-05-19 Thread GitBox
dongjoon-hyun commented on issue #24594: [SPARK-27693][SQL] Add default catalog 
property
URL: https://github.com/apache/spark/pull/24594#issuecomment-493834677
 
 
   Retest this please.





[GitHub] [spark] dongjoon-hyun commented on a change in pull request #24594: [SPARK-27693][SQL] Add default catalog property

2019-05-19 Thread GitBox
dongjoon-hyun commented on a change in pull request #24594: [SPARK-27693][SQL] 
Add default catalog property
URL: https://github.com/apache/spark/pull/24594#discussion_r285424373
 
 

 ##
 File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala
 ##
 @@ -1767,6 +1767,11 @@ object SQLConf {
 "with String")
 .booleanConf
 .createWithDefault(false)
+
+  val DEFAULT_V2_CATALOG = buildConf("spark.sql.default.catalog")
+  .doc("Name of the default v2 catalog, used when an catalog is not 
identified in queries")
 
 Review comment:
   nit. `an catalog` -> `a catalog`.





[GitHub] [spark] dongjoon-hyun closed pull request #24641: [SPARK-27610][FOLLOW-UP][YARN] Remove duplicate declaration of plugin maven-antrun-plugin

2019-05-19 Thread GitBox
dongjoon-hyun closed pull request #24641: [SPARK-27610][FOLLOW-UP][YARN] Remove 
duplicate declaration of plugin maven-antrun-plugin
URL: https://github.com/apache/spark/pull/24641
 
 
   





[GitHub] [spark] viirya commented on a change in pull request #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
viirya commented on a change in pull request #24644: 
[SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test 
for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#discussion_r285423349
 
 

 ##
 File path: dev/sparktestsupport/modules.py
 ##
 @@ -568,6 +558,15 @@ def __hash__(self):
 ]
 )
 
+# TODO: Skip hive-thriftserver module for hadoop-3.2. remove this once hadoop-3.2 support it
+if os.environ.get("AMPLAB_JENKINS"):
+    hadoop_version = os.environ.get("AMPLAB_JENKINS_BUILD_PROFILE", "hadoop2.7")
+else:
+    hadoop_version = os.environ.get("HADOOP_PROFILE", "hadoop2.7")
+if hadoop_version == "hadoop3.2":
+    print("[info] Skip unsupported module:", "hive-thriftserver")
+    all_modules = [m for m in all_modules if m.name != "hive-thriftserver"]
 
 Review comment:
   This just does the same thing as before, right?
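
For readers following the thread, the selection logic in the diff above can be sketched as a standalone function. This is a minimal illustration under stated assumptions, not the code in `dev/sparktestsupport/modules.py`: the `Module` namedtuple and the `select_modules` name are hypothetical, and the environment is passed in as a plain dict so the sketch runs without touching `os.environ`.

```python
from collections import namedtuple

# Hypothetical stand-in for the module objects defined in modules.py.
Module = namedtuple("Module", ["name"])


def select_modules(all_modules, environ):
    """Return the modules to test for the Hadoop profile found in `environ`."""
    # On the AMPLab Jenkins workers the profile is read from a different variable.
    if environ.get("AMPLAB_JENKINS"):
        hadoop_version = environ.get("AMPLAB_JENKINS_BUILD_PROFILE", "hadoop2.7")
    else:
        hadoop_version = environ.get("HADOOP_PROFILE", "hadoop2.7")
    # Only the hadoop3.2 profile drops the hive-thriftserver module.
    if hadoop_version == "hadoop3.2":
        print("[info] Skip unsupported module:", "hive-thriftserver")
        return [m for m in all_modules if m.name != "hive-thriftserver"]
    return list(all_modules)


modules = [Module("core"), Module("hive"), Module("hive-thriftserver")]
default_selection = select_modules(modules, {})
hadoop32_selection = select_modules(modules, {"HADOOP_PROFILE": "hadoop3.2"})
```

With no profile set, the default `hadoop2.7` applies and every module is kept; only an explicit `hadoop3.2` profile removes `hive-thriftserver`.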





[GitHub] [spark] HyukjinKwon commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
HyukjinKwon commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 
'hive-thriftserver' in modules to test for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493830628
 
 
   okie profiles look good.





[GitHub] [spark] dongjoon-hyun commented on a change in pull request #24221: [SPARK-27248][SQL] `refreshTable` should recreate cache with same cache name and storage level

2019-05-19 Thread GitBox
dongjoon-hyun commented on a change in pull request #24221: [SPARK-27248][SQL] 
`refreshTable` should recreate cache with same cache name and storage level
URL: https://github.com/apache/spark/pull/24221#discussion_r285422373
 
 

 ##
 File path: sql/core/src/test/scala/org/apache/spark/sql/CachedTableSuite.scala
 ##
 @@ -985,4 +976,99 @@ class CachedTableSuite extends QueryTest with 
SQLTestUtils with SharedSQLContext
 val queryStats3 = query().queryExecution.optimizedPlan.stats.attributeStats
 assert(queryStats3.map(_._1.name).toSet === Set("c0", "v1", "v2"))
   }
+
+  test("SPARK-27248 refreshTable should recreate cache with same cache name 
and storage level") {
+
+// This section tests when a table is cached with its qualified name but 
it is refreshed with
+// its unqualified name.
+withTempDatabase { db =>
+  withTempPath { path =>
+withTable(s"$db.cachedTable") {
+  // Create table 'cachedTable' in temp db for testing purpose.
+  spark.catalog.createTable(
+s"$db.cachedTable",
+"PARQUET",
+StructType(Array(StructField("key", StringType))),
+Map("LOCATION" -> path.toURI.toString))
+
+  withCache(s"$db.cachedTable") {
+
 
 Review comment:
   ditto. 





[GitHub] [spark] dongjoon-hyun commented on a change in pull request #24221: [SPARK-27248][SQL] `refreshTable` should recreate cache with same cache name and storage level

2019-05-19 Thread GitBox
dongjoon-hyun commented on a change in pull request #24221: [SPARK-27248][SQL] 
`refreshTable` should recreate cache with same cache name and storage level
URL: https://github.com/apache/spark/pull/24221#discussion_r285422109
 
 

 ##
 File path: 
sql/hive/src/test/scala/org/apache/spark/sql/hive/CachedTableSuite.scala
 ##
 @@ -49,16 +49,7 @@ class CachedTableSuite extends QueryTest with SQLTestUtils 
with TestHiveSingleto
 maybeBlock.nonEmpty
   }
 
-  // Blocking uncache table for tests
-  private def uncacheTable(tableName: String): Unit = {
-val tableIdent = 
spark.sessionState.sqlParser.parseTableIdentifier(tableName)
-val cascade = !spark.sessionState.catalog.isTemporaryTable(tableIdent)
-spark.sharedState.cacheManager.uncacheQuery(
-  spark,
-  spark.table(tableName).logicalPlan,
-  cascade = cascade,
-  blocking = true)
-  }
+
 
 Review comment:
   Remove the blank line at line 52. Removing the existing function is enough; 
we don't need to add another blank line here.





[GitHub] [spark] AmplabJenkins removed a comment on issue #24631: [MINOR][CORE] Avoid hardcoded configs

2019-05-19 Thread GitBox
AmplabJenkins removed a comment on issue #24631: [MINOR][CORE] Avoid hardcoded 
configs
URL: https://github.com/apache/spark/pull/24631#issuecomment-493830155
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/105540/
   Test PASSed.





[GitHub] [spark] dongjoon-hyun commented on a change in pull request #24221: [SPARK-27248][SQL] `refreshTable` should recreate cache with same cache name and storage level

2019-05-19 Thread GitBox
dongjoon-hyun commented on a change in pull request #24221: [SPARK-27248][SQL] 
`refreshTable` should recreate cache with same cache name and storage level
URL: https://github.com/apache/spark/pull/24221#discussion_r285422259
 
 

 ##
 File path: sql/core/src/test/scala/org/apache/spark/sql/test/SQLTestUtils.scala
 ##
 @@ -299,6 +299,27 @@ private[sql] trait SQLTestUtilsBase
 }
   }
 
+  /**
+   * Drops cache `cacheName` after calling `f`.
+   */
+  protected def withCache(cacheNames: String*)(f: => Unit): Unit = {
+try f finally {
+  cacheNames.foreach(uncacheTable)
+}
+  }
 
 Review comment:
   The above is required when we use `withCache("a", "b", "c")`.





[GitHub] [spark] AmplabJenkins removed a comment on issue #24631: [MINOR][CORE] Avoid hardcoded configs

2019-05-19 Thread GitBox
AmplabJenkins removed a comment on issue #24631: [MINOR][CORE] Avoid hardcoded 
configs
URL: https://github.com/apache/spark/pull/24631#issuecomment-493830153
 
 
   Merged build finished. Test PASSed.





[GitHub] [spark] dongjoon-hyun commented on a change in pull request #24221: [SPARK-27248][SQL] `refreshTable` should recreate cache with same cache name and storage level

2019-05-19 Thread GitBox
dongjoon-hyun commented on a change in pull request #24221: [SPARK-27248][SQL] 
`refreshTable` should recreate cache with same cache name and storage level
URL: https://github.com/apache/spark/pull/24221#discussion_r285422353
 
 

 ##
 File path: sql/core/src/test/scala/org/apache/spark/sql/CachedTableSuite.scala
 ##
 @@ -985,4 +976,99 @@ class CachedTableSuite extends QueryTest with 
SQLTestUtils with SharedSQLContext
 val queryStats3 = query().queryExecution.optimizedPlan.stats.attributeStats
 assert(queryStats3.map(_._1.name).toSet === Set("c0", "v1", "v2"))
   }
+
+  test("SPARK-27248 refreshTable should recreate cache with same cache name 
and storage level") {
+
 
 Review comment:
   Remove a blank line here.





[GitHub] [spark] AmplabJenkins commented on issue #24631: [MINOR][CORE] Avoid hardcoded configs

2019-05-19 Thread GitBox
AmplabJenkins commented on issue #24631: [MINOR][CORE] Avoid hardcoded configs
URL: https://github.com/apache/spark/pull/24631#issuecomment-493830153
 
 
   Merged build finished. Test PASSed.





[GitHub] [spark] AmplabJenkins commented on issue #24631: [MINOR][CORE] Avoid hardcoded configs

2019-05-19 Thread GitBox
AmplabJenkins commented on issue #24631: [MINOR][CORE] Avoid hardcoded configs
URL: https://github.com/apache/spark/pull/24631#issuecomment-493830155
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/105540/
   Test PASSed.





[GitHub] [spark] dongjoon-hyun commented on a change in pull request #24221: [SPARK-27248][SQL] `refreshTable` should recreate cache with same cache name and storage level

2019-05-19 Thread GitBox
dongjoon-hyun commented on a change in pull request #24221: [SPARK-27248][SQL] 
`refreshTable` should recreate cache with same cache name and storage level
URL: https://github.com/apache/spark/pull/24221#discussion_r285422138
 
 

 ##
 File path: 
sql/hive/src/test/scala/org/apache/spark/sql/hive/CachedTableSuite.scala
 ##
 @@ -361,4 +352,84 @@ class CachedTableSuite extends QueryTest with 
SQLTestUtils with TestHiveSingleto
   
assert(spark.sharedState.cacheManager.lookupCachedData(samePlan).isDefined)
 }
   }
+
+  test("SPARK-27248 refreshTable should recreate cache with same cache name 
and storage level") {
+
 
 Review comment:
   Please remove the blank line here.





[GitHub] [spark] dongjoon-hyun commented on a change in pull request #24221: [SPARK-27248][SQL] `refreshTable` should recreate cache with same cache name and storage level

2019-05-19 Thread GitBox
dongjoon-hyun commented on a change in pull request #24221: [SPARK-27248][SQL] 
`refreshTable` should recreate cache with same cache name and storage level
URL: https://github.com/apache/spark/pull/24221#discussion_r285422162
 
 

 ##
 File path: 
sql/hive/src/test/scala/org/apache/spark/sql/hive/CachedTableSuite.scala
 ##
 @@ -361,4 +352,84 @@ class CachedTableSuite extends QueryTest with 
SQLTestUtils with TestHiveSingleto
   
assert(spark.sharedState.cacheManager.lookupCachedData(samePlan).isDefined)
 }
   }
+
+  test("SPARK-27248 refreshTable should recreate cache with same cache name 
and storage level") {
+
+// This section tests when a table is cached with its qualified name but 
its is refreshed with
+// its unqualified name.
+withTempDatabase { db =>
+  withTable(s"$db.cachedTable") {
+withCache(s"$db.cachedTable") {
+
+  // Create table 'cachedTable' in default db for testing purpose.
+  sql(s"CREATE TABLE $db.cachedTable AS SELECT 1 AS key")
+
+  // Cache the table 'cachedTable' in temp db with qualified table 
name,
+  // and then check whether the table is cached with expected name
+  sql(s"CACHE TABLE $db.cachedTable")
+  assertCached(sql(s"select * from $db.cachedTable"), 
s"`$db`.`cachedTable`")
+  assert(spark.catalog.isCached(s"$db.cachedTable"),
+s"Table '$db.cachedTable' should be cached.")
+
+  // Refresh the table 'cachedTable' in temp db with qualified table 
name, and then check
+  // whether the table is still cached with the same name and storage 
level.
+  sql(s"REFRESH TABLE $db.cachedTable")
+  assertCached(sql(s"select * from $db.cachedTable"), 
s"`$db`.`cachedTable`")
+  assert(spark.catalog.isCached(s"$db.cachedTable"),
+s"Table '$db.cachedTable' should be cached after refreshing with 
its qualified name.")
+
+  // Change the active database to the temp db and refresh the table 
with unqualified
+  // table name, and then check whether the table is still cached with 
the same name and
+  // storage level.
+  // Without bug fix 'SPARK-27248', the recreated cache name will be 
changed to
+  // 'cachedTable', instead of '$db.cachedTable'
+  activateDatabase(db) {
+sql("REFRESH TABLE cachedTable")
+assertCached(sql("select * from cachedTable"), 
s"`$db`.`cachedTable`")
+assert(spark.catalog.isCached("cachedTable"),
+  s"Table '$db.cachedTable' should be cached after refreshing with 
its " +
+"unqualified name.")
+  }
+}
+  }
+}
+
+
+// This section tests when a table is cached with its unqualified name but 
it is refreshed
+// with its qualified name.
+withTempDatabase { db =>
+  withTable("cachedTable") {
+withCache("cachedTable") {
+
 
 Review comment:
   ditto.





[GitHub] [spark] dongjoon-hyun commented on a change in pull request #24221: [SPARK-27248][SQL] `refreshTable` should recreate cache with same cache name and storage level

2019-05-19 Thread GitBox
dongjoon-hyun commented on a change in pull request #24221: [SPARK-27248][SQL] 
`refreshTable` should recreate cache with same cache name and storage level
URL: https://github.com/apache/spark/pull/24221#discussion_r285422146
 
 

 ##
 File path: 
sql/hive/src/test/scala/org/apache/spark/sql/hive/CachedTableSuite.scala
 ##
 @@ -361,4 +352,84 @@ class CachedTableSuite extends QueryTest with 
SQLTestUtils with TestHiveSingleto
   
assert(spark.sharedState.cacheManager.lookupCachedData(samePlan).isDefined)
 }
   }
+
+  test("SPARK-27248 refreshTable should recreate cache with same cache name 
and storage level") {
+
+// This section tests when a table is cached with its qualified name but 
its is refreshed with
+// its unqualified name.
+withTempDatabase { db =>
+  withTable(s"$db.cachedTable") {
+withCache(s"$db.cachedTable") {
+
 
 Review comment:
   ditto.





[GitHub] [spark] dongjoon-hyun commented on a change in pull request #24221: [SPARK-27248][SQL] `refreshTable` should recreate cache with same cache name and storage level

2019-05-19 Thread GitBox
dongjoon-hyun commented on a change in pull request #24221: [SPARK-27248][SQL] 
`refreshTable` should recreate cache with same cache name and storage level
URL: https://github.com/apache/spark/pull/24221#discussion_r285422109
 
 

 ##
 File path: 
sql/hive/src/test/scala/org/apache/spark/sql/hive/CachedTableSuite.scala
 ##
 @@ -49,16 +49,7 @@ class CachedTableSuite extends QueryTest with SQLTestUtils 
with TestHiveSingleto
 maybeBlock.nonEmpty
   }
 
-  // Blocking uncache table for tests
-  private def uncacheTable(tableName: String): Unit = {
-val tableIdent = 
spark.sessionState.sqlParser.parseTableIdentifier(tableName)
-val cascade = !spark.sessionState.catalog.isTemporaryTable(tableIdent)
-spark.sharedState.cacheManager.uncacheQuery(
-  spark,
-  spark.table(tableName).logicalPlan,
-  cascade = cascade,
-  blocking = true)
-  }
+
 
 Review comment:
   Remove the blank line at line 52. Removing the existing one is enough; we 
don't need to add another blank line here.





[GitHub] [spark] dongjoon-hyun commented on a change in pull request #24221: [SPARK-27248][SQL] `refreshTable` should recreate cache with same cache name and storage level

2019-05-19 Thread GitBox
dongjoon-hyun commented on a change in pull request #24221: [SPARK-27248][SQL] 
`refreshTable` should recreate cache with same cache name and storage level
URL: https://github.com/apache/spark/pull/24221#discussion_r285422007
 
 

 ##
 File path: sql/core/src/test/scala/org/apache/spark/sql/test/SQLTestUtils.scala
 ##
 @@ -299,6 +299,27 @@ private[sql] trait SQLTestUtilsBase
 }
   }
 
+  /**
+   * Drops cache `cacheName` after calling `f`.
+   */
+  protected def withCache(cacheNames: String*)(f: => Unit): Unit = {
+try f finally {
+  cacheNames.foreach(uncacheTable)
+}
+  }
+
+
 
 Review comment:
   Remove line 311. One blank line will be enough.





[GitHub] [spark] SparkQA removed a comment on issue #24631: [MINOR][CORE] Avoid hardcoded configs

2019-05-19 Thread GitBox
SparkQA removed a comment on issue #24631: [MINOR][CORE] Avoid hardcoded configs
URL: https://github.com/apache/spark/pull/24631#issuecomment-493812040
 
 
   **[Test build #105540 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/105540/testReport)**
 for PR 24631 at commit 
[`ecef09f`](https://github.com/apache/spark/commit/ecef09fe8f901b4e1595faeca715894cbf62126c).





[GitHub] [spark] SparkQA commented on issue #24631: [MINOR][CORE] Avoid hardcoded configs

2019-05-19 Thread GitBox
SparkQA commented on issue #24631: [MINOR][CORE] Avoid hardcoded configs
URL: https://github.com/apache/spark/pull/24631#issuecomment-493829903
 
 
   **[Test build #105540 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/105540/testReport)**
 for PR 24631 at commit 
[`ecef09f`](https://github.com/apache/spark/commit/ecef09fe8f901b4e1595faeca715894cbf62126c).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.





[GitHub] [spark] dongjoon-hyun commented on a change in pull request #24221: [SPARK-27248][SQL] `refreshTable` should recreate cache with same cache name and storage level

2019-05-19 Thread GitBox
dongjoon-hyun commented on a change in pull request #24221: [SPARK-27248][SQL] 
`refreshTable` should recreate cache with same cache name and storage level
URL: https://github.com/apache/spark/pull/24221#discussion_r285421984
 
 

 ##
 File path: sql/core/src/test/scala/org/apache/spark/sql/test/SQLTestUtils.scala
 ##
 @@ -299,6 +299,27 @@ private[sql] trait SQLTestUtilsBase
 }
   }
 
+  /**
+   * Drops cache `cacheName` after calling `f`.
+   */
+  protected def withCache(cacheNames: String*)(f: => Unit): Unit = {
+try f finally {
+  cacheNames.foreach(uncacheTable)
+}
+  }
 
 Review comment:
   Could you update it like the following? Like the other `withXXX` functions, 
`withCache` should clean up multiple cache names in the `finally` clause.
   ```scala
     protected def withCache(cacheNames: String*)(f: => Unit): Unit = {
       try f finally {
   -      cacheNames.foreach(uncacheTable)
   +      cacheNames.foreach { cacheName =>
   +        try uncacheTable(cacheName) catch {
   +          case _: AnalysisException =>
   +        }
   +      }
       }
     }
   ```





[GitHub] [spark] SparkQA commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
SparkQA commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 
'hive-thriftserver' in modules to test for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493827618
 
 
   **[Test build #105545 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/105545/testReport)**
 for PR 24644 at commit 
[`92abff8`](https://github.com/apache/spark/commit/92abff88ea626812c2b6e161122bc4d0766be4a7).





[GitHub] [spark] AmplabJenkins removed a comment on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
AmplabJenkins removed a comment on issue #24644: 
[SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test 
for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493827310
 
 
   Merged build finished. Test PASSed.





[GitHub] [spark] AmplabJenkins removed a comment on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
AmplabJenkins removed a comment on issue #24644: 
[SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test 
for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493827311
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/10799/
   Test PASSed.





[GitHub] [spark] AmplabJenkins commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
AmplabJenkins commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] 
Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493827311
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/10799/
   Test PASSed.





[GitHub] [spark] AmplabJenkins commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
AmplabJenkins commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] 
Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493827310
 
 
   Merged build finished. Test PASSed.





[GitHub] [spark] HyukjinKwon commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
HyukjinKwon commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP] Exclude 
'hive-thriftserver' in modules to test for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493826857
 
 
   retest this please





[GitHub] [spark] HyukjinKwon commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP][test-hadoop3.2] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
HyukjinKwon commented on issue #24644: 
[SPARK-27402][INFRA][FOLLOW-UP][test-hadoop3.2] Exclude 'hive-thriftserver' in 
modules to test for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493826505
 
 
   retest this please





[GitHub] [spark] SparkQA commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP][test-hadoop3.2] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
SparkQA commented on issue #24644: 
[SPARK-27402][INFRA][FOLLOW-UP][test-hadoop3.2] Exclude 'hive-thriftserver' in 
modules to test for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493826550
 
 
   **[Test build #105544 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/105544/testReport)** for PR 24644 at commit [`92abff8`](https://github.com/apache/spark/commit/92abff88ea626812c2b6e161122bc4d0766be4a7).





[GitHub] [spark] HyukjinKwon commented on a change in pull request #24644: [SPARK-27402][INFRA][FOLLOW-UP][test-hadoop3.2] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
HyukjinKwon commented on a change in pull request #24644: 
[SPARK-27402][INFRA][FOLLOW-UP][test-hadoop3.2] Exclude 'hive-thriftserver' in 
modules to test for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#discussion_r285417372
 
 

 ##
 File path: dev/run-tests.py
 ##
 @@ -122,9 +123,15 @@ def determine_modules_to_test(changed_modules):
     # If we need to run all of the tests, then we should short-circuit and return 'root'
     if modules.root in modules_to_test:
         return [modules.root]
-    return toposort_flatten(
+    changed_modules = toposort_flatten(
         {m: set(m.dependencies).intersection(modules_to_test) for m in modules_to_test}, sort=True)
 
+    # TODO: Skip hive-thriftserver module for hadoop-3.2. remove this once hadoop-3.2 support it
 
 Review comment:
   To be clear, this fix is itself temporary, intended to keep the affected 
scope isolated and minimised. A proper fix would probably belong in `Module` 
in `modules.py` (removing this entire module conditionally).
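
   The approach discussed in this review comment (order the selected modules by dependency, then drop a module for one Hadoop profile) can be illustrated with a minimal sketch. It is not the code from `dev/run-tests.py`: it swaps in the standard-library `graphlib.TopologicalSorter` (Python 3.9+) for `toposort_flatten`, and the `DEPENDENCIES` graph, the `modules_to_test` helper, and the `"hadoop3.2"` / `"hadoop2.7"` profile keys are all hypothetical names chosen for illustration.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical dependency graph: module name -> modules it depends on.
DEPENDENCIES = {
    "sql": set(),
    "hive": {"sql"},
    "hive-thriftserver": {"hive"},
}

# Hypothetical per-profile exclusions, mirroring the temporary skip in the PR.
EXCLUDED_BY_PROFILE = {"hadoop3.2": {"hive-thriftserver"}}


def modules_to_test(selected, hadoop_profile):
    """Return the selected modules in dependency order, minus profile exclusions."""
    selected = set(selected)
    # Keep only the dependency edges that point at other selected modules.
    graph = {m: DEPENDENCIES[m] & selected for m in sorted(selected)}
    # static_order() yields each module after all of its dependencies.
    ordered = list(TopologicalSorter(graph).static_order())
    skipped = EXCLUDED_BY_PROFILE.get(hadoop_profile, set())
    return [m for m in ordered if m not in skipped]
```

   Keeping the exclusions in a single table means the temporary skip can be removed in one place once the profile supports the module, which is the spirit of the "fix it within `Module`" suggestion above.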





[GitHub] [spark] AmplabJenkins removed a comment on issue #24628: [SPARK-27749][SQL][test-hadoop3.2][test-maven] hadoop-3.2 support hive-thriftserver

2019-05-19 Thread GitBox
AmplabJenkins removed a comment on issue #24628: 
[SPARK-27749][SQL][test-hadoop3.2][test-maven] hadoop-3.2 support 
hive-thriftserver
URL: https://github.com/apache/spark/pull/24628#issuecomment-493826145
 
 
   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/105538/
   Test FAILed.





[GitHub] [spark] AmplabJenkins removed a comment on issue #24628: [SPARK-27749][SQL][test-hadoop3.2][test-maven] hadoop-3.2 support hive-thriftserver

2019-05-19 Thread GitBox
AmplabJenkins removed a comment on issue #24628: 
[SPARK-27749][SQL][test-hadoop3.2][test-maven] hadoop-3.2 support 
hive-thriftserver
URL: https://github.com/apache/spark/pull/24628#issuecomment-493826139
 
 
   Merged build finished. Test FAILed.





[GitHub] [spark] AmplabJenkins commented on issue #24628: [SPARK-27749][SQL][test-hadoop3.2][test-maven] hadoop-3.2 support hive-thriftserver

2019-05-19 Thread GitBox
AmplabJenkins commented on issue #24628: 
[SPARK-27749][SQL][test-hadoop3.2][test-maven] hadoop-3.2 support 
hive-thriftserver
URL: https://github.com/apache/spark/pull/24628#issuecomment-493826139
 
 
   Merged build finished. Test FAILed.





[GitHub] [spark] SparkQA removed a comment on issue #24628: [SPARK-27749][SQL][test-hadoop3.2][test-maven] hadoop-3.2 support hive-thriftserver

2019-05-19 Thread GitBox
SparkQA removed a comment on issue #24628: 
[SPARK-27749][SQL][test-hadoop3.2][test-maven] hadoop-3.2 support 
hive-thriftserver
URL: https://github.com/apache/spark/pull/24628#issuecomment-493802167
 
 
   **[Test build #105538 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/105538/testReport)** for PR 24628 at commit [`a7bc4e8`](https://github.com/apache/spark/commit/a7bc4e891ae0862d19403164b223ec3b50a38ff3).





[GitHub] [spark] AmplabJenkins commented on issue #24628: [SPARK-27749][SQL][test-hadoop3.2][test-maven] hadoop-3.2 support hive-thriftserver

2019-05-19 Thread GitBox
AmplabJenkins commented on issue #24628: 
[SPARK-27749][SQL][test-hadoop3.2][test-maven] hadoop-3.2 support 
hive-thriftserver
URL: https://github.com/apache/spark/pull/24628#issuecomment-493826145
 
 
   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/105538/
   Test FAILed.





[GitHub] [spark] SparkQA commented on issue #24628: [SPARK-27749][SQL][test-hadoop3.2][test-maven] hadoop-3.2 support hive-thriftserver

2019-05-19 Thread GitBox
SparkQA commented on issue #24628: 
[SPARK-27749][SQL][test-hadoop3.2][test-maven] hadoop-3.2 support 
hive-thriftserver
URL: https://github.com/apache/spark/pull/24628#issuecomment-493826026
 
 
   **[Test build #105538 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/105538/testReport)** for PR 24628 at commit [`a7bc4e8`](https://github.com/apache/spark/commit/a7bc4e891ae0862d19403164b223ec3b50a38ff3).
* This patch **fails Spark unit tests**.
* This patch merges cleanly.
* This patch adds no public classes.





[GitHub] [spark] HyukjinKwon commented on issue #24644: [SPARK-27402][INFRA][FOLLOW-UP][test-hadoop3.2][test-maven] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for now

2019-05-19 Thread GitBox
HyukjinKwon commented on issue #24644: 
[SPARK-27402][INFRA][FOLLOW-UP][test-hadoop3.2][test-maven] Exclude 
'hive-thriftserver' in modules to test for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#issuecomment-493825269
 
 
   Profiles look fine so far. I will switch to the SBT build to make sure this 
doesn't affect our regular builds.





[GitHub] [spark] HyukjinKwon commented on a change in pull request #24644: [SPARK-27402][INFRA][FOLLOW-UP][test-hadoop3.2][test-maven] Exclude 'hive-thriftserver' in modules to test for hadoop3.2 for

2019-05-19 Thread GitBox
HyukjinKwon commented on a change in pull request #24644: 
[SPARK-27402][INFRA][FOLLOW-UP][test-hadoop3.2][test-maven] Exclude 
'hive-thriftserver' in modules to test for hadoop3.2 for now
URL: https://github.com/apache/spark/pull/24644#discussion_r285417372
 
 

 ##
 File path: dev/run-tests.py
 ##
 @@ -122,9 +123,15 @@ def determine_modules_to_test(changed_modules):
     # If we need to run all of the tests, then we should short-circuit and return 'root'
     if modules.root in modules_to_test:
         return [modules.root]
-    return toposort_flatten(
+    changed_modules = toposort_flatten(
         {m: set(m.dependencies).intersection(modules_to_test) for m in modules_to_test}, sort=True)
 
+    # TODO: Skip hive-thriftserver module for hadoop-3.2. remove this once hadoop-3.2 support it
 
 Review comment:
   To be clear, this fix is itself temporary, intended to keep the affected 
scope isolated and minimised. A proper fix would probably belong in `Module` 
in `modules.py`.




