[jira] [Commented] (SPARK-16163) Statistics of logical plan is super slow on large query

2016-06-23 Thread Davies Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-16163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15347136#comment-15347136
 ] 

Davies Liu commented on SPARK-16163:


[~srowen] Yes, thanks for correcting it.

> Statistics of logical plan is super slow on large query
> ---
>
> Key: SPARK-16163
> URL: https://issues.apache.org/jira/browse/SPARK-16163
> Project: Spark
> Issue Type: Bug
> Components: SQL
> Affects Versions: 2.0.0
> Reporter: Davies Liu
> Fix For: 2.0.1
>
>
> It took several minutes to plan TPC-DS Q64 because canBroadcast() is 
> very slow on large plans.
> statistics() now takes the schema into account, so it is no longer trivial 
> to compute; we should cache the result (using a lazy val rather than a def).
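
To illustrate the "lazy val rather than def" suggestion above, here is a minimal Scala sketch. It is not Spark's code: PlanNode, Stats, UncachedNode, CachedNode and the evaluation counter are hypothetical stand-ins, and the "statistics" are just a sum of child sizes. It only shows why a def recomputes a whole subtree on every access, while a lazy val computes each node once.

```scala
// Hypothetical stand-ins for illustration only; not Spark's actual classes.
object StatsCachingSketch {
  final case class Stats(sizeInBytes: BigInt)

  // Counts how many times any node's statistics were computed.
  var evaluations: Int = 0

  sealed trait PlanNode {
    def children: Seq[PlanNode]
    def statistics: Stats

    // A leaf has size 1; an inner node sums its children's sizes.
    protected def computeStats(): Stats = {
      evaluations += 1
      if (children.isEmpty) Stats(BigInt(1))
      else Stats(children.map(_.statistics.sizeInBytes).sum)
    }
  }

  // `def`: every access recomputes the statistics of the whole subtree.
  final case class UncachedNode(children: Seq[PlanNode]) extends PlanNode {
    def statistics: Stats = computeStats()
  }

  // `lazy val`: computed on first access, then stored on the node.
  final case class CachedNode(children: Seq[PlanNode]) extends PlanNode {
    lazy val statistics: Stats = computeStats()
  }

  // Builds a linear plan with `depth` nodes using the given node constructor.
  def chain(depth: Int, mk: Seq[PlanNode] => PlanNode): PlanNode =
    (1 until depth).foldLeft(mk(Nil))((child, _) => mk(Seq(child)))

  // Reads `statistics` `accesses` times and returns the number of computations.
  def countEvaluations(plan: PlanNode, accesses: Int): Int = {
    evaluations = 0
    (1 to accesses).foreach(_ => plan.statistics)
    evaluations
  }

  def main(args: Array[String]): Unit = {
    val uncached = countEvaluations(chain(50, cs => UncachedNode(cs)), 100)
    val cached = countEvaluations(chain(50, cs => CachedNode(cs)), 100)
    println(s"def: $uncached computations, lazy val: $cached computations")
  }
}
```

With a 50-node chain read 100 times, the def variant performs 5000 computations and the lazy val variant performs 50. A planner that consults statistics repeatedly (as canBroadcast() presumably does while considering joins) would see a similar gap grow with plan size.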



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-16163) Statistics of logical plan is super slow on large query

2016-06-23 Thread Sean Owen (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-16163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15347022#comment-15347022
 ] 

Sean Owen commented on SPARK-16163:
---

[~davies] Technically, these need to be resolved for 2.0.1, not 2.0.0.







[jira] [Commented] (SPARK-16163) Statistics of logical plan is super slow on large query

2016-06-23 Thread Apache Spark (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-16163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15345934#comment-15345934
 ] 

Apache Spark commented on SPARK-16163:
--

User 'davies' has created a pull request for this issue:
https://github.com/apache/spark/pull/13871



