[ 
https://issues.apache.org/jira/browse/SPARK-13307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15146765#comment-15146765
 ] 

Xiao Li commented on SPARK-13307:
---------------------------------

In the following PR: https://github.com/apache/spark/pull/9645, shuffle hash 
join is removed from Spark SQL. Try to see if broadcast join works in this test 
case. You also can use hint to force the broadcast join. 

Let me CC [~rxin] [~yhuai] [~marmbrus]

> TPCDS query 66 degraded by 30% in 1.6.0 compared to 1.4.1
> ---------------------------------------------------------
>
>                 Key: SPARK-13307
>                 URL: https://issues.apache.org/jira/browse/SPARK-13307
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 1.6.0
>            Reporter: JESSE CHEN
>
> Majority of the TPCDS queries ran faster in 1.6.0 than in 1.4.1, average 
> about 9% faster. There are a few degraded, and one that is definitely not 
> within error margin is query 66.
> Query 66 in 1.4.1: 699 seconds
> Query 66 in 1.6.0: 918 seconds
> 30% worse.
> Collected the physical plans from both versions - drastic difference maybe 
> partially from using Tungsten in 1.6, but anything else at play here?
> Please see plans here:
> https://ibm.box.com/spark-sql-q66-debug-160plan
> https://ibm.box.com/spark-sql-q66-debug-141plan



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to