[ https://issues.apache.org/jira/browse/SPARK-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319364#comment-14319364 ]
Yi Zhou edited comment on SPARK-5791 at 2/28/15 12:39 AM: ---------------------------------------------------------- INSERT INTO TABLE q22_spark_RUN_QUERY_0_result SELECT * FROM ( SELECT w_warehouse_name, i_item_id, SUM( CASE WHEN datediff(d_date, '2001-05-08') < 0 THEN inv_quantity_on_hand ELSE 0 END ) AS inv_before, SUM( CASE WHEN datediff(d_date, '2001-05-08') >= 0 THEN inv_quantity_on_hand ELSE 0 END ) AS inv_after FROM ( SELECT * FROM inventory inv JOIN ( SELECT i_item_id, i_item_sk FROM item WHERE i_current_price > 0.98 AND i_current_price < 1.5 ) items ON inv.inv_item_sk = items.i_item_sk JOIN warehouse w ON inv.inv_warehouse_sk = w.w_warehouse_sk JOIN date_dim d ON inv.inv_date_sk = d.d_date_sk WHERE datediff(d_date, '2001-05-08') >= -30 AND datediff(d_date, '2001-05-08') <= 30 ) q22_coalition_22 GROUP BY w_warehouse_name, i_item_id ) name WHERE inv_before > 0 AND inv_after / inv_before >= 2.0 / 3.0 AND inv_after / inv_before <= 3.0 / 2.0 CLUSTER BY w_warehouse_name, i_item_id; was (Author: jameszhouyi): For example: SELECT * FROM inventory inv JOIN ( SELECT i_item_id, i_item_sk FROM item WHERE i_current_price > 0.98 AND i_current_price < 1.5 ) items ON inv.inv_item_sk = items.i_item_sk JOIN warehouse w ON inv.inv_warehouse_sk = w.w_warehouse_sk JOIN date_dim d ON inv.inv_date_sk = d.d_date_sk WHERE datediff(d_date, '2001-05-08') >= -30 AND datediff(d_date, '2001-05-08') <= 30; > [Spark SQL] show poor performance when multiple table do join operation > ----------------------------------------------------------------------- > > Key: SPARK-5791 > URL: https://issues.apache.org/jira/browse/SPARK-5791 > Project: Spark > Issue Type: Bug > Components: SQL > Affects Versions: 1.2.0 > Reporter: Yi Zhou > Attachments: Physical_Plan.txt > > > Spark SQL shows poor performance when multiple tables do join operation -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org