[
https://issues.apache.org/jira/browse/HIVE-8486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14174292#comment-14174292
]
Chao commented on HIVE-8486:
----------------------------
1) I discussed with [~szehon] about this, and seems we cannot change the
calculation now, since it will affect the way how the buckets are calculated.
2) In the Spark branch, we are not doing anything to estimate the number of
reducers. In MR, in case this number is not set, it will estimate and set it at
runtime. This is done in {{MapRedTask}}. Tez also uses "Auto Reducer
Parallelism" (see HIVE-7158) to control this. As result, I think we should do
something to resolve this.
> TPC-DS Query 96 parallelism is not set correcly
> -----------------------------------------------
>
> Key: HIVE-8486
> URL: https://issues.apache.org/jira/browse/HIVE-8486
> Project: Hive
> Issue Type: Sub-task
> Components: Spark
> Reporter: Brock Noland
> Assignee: Chao
>
> When we run the query on a 20B we only have a parallelism factor of 1.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)