[jira] [Commented] (SPARK-33791) grouping__id() result does not consistent with hive's version < 2.3

Su Qilong (Jira) Tue, 15 Dec 2020 17:17:35 -0800


    [ 
https://issues.apache.org/jira/browse/SPARK-33791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17250042#comment-17250042
 ]


Su Qilong commented on SPARK-33791:
-----------------------------------

Hi [~yumwang] ，

I've read this migration guide carefully, but do not find any info about the 
grouping id impl details.

I mean, we should list this point into this migration guide, and also supply an 
option to let user to use hive 1.x compatible mode, and thus user from hive 1.x 
will be more pleased

> grouping__id() result does not consistent with hive's version < 2.3
> -------------------------------------------------------------------
>
>                 Key: SPARK-33791
>                 URL: https://issues.apache.org/jira/browse/SPARK-33791
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 2.4.3, 3.0.1
>            Reporter: Su Qilong
>            Priority: Minor
>
> See this 
> [https://cwiki.apache.org/confluence/display/Hive/Enhanced+Aggregation%2C+Cube%2C+Grouping+and+Rollup]
> Hive's grouping__id method made a change since hive version 2.3.0. Now spark 
> does not declare this inconsistency with Hive, which may make user believe 
> they're safe from migrating their query from Hive 1.x to Spark, but which is 
> wrong.
> I guess we should note this difference in Hive migration guide, and add a 
> configuration to let grouping__id to use hive 1.x compatible algorithm



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

[jira] [Commented] (SPARK-33791) grouping__id() result does not consistent with hive's version < 2.3

Reply via email to