[jira] [Created] (SPARK-33791) grouping__id() result is not consistent with Hive versions < 2.3

Su Qilong (Jira) Tue, 15 Dec 2020 04:43:34 -0800

Su Qilong created SPARK-33791:
---------------------------------

             Summary: grouping__id() result is not consistent with Hive versions < 2.3
                 Key: SPARK-33791
                 URL: https://issues.apache.org/jira/browse/SPARK-33791
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 3.0.1, 2.4.3
            Reporter: Su Qilong



See the Hive wiki page on enhanced aggregation:
[https://cwiki.apache.org/confluence/display/Hive/Enhanced+Aggregation%2C+Cube%2C+Grouping+and+Rollup]

Hive changed the behavior of grouping__id in version 2.3.0. Spark does not 
currently document this inconsistency with older Hive versions, so users may 
believe they can safely migrate their queries from Hive 1.x to Spark, when in 
fact the grouping__id values can differ.

I think we should note this difference in the Hive migration guide, and add a 
configuration that lets grouping__id use a Hive 1.x compatible algorithm.
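
To illustrate the kind of difference involved, here is a minimal Python sketch 
of the two encodings. It is not Spark or Hive code. The function names are 
hypothetical, and the legacy rule (a bit is set when the column is present in 
the grouping set, with the first GROUP BY column as the lowest bit) is an 
assumption based on the examples in the Hive wiki page linked above, so please 
verify it against a real Hive 1.x installation.

```python
def grouping_id_current(group_by_cols, grouping_set):
    """Spark / Hive >= 2.3 style encoding.

    A bit is 1 when the column is aggregated away (absent from the grouping
    set). The first GROUP BY column is the most significant bit.
    """
    gid = 0
    for col in group_by_cols:
        gid = (gid << 1) | (0 if col in grouping_set else 1)
    return gid


def grouping_id_legacy(group_by_cols, grouping_set):
    """ASSUMED Hive < 2.3 style encoding (hypothetical helper).

    A bit is 1 when the column is present in the grouping set. The first
    GROUP BY column is the least significant bit.
    """
    gid = 0
    for position, col in enumerate(group_by_cols):
        if col in grouping_set:
            gid |= 1 << position
    return gid


# Grouping sets produced by: GROUP BY key, value WITH ROLLUP
cols = ["key", "value"]
for grouping_set in [("key", "value"), ("key",), ()]:
    print(
        grouping_set,
        "current:", grouping_id_current(cols, grouping_set),
        "legacy:", grouping_id_legacy(cols, grouping_set),
    )
```

Under these assumptions, a query that filters on a literal grouping__id value 
(for example to pick out the grand-total row) would silently select different 
rows after migration, which is why a documented note or a compatibility 
setting seems useful.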



