Su Qilong created SPARK-33791:
---------------------------------

             Summary: grouping__id() result does not consistent with hive's 
version < 2.3
                 Key: SPARK-33791
                 URL: https://issues.apache.org/jira/browse/SPARK-33791
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 3.0.1, 2.4.3
            Reporter: Su Qilong


See this 
[https://cwiki.apache.org/confluence/display/Hive/Enhanced+Aggregation%2C+Cube%2C+Grouping+and+Rollup]

Hive's grouping__id method made a change since hive version 2.3.0. Now spark 
does not declare this inconsistency with Hive, which may make user believe 
they're safe from migrating their query from Hive 1.x to Spark, but which is 
wrong.

I guess we should note this difference in Hive migration guide, and add a 
configuration to let grouping__id to use hive 1.x compatible algorithm



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to