JacobZheng created SPARK-45929:
----------------------------------

             Summary: support grouping set operation in dataframe api
                 Key: SPARK-45929
                 URL: https://issues.apache.org/jira/browse/SPARK-45929
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 3.4.1
            Reporter: JacobZheng


I am using the Spark DataFrame API for complex calculations. When I need 
grouping sets, the only option is to convert the expressions to SQL via 
analyzedPlan and then splice those fragments into one large SQL statement to 
execute. In some cases this produces an extremely complex SQL statement. When 
it is executed, ANTLR4 keeps consuming a large amount of memory, which looks 
similar to a memory leak. If grouping sets could be expressed through the 
DataFrame API, alongside the rollup and cube functions, these calculations 
would be much simpler.
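
For readers less familiar with the operation being requested, here is a 
minimal sketch of what grouping sets compute and how rollup and cube are 
special cases of them. It is plain Python with no Spark calls; the helper 
names (rollup_sets, cube_sets, grouping_sets_sum) and the sample rows are 
hypothetical and only illustrate the semantics, not any existing or proposed 
Spark API.

```python
from collections import defaultdict
from itertools import combinations


def rollup_sets(cols):
    """ROLLUP(a, b) expands to the grouping sets (a, b), (a), ()."""
    return [tuple(cols[:i]) for i in range(len(cols), -1, -1)]


def cube_sets(cols):
    """CUBE(a, b) expands to every subset of the columns, largest first."""
    return [c for r in range(len(cols), -1, -1) for c in combinations(cols, r)]


def grouping_sets_sum(rows, cols, sets, value):
    """Sum `value` once per grouping set.

    Columns that are not part of the current grouping set are reported
    as None, mirroring the NULLs that SQL GROUPING SETS produces.
    """
    out = []
    for gset in sets:
        totals = defaultdict(int)
        for row in rows:
            key = tuple(row[c] if c in gset else None for c in cols)
            totals[key] += row[value]
        out.extend((*key, total) for key, total in totals.items())
    return out


# Hypothetical sample data.
rows = [
    {"city": "A", "year": 2022, "sales": 10},
    {"city": "A", "year": 2023, "sales": 20},
    {"city": "B", "year": 2023, "sales": 5},
]
cols = ["city", "year"]

# An arbitrary pair of grouping sets that neither rollup nor cube
# produces on its own: totals per city and totals per year.
custom = [("city",), ("year",)]
result = grouping_sets_sum(rows, cols, custom, "sales")
```

The custom list is the case this issue is about: rollup and cube only cover 
fixed families of grouping sets, while an arbitrary selection such as 
(city), (year) currently has to be written as SQL.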



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
