[jira] [Assigned] (SPARK-45929) support grouping set operation in dataframe api
[ https://issues.apache.org/jira/browse/SPARK-45929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-45929: Assignee: JacobZheng > support grouping set operation in dataframe api > --- > > Key: SPARK-45929 > URL: https://issues.apache.org/jira/browse/SPARK-45929 > Project: Spark > Issue Type: Improvement > Components: SQL >Affects Versions: 3.4.1 >Reporter: JacobZheng >Assignee: JacobZheng >Priority: Major > Labels: pull-request-available > > I am using spark dataframe api for complex calculations. When I need to use > the grouping sets function, I can only convert the expression to sql via > analyzedPlan and then splice these sql into a complex sql to execute. In some > cases, this operation generates an extremely complex sql. executing this > complex sql, antlr4 continues to consume a large amount of memory, similar to > a memory leak scenario. If you can and rollup, cube function through the > dataframe api to calculate these operations will be much simpler. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Assigned] (SPARK-45929) support grouping set operation in dataframe api
[ https://issues.apache.org/jira/browse/SPARK-45929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot reassigned SPARK-45929: -- Assignee: (was: Apache Spark) > support grouping set operation in dataframe api > --- > > Key: SPARK-45929 > URL: https://issues.apache.org/jira/browse/SPARK-45929 > Project: Spark > Issue Type: Improvement > Components: SQL >Affects Versions: 3.4.1 >Reporter: JacobZheng >Priority: Major > Labels: pull-request-available > > I am using spark dataframe api for complex calculations. When I need to use > the grouping sets function, I can only convert the expression to sql via > analyzedPlan and then splice these sql into a complex sql to execute. In some > cases, this operation generates an extremely complex sql. executing this > complex sql, antlr4 continues to consume a large amount of memory, similar to > a memory leak scenario. If you can and rollup, cube function through the > dataframe api to calculate these operations will be much simpler. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Assigned] (SPARK-45929) support grouping set operation in dataframe api
[ https://issues.apache.org/jira/browse/SPARK-45929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot reassigned SPARK-45929: -- Assignee: Apache Spark > support grouping set operation in dataframe api > --- > > Key: SPARK-45929 > URL: https://issues.apache.org/jira/browse/SPARK-45929 > Project: Spark > Issue Type: Improvement > Components: SQL >Affects Versions: 3.4.1 >Reporter: JacobZheng >Assignee: Apache Spark >Priority: Major > Labels: pull-request-available > > I am using spark dataframe api for complex calculations. When I need to use > the grouping sets function, I can only convert the expression to sql via > analyzedPlan and then splice these sql into a complex sql to execute. In some > cases, this operation generates an extremely complex sql. executing this > complex sql, antlr4 continues to consume a large amount of memory, similar to > a memory leak scenario. If you can and rollup, cube function through the > dataframe api to calculate these operations will be much simpler. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Assigned] (SPARK-45929) support grouping set operation in dataframe api
[ https://issues.apache.org/jira/browse/SPARK-45929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot reassigned SPARK-45929: -- Assignee: (was: Apache Spark) > support grouping set operation in dataframe api > --- > > Key: SPARK-45929 > URL: https://issues.apache.org/jira/browse/SPARK-45929 > Project: Spark > Issue Type: Improvement > Components: SQL >Affects Versions: 3.4.1 >Reporter: JacobZheng >Priority: Major > Labels: pull-request-available > > I am using spark dataframe api for complex calculations. When I need to use > the grouping sets function, I can only convert the expression to sql via > analyzedPlan and then splice these sql into a complex sql to execute. In some > cases, this operation generates an extremely complex sql. executing this > complex sql, antlr4 continues to consume a large amount of memory, similar to > a memory leak scenario. If you can and rollup, cube function through the > dataframe api to calculate these operations will be much simpler. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Assigned] (SPARK-45929) support grouping set operation in dataframe api
[ https://issues.apache.org/jira/browse/SPARK-45929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot reassigned SPARK-45929: -- Assignee: Apache Spark > support grouping set operation in dataframe api > --- > > Key: SPARK-45929 > URL: https://issues.apache.org/jira/browse/SPARK-45929 > Project: Spark > Issue Type: Improvement > Components: SQL >Affects Versions: 3.4.1 >Reporter: JacobZheng >Assignee: Apache Spark >Priority: Major > Labels: pull-request-available > > I am using spark dataframe api for complex calculations. When I need to use > the grouping sets function, I can only convert the expression to sql via > analyzedPlan and then splice these sql into a complex sql to execute. In some > cases, this operation generates an extremely complex sql. executing this > complex sql, antlr4 continues to consume a large amount of memory, similar to > a memory leak scenario. If you can and rollup, cube function through the > dataframe api to calculate these operations will be much simpler. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Assigned] (SPARK-45929) support grouping set operation in dataframe api
[ https://issues.apache.org/jira/browse/SPARK-45929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot reassigned SPARK-45929: -- Assignee: (was: Apache Spark) > support grouping set operation in dataframe api > --- > > Key: SPARK-45929 > URL: https://issues.apache.org/jira/browse/SPARK-45929 > Project: Spark > Issue Type: Improvement > Components: SQL >Affects Versions: 3.4.1 >Reporter: JacobZheng >Priority: Major > Labels: pull-request-available > > I am using spark dataframe api for complex calculations. When I need to use > the grouping sets function, I can only convert the expression to sql via > analyzedPlan and then splice these sql into a complex sql to execute. In some > cases, this operation generates an extremely complex sql. executing this > complex sql, antlr4 continues to consume a large amount of memory, similar to > a memory leak scenario. If you can and rollup, cube function through the > dataframe api to calculate these operations will be much simpler. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Assigned] (SPARK-45929) support grouping set operation in dataframe api
[ https://issues.apache.org/jira/browse/SPARK-45929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot reassigned SPARK-45929: -- Assignee: Apache Spark > support grouping set operation in dataframe api > --- > > Key: SPARK-45929 > URL: https://issues.apache.org/jira/browse/SPARK-45929 > Project: Spark > Issue Type: Improvement > Components: SQL >Affects Versions: 3.4.1 >Reporter: JacobZheng >Assignee: Apache Spark >Priority: Major > Labels: pull-request-available > > I am using spark dataframe api for complex calculations. When I need to use > the grouping sets function, I can only convert the expression to sql via > analyzedPlan and then splice these sql into a complex sql to execute. In some > cases, this operation generates an extremely complex sql. executing this > complex sql, antlr4 continues to consume a large amount of memory, similar to > a memory leak scenario. If you can and rollup, cube function through the > dataframe api to calculate these operations will be much simpler. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org