[jira] [Commented] (SPARK-20320) AnalysisException: Columns of grouping_id (count(value#17L)) does not match grouping columns (count(value#17L))

Takeshi Yamamuro (JIRA) Mon, 17 Apr 2017 21:57:07 -0700

    [ 
https://issues.apache.org/jira/browse/SPARK-20320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15972126#comment-15972126
 ]


Takeshi Yamamuro commented on SPARK-20320:
------------------------------------------

Is this query (putting `AggregateFunction` like `count(value)` in `cube`) valid?
Could you explain what you get from this query?
{code}
scala> spark.range(5).cube(count("id")).avg().show
org.apache.spark.sql.AnalysisException: grouping expressions sequence is empty, 
and '`id`' is not an aggregate function. Wrap '(count(`id`) AS `count(id#7L)`)' 
in windowing function(s) or wrap '`id`' in first() (or first_value) if you 
don't care which value you get.;;
Aggregate [count(id#7L)#19L, spark_grouping_id#17], [count(id#7L) AS 
count(id)#15L, avg(id#7L) AS avg(id)#16]
+- Expand [List(id#7L, count(id#7L)#18L, 0), List(id#7L, null, 1)], [id#7L, 
count(id#7L)#19L, spark_grouping_id#17]
   +- Aggregate [id#7L, count(id#7L) AS count(id#7L)#18L]
      +- Range (0, 5, step=1, splits=Some(4))
{code}

> AnalysisException: Columns of grouping_id (count(value#17L)) does not match 
> grouping columns (count(value#17L))
> ---------------------------------------------------------------------------------------------------------------
>
>                 Key: SPARK-20320
>                 URL: https://issues.apache.org/jira/browse/SPARK-20320
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 2.2.0
>            Reporter: Jacek Laskowski
>            Priority: Minor
>
> I'm not questioning the {{AnalysisException}} (which I don't know whether 
> should be reported or not), but the exception message that tells...nothing 
> helpful.
> {code}
> val records = spark.range(5).flatMap(n => Seq.fill(n.toInt)(n))
> scala> 
> records.cube(count("value")).agg(grouping_id(count("value"))).queryExecution.logical
> org.apache.spark.sql.AnalysisException: Columns of grouping_id 
> (count(value#17L)) does not match grouping columns (count(value#17L));
>   at 
> org.apache.spark.sql.catalyst.analysis.Analyzer$ResolveGroupingAnalytics$$anonfun$org$apache$spark$sql$catalyst$analysis$Analyzer$ResolveGroupingAnalytics$$replaceGroupingFunc$1.applyOrElse(Analyzer.scala:313)
>   at 
> org.apache.spark.sql.catalyst.analysis.Analyzer$ResolveGroupingAnalytics$$anonfun$org$apache$spark$sql$catalyst$analysis$Analyzer$ResolveGroupingAnalytics$$replaceGroupingFunc$1.applyOrElse(Analyzer.scala:308)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

[jira] [Commented] (SPARK-20320) AnalysisException: Columns of grouping_id (count(value#17L)) does not match grouping columns (count(value#17L))

Reply via email to