[ https://issues.apache.org/jira/browse/SPARK-11329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15026372#comment-15026372 ]
Maciej BryĆski commented on SPARK-11329: ---------------------------------------- I'm using Spark 1.6. I did some additional tests. This select use TungstenAggregate: ``` sqlCtx.sql('select id, max(data) as max from table group by id ').collect() ``` When I add struct() it changed to ConvertToSafe path. So I think the problem lies in struct() function. > Expand Star when creating a struct > ---------------------------------- > > Key: SPARK-11329 > URL: https://issues.apache.org/jira/browse/SPARK-11329 > Project: Spark > Issue Type: Improvement > Components: SQL > Reporter: Yin Huai > Assignee: Nong Li > Fix For: 1.6.0 > > > It is pretty common for customers to do regular extractions of update data > from an external datasource (e.g. mysql or postgres). While this is possible > today, the syntax is a little onerous. With some small improvements to the > analyzer I think we could make this much easier. > Goal: Allow users to execute the following two queries as well as their > dataframe equivalents > to find the most recent record for each key > {{SELECT max(struct(timestamp, *)) as mostRecentRecord GROUP BY key}} > to unnest the struct from above. > {{SELECT mostRecentRecord.* FROM data}} -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org