[jira] [Commented] (SPARK-8007) Support resolving virtual columns in DataFrames

Michael Armbrust (JIRA) Tue, 21 Jul 2015 11:51:10 -0700

    [ https://issues.apache.org/jira/browse/SPARK-8007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14635597#comment-14635597 ]


Michael Armbrust commented on SPARK-8007:
-----------------------------------------

I'm going to propose that we leave the analyzer unchanged and instead expose
functions for all of the cases that were specified.  This is nice because a
function call can never be ambiguous with a user column.
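
To illustrate the ambiguity argument, here is a toy sketch in plain Python. It
is not Spark's analyzer: resolve_by_name, VIRTUAL_COLUMNS and the tuple
"expressions" are hypothetical names made up for this example. It only shows
why a reserved column name can collide with user data while a function call
cannot. In Spark itself the function-based form lives in the SQL functions
module (spark_partition_id() in current releases).

```python
# Toy model only: NOT Spark's analyzer. All names here are hypothetical and
# exist purely to illustrate the ambiguity argument.

VIRTUAL_COLUMNS = {"SPARK__PARTITION__ID"}


def resolve_by_name(name, user_columns):
    """Name-based resolution: a string may match a user column, a virtual
    column, or both. Matching both is the ambiguous case."""
    matches = []
    if name in user_columns:
        matches.append(("user", name))
    if name in VIRTUAL_COLUMNS:
        matches.append(("virtual", name))
    if len(matches) > 1:
        raise ValueError(f"ambiguous reference: {name!r} matches {matches}")
    if not matches:
        raise KeyError(name)
    return matches[0]


def spark_partition_id():
    """Function-based approach: returns the virtual expression directly and
    never looks at column names, so user columns cannot shadow it."""
    return ("virtual", "SPARK__PARTITION__ID")


# The user's data happens to contain a column with the reserved name.
user_columns = {"id", "SPARK__PARTITION__ID"}

try:
    resolve_by_name("SPARK__PARTITION__ID", user_columns)
    name_based = "resolved"
except ValueError:
    name_based = "ambiguous"

function_based = spark_partition_id()

print(name_based)
print(function_based)
```

With the function form, the skew check from the issue description would group
by the function's result rather than by a magic column name.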


> Support resolving virtual columns in DataFrames
> -----------------------------------------------
>
>                 Key: SPARK-8007
>                 URL: https://issues.apache.org/jira/browse/SPARK-8007
>             Project: Spark
>          Issue Type: Sub-task
>          Components: SQL
>            Reporter: Reynold Xin
>            Assignee: Joseph Batchik
>
> Create the infrastructure so we can resolve df("SPARK__PARTITION__ID") to a
> SparkPartitionID expression.
> A cool use case is to understand physical data skew:
> {code}
> df.groupBy("SPARK__PARTITION__ID").count()
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
