[
https://issues.apache.org/jira/browse/SPARK-11371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14980484#comment-14980484
]
Ted Yu commented on SPARK-11371:
--------------------------------
Since I cannot assign the JIRA to myself, attaching patch shows my intention
working on the JIRA.
The background is that I wanted to open 3 PRs as of yerterday but I don't have
as many email addresses (forked repo's, i.e.).
I am more than willing to learn from experts how multiple outstanding PRs are
managed.
As for the mean alias, I quoted Reynold's response.
I am open to discussion on whether this would ultimately go through.
> Make "mean" an alias for "avg" operator
> ---------------------------------------
>
> Key: SPARK-11371
> URL: https://issues.apache.org/jira/browse/SPARK-11371
> Project: Spark
> Issue Type: Improvement
> Components: SQL
> Reporter: Ted Yu
> Priority: Minor
> Attachments: spark-11371-v1.patch
>
>
> From Reynold in the thread 'Exception when using some aggregate operators'
> (http://search-hadoop.com/m/q3RTt0xFr22nXB4/):
> I don't think these are bugs. The SQL standard for average is "avg", not
> "mean". Similarly, a distinct count is supposed to be written as
> "count(distinct col)", not "countDistinct(col)".
> We can, however, make "mean" an alias for "avg" to improve compatibility
> between DataFrame and SQL.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]