[ https://issues.apache.org/jira/browse/SPARK-11371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14980484#comment-14980484 ]
Ted Yu commented on SPARK-11371:
--------------------------------

Since I cannot assign the JIRA to myself, attaching a patch shows my intention to work on the JIRA.

The background is that I wanted to open 3 PRs as of yesterday, but I don't have as many email addresses (i.e., forked repos).
I am more than willing to learn from experts how multiple outstanding PRs are managed.

As for the mean alias, I quoted Reynold's response. I am open to discussion on whether this would ultimately go through.

> Make "mean" an alias for "avg" operator
> ---------------------------------------
>
>                 Key: SPARK-11371
>                 URL: https://issues.apache.org/jira/browse/SPARK-11371
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>            Reporter: Ted Yu
>            Priority: Minor
>         Attachments: spark-11371-v1.patch
>
>
> From Reynold in the thread 'Exception when using some aggregate operators'
> (http://search-hadoop.com/m/q3RTt0xFr22nXB4/):
> I don't think these are bugs. The SQL standard for average is "avg", not
> "mean". Similarly, a distinct count is supposed to be written as
> "count(distinct col)", not "countDistinct(col)".
> We can, however, make "mean" an alias for "avg" to improve compatibility
> between DataFrame and SQL.