[ https://issues.apache.org/jira/browse/SPARK-3994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Aaron Davidson reassigned SPARK-3994:
-------------------------------------

    Assignee: Aaron Davidson

> countByKey / countByValue do not go through Aggregator
> ------------------------------------------------------
>
>                 Key: SPARK-3994
>                 URL: https://issues.apache.org/jira/browse/SPARK-3994
>             Project: Spark
>          Issue Type: Bug
>            Reporter: Aaron Davidson
>            Assignee: Aaron Davidson
>
> The implementations of these methods are historical remnants of Spark from a
> time when the shuffle may have been nonexistent. Now, they can be simplified
> by plugging into reduceByKey(), potentially seeing performance and stability
> improvements.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
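
To illustrate the simplification the issue proposes, here is a minimal plain-Python sketch of counting by key expressed on top of a reduce-by-key aggregation. It is a model of the idea only, not Spark code and not the actual patch: the function names (reduce_by_key, count_by_key, count_by_value) and the representation of a dataset as a list of partitions are assumptions made for illustration.

```python
from operator import add


def reduce_by_key(partitions, func):
    """Toy model of a reduceByKey-style aggregation over a list of partitions."""
    # Map-side combine: reduce values per key inside each partition first,
    # so only one record per key per partition needs to be moved.
    combined = []
    for part in partitions:
        local = {}
        for key, value in part:
            local[key] = func(local[key], value) if key in local else value
        combined.append(local)

    # Reduce-side merge: fold the per-partition results together.
    merged = {}
    for local in combined:
        for key, value in local.items():
            merged[key] = func(merged[key], value) if key in merged else value
    return merged


def count_by_key(partitions):
    # Replace every value with 1, then sum the ones per key.
    ones = [[(key, 1) for key, _ in part] for part in partitions]
    return reduce_by_key(ones, add)


def count_by_value(partitions):
    # Treat each element as a key with a dummy value, then count the keys.
    keyed = [[(item, None) for item in part] for part in partitions]
    return count_by_key(keyed)


if __name__ == "__main__":
    data = [[("a", 10), ("b", 20)], [("a", 30)]]
    print(count_by_key(data))
    print(count_by_value([["x", "y"], ["x"]]))
```

In this model the counting methods contain no aggregation logic of their own; they only reshape the input and delegate to the shared reduce-by-key path, which is the kind of reuse the issue description suggests.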