[ https://issues.apache.org/jira/browse/SPARK-16551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ruiming Zhou updated SPARK-16551:
---------------------------------
    Comment: was deleted

(was: I can look at this issue.)

> Accumulator Examples should demonstrate different use case from UDAFs
> ---------------------------------------------------------------------
>
>                 Key: SPARK-16551
>                 URL: https://issues.apache.org/jira/browse/SPARK-16551
>             Project: Spark
>          Issue Type: Documentation
>            Reporter: Vladimir Feinberg
>            Priority: Minor
>
> Currently, the Spark programming guide demonstrates Accumulators
> (http://spark.apache.org/docs/latest/programming-guide.html#accumulators) by
> taking the sum of an RDD.
> This example makes new users think that Accumulators serve the role that
> UDAFs do, which they don't. They're meant to be out-of-band, small values
> that don't break pipelining. Documentation examples and notes should reflect
> this (and warn that they may cause driver bottlenecks).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
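
The distinction drawn in the issue description can be illustrated with a small sketch. It is deliberately Spark-free so that it runs with the Python standard library alone: `SideCounter`, `parse_ints`, and `run` are hypothetical names invented for this illustration and are not part of Spark's API. The sketch assumes the intended pattern is that the aggregate (here a sum) comes out of the pipeline's own return value, while the accumulator-like counter only carries a small diagnostic (here the number of malformed records) that is read after the pipeline has finished.

```python
# Spark-free model of the pattern. SideCounter is a hypothetical stand-in
# for an accumulator; it is NOT Spark's API.
from typing import Iterable, Iterator, Tuple


class SideCounter:
    """Add-only counter: stages call add(); only the caller reads value."""

    def __init__(self) -> None:
        self.value = 0

    def add(self, n: int = 1) -> None:
        self.value += n


def parse_ints(lines: Iterable[str], malformed: SideCounter) -> Iterator[int]:
    """Lazily parse lines, counting unparseable ones out of band."""
    for line in lines:
        try:
            yield int(line)
        except ValueError:
            # Diagnostic only: this never feeds into the aggregate result.
            malformed.add(1)


def run(lines: Iterable[str]) -> Tuple[int, int]:
    """Return (sum of valid records, number of malformed records)."""
    malformed = SideCounter()
    # The aggregate comes from the pipeline's return value. Parsing and
    # summing happen in a single lazy pass; nothing is materialized.
    total = sum(parse_ints(lines, malformed))
    # The counter is read only after the pipeline has been fully consumed.
    return total, malformed.value


if __name__ == "__main__":
    total, bad = run(["1", "2", "oops", "4", ""])
    print(total, bad)
```

In this model the sum would stay correct even if the counter were removed, which is the sense in which the counter is out-of-band. A documentation example built along these lines would avoid suggesting that accumulators are the way to compute the aggregate itself.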