updateStateByKey will run for all keys, whether they have new data in a batch 
or not so you should be able to still use it.



On 7/3/15, 7:34 AM, "micvog" <mich...@micvog.com> wrote:

>UpdateStateByKey is useful but what if I want to perform an operation to all
>existing keys (not only the ones in this RDD).
>
>Word count for example - is there a way to decrease *all* words seen so far
>by 1?
>
>I was thinking of keeping a static class per node with the count information
>and issuing a broadcast command to take a certain action, but could not find
>a broadcast-to-all-nodes functionality or a better way.
>
>Thanks,
>Michael
>
>
>
>-----
>Michael Vogiatzis
>@mvogiatzis 
>--
>View this message in context: 
>http://apache-spark-user-list.1001560.n3.nabble.com/Spark-Streaming-broadcast-to-all-keys-tp23609.html
>Sent from the Apache Spark User List mailing list archive at Nabble.com.
>
>---------------------------------------------------------------------
>To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
>For additional commands, e-mail: user-h...@spark.apache.org
>

Reply via email to