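First you'd need to sort the RDD so that its elements have a meaningful
order (for example with sortBy on a timestamp), since a moving average only
makes sense over ordered data. I assume you have some kind of timestamp in
your data you can sort on.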

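I think you might be after the sliding() function, a developer API in MLlib.
It groups consecutive elements of an RDD into fixed-size windows, and you
can then average each window: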

https://github.com/apache/spark/blob/master/mllib/src/main/scala/org/apache/spark/mllib/rdd/RDDFunctions.scala#L43
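
To make the idea concrete, here is a minimal local sketch in plain Python
(not Spark) of what sort-then-slide computes. The names sliding_windows and
moving_average are hypothetical helpers of mine, and I'm assuming the records
are (timestamp, price) pairs. On an RDD, the sort and the windowing would be
done by Spark rather than on an in-memory list.

```python
def sliding_windows(items, window_size):
    """Return every run of `window_size` consecutive items, in order."""
    if window_size <= 0:
        raise ValueError("window_size must be positive")
    # If there are fewer items than window_size, the range is empty
    # and no windows are produced.
    return [items[i:i + window_size]
            for i in range(len(items) - window_size + 1)]


def moving_average(records, window_size):
    """Moving average of prices over (timestamp, price) records.

    Assumption: each record is a (timestamp, price) pair and the
    timestamps define the order the average should follow.
    """
    # Step 1: sort by timestamp to give the data a meaningful order.
    ordered = sorted(records, key=lambda record: record[0])
    prices = [price for _, price in ordered]
    # Step 2: slide a fixed-size window over the prices and average each one.
    return [sum(window) / window_size
            for window in sliding_windows(prices, window_size)]


if __name__ == "__main__":
    # Hypothetical sample data, deliberately out of order.
    sample = [(3, 30.0), (1, 10.0), (2, 20.0), (4, 40.0)]
    print(moving_average(sample, 2))
```

Note that N records with window size W give N - W + 1 averages, so the
result is shorter than the input.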

On Tue, Jan 6, 2015 at 5:25 PM, Asim Jalis <asimja...@gmail.com> wrote:

> Is there an easy way to do a moving average across a single RDD (in a
> non-streaming app). Here is the use case. I have an RDD made up of stock
> prices. I want to calculate a moving average using a window size of N.
>
> Thanks.
>
> Asim
>
