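There is a way. Please see the Scala docs for PairDStreamFunctions:
http://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.streaming.dstream.PairDStreamFunctions

The first version of updateStateByKey listed there takes an "initialRDD"
parameter, an RDD of (key, state) pairs that seeds the state before any
streaming data arrives.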
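
For illustration, here is a minimal sketch of seeding counts through initialRDD. It is not taken from your job: the seed values, the socket source on localhost:9999, the checkpoint path, and the names SeededCounts, events, and updateFunc are all assumptions made up for the example.

```scala
import org.apache.spark.{HashPartitioner, SparkConf}
import org.apache.spark.streaming.{Seconds, StreamingContext}

object SeededCounts {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("SeededCounts").setMaster("local[2]")
    val ssc = new StreamingContext(conf, Seconds(10))

    // Stateful operations such as updateStateByKey require a checkpoint directory.
    // The path is a placeholder.
    ssc.checkpoint("/tmp/seeded-counts-checkpoint")

    // Hypothetical initial counts, keyed the same way as the stream.
    val initialRDD = ssc.sparkContext.parallelize(Seq(("a", 5L), ("b", 2L)))

    // Hypothetical source: one key per line of text, each counted once.
    val events = ssc.socketTextStream("localhost", 9999).map(key => (key, 1L))

    // Add the counts from the current batch to the running total for the key.
    val updateFunc = (newValues: Seq[Long], state: Option[Long]) =>
      Some(state.getOrElse(0L) + newValues.sum): Option[Long]

    // The overload that takes a partitioner and an initial state RDD.
    val counts = events.updateStateByKey[Long](
      updateFunc,
      new HashPartitioner(ssc.sparkContext.defaultParallelism),
      initialRDD)

    counts.print()
    ssc.start()
    ssc.awaitTermination()
  }
}
```

This sketch only accumulates. Aging off older counts would have to be handled inside the update function, for example by keeping a timestamp in the state and returning None for a key once it expires, which drops that key from the state.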

On Fri, Nov 20, 2015 at 6:52 PM, Bryan <bryan.jeff...@gmail.com> wrote:

> All,
>
> Is there a way to introduce an initial RDD without doing updateStateByKey?
> I have an initial set of counts, and the algorithm I am using requires that
> I accumulate additional counts from streaming data, age off older counts,
> and make some calculations on them. The accumulation of counts uses
> reduceByKeyAndWindow. Is there another method to seed in the initial counts
> beyond updateStateByKey?
>
> Regards,
>
> Bryan Jeffrey
>
