There is a way. Please see the Scala docs for PairDStreamFunctions:
http://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.streaming.dstream.PairDStreamFunctions
The first version of updateStateByKey has the parameter "initialRDD".

On Fri, Nov 20, 2015 at 6:52 PM, Bryan <bryan.jeff...@gmail.com> wrote:

> All,
>
> Is there a way to introduce an initial RDD without doing updateStateByKey?
> I have an initial set of counts, and the algorithm I am using requires that
> I accumulate additional counts from streaming data, age off older counts,
> and make some calculations on them. The accumulation of counts uses
> reduceByKeyAndWindow. Is there another method to seed in the initial counts
> beyond updateStateByKey?
>
> Regards,
>
> Bryan Jeffrey
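
For reference, here is a minimal sketch of seeding state through the "initialRDD" parameter of updateStateByKey mentioned in the reply. It is only an illustration under assumptions, not the original poster's code: the socket source on localhost:9999, the checkpoint path, the seed counts, and the names SeededCounts and updateCounts are all made up for the example. It uses the overload that takes an update function, a Partitioner, and an initial RDD of (key, state) pairs.

```scala
import org.apache.spark.{HashPartitioner, SparkConf}
import org.apache.spark.rdd.RDD
import org.apache.spark.streaming.{Seconds, StreamingContext}

object SeededCounts {
  // Hypothetical update function: add this batch's counts for a key
  // to whatever state already exists for that key (0 if none).
  def updateCounts(newCounts: Seq[Long], state: Option[Long]): Option[Long] =
    Some(state.getOrElse(0L) + newCounts.sum)

  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setMaster("local[2]").setAppName("SeededCounts")
    val ssc = new StreamingContext(conf, Seconds(10))

    // updateStateByKey requires a checkpoint directory (path is an assumption).
    ssc.checkpoint("/tmp/seeded-counts-checkpoint")

    // Assumed seed data: the counts that exist before any streaming input arrives.
    val initialRDD: RDD[(String, Long)] =
      ssc.sparkContext.parallelize(Seq(("a", 5L), ("b", 2L)))

    // Assumed input: one key per line of text from a local socket.
    val events = ssc.socketTextStream("localhost", 9999).map(key => (key, 1L))

    // The state for each key starts from initialRDD instead of starting empty.
    val counts = events.updateStateByKey[Long](
      updateCounts _,
      new HashPartitioner(ssc.sparkContext.defaultParallelism),
      initialRDD)

    counts.print()
    ssc.start()
    ssc.awaitTermination()
  }
}
```

An update function that returns None for a key removes that key from the state, which is one possible way to age off older counts. The sketch above does not do this and keeps every key indefinitely.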