Re: Join RDDs with DStreams

2015-01-08 Thread Akhil Das
Here's how you do it:

    val joined_stream = myStream.transform((x: RDD[(String, String)]) => {
      val prdd = new PairRDDFunctions[String, String](x)
      prdd.join(myRDD)
    })

Thanks
Best Regards

On Thu, Jan 8, 2015 at 10:20 PM, Asim Jalis asimja...@gmail.com wrote: Is there a way

Join RDDs with DStreams

2015-01-08 Thread Asim Jalis
Is there a way to join non-DStream RDDs with DStream RDDs? Here is the use case. I have a lookup table stored in HDFS that I want to read as an RDD. Then I want to join it with the RDDs that are coming in through the DStream. How can I do this? Thanks. Asim

Re: Join RDDs with DStreams

2015-01-08 Thread Gerard Maas
You are looking for:

    dstream.transform(rdd => rdd.op(otherRdd))

The docs contain an example of how to use transform: https://spark.apache.org/docs/latest/streaming-programming-guide.html#transformations-on-dstreams

-kr, Gerard.

On Thu, Jan 8, 2015 at 5:50 PM, Asim Jalis asimja...@gmail.com
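
For reference, the transform approach suggested in this thread might look roughly like the sketch below when applied to the original use case (a lookup table in HDFS joined against each batch of a DStream). This is only an illustration under assumptions: the HDFS path, the "key,value" line format, the socket source on localhost:9999, the 5-second batch interval, and the object name are all hypothetical and not taken from the thread.

```scala
import org.apache.spark.SparkConf
import org.apache.spark.rdd.RDD
import org.apache.spark.streaming.{Seconds, StreamingContext}
import org.apache.spark.streaming.dstream.DStream
// On Spark 1.2 and earlier, also add `import org.apache.spark.SparkContext._`
// to bring the pair-RDD implicits (join etc.) into scope.

object JoinLookupWithStream {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf()
      .setAppName("JoinLookupWithStream")
      .setMaster("local[2]") // assumption: local test run
    val ssc = new StreamingContext(conf, Seconds(5)) // assumption: 5 s batches

    // Static lookup table, read once as a plain RDD.
    // Hypothetical path; each line is assumed to be "key,value".
    val lookup: RDD[(String, String)] = ssc.sparkContext
      .textFile("hdfs:///path/to/lookup.csv")
      .map(_.split(",", 2))
      .filter(_.length == 2)
      .map(parts => (parts(0), parts(1)))
      .cache() // reused in every batch, so keep it in memory

    // Incoming stream of "key,value" lines (hypothetical socket source).
    val events: DStream[(String, String)] = ssc
      .socketTextStream("localhost", 9999)
      .map(_.split(",", 2))
      .filter(_.length == 2)
      .map(parts => (parts(0), parts(1)))

    // transform exposes each batch as an RDD, so an ordinary
    // RDD-to-RDD join against the static table can be used.
    val joined: DStream[(String, (String, String))] =
      events.transform(rdd => rdd.join(lookup))

    joined.print()
    ssc.start()
    ssc.awaitTermination()
  }
}
```

Since the pair-RDD functions are picked up implicitly here, wrapping each batch in PairRDDFunctions by hand should not be needed. The lookup RDD is cached because the same table is joined against every batch.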