Re: Spark Streaming Data Sharing

2014-08-18 Thread Ruchir Jha
The Spark Job that has the main DStream, could have another DStream that is listening for stream subscription requests. So when a subscription is received, you could do a filter/forEach on the main DStream and respond to that one request. So you're basically creating a stream server that is

Re: Can I share the RDD between multiprocess

2014-08-11 Thread Ruchir Jha
Look at: https://github.com/ooyala/spark-jobserver On Mon, Aug 11, 2014 at 11:48 AM, coolfrood aara...@quantcast.com wrote: Reviving this discussion again... I'm interested in using Spark as the engine for a web service. The SparkContext and its RDDs only exist in the JVM that started it.

Overriding dstream window definition

2014-08-09 Thread Ruchir Jha
Hi I intend on using the same Spark Streaming program for both real time and batch processing of my time stamped data. However with batch processing all window based operations would be meaningless because (I assume) the window is defined by the arrival times of data and it is not possible to

Debugging spark

2014-07-19 Thread Ruchir Jha
I am a newbie and am looking for pointers to start debugging my spark app and did not find a straightforward tutorial. Any help is appreciated? Sent from my iPhone