I know that Spark is using data parallelism over, say, HDFS - optimally running 
computations on local data (aka data locality).
I was wondering how Spark streaming moves data (messages) around? since the 
data is streamed in as DStreams and is not on a distributed FS like HDFS.

Thanks!

Reply via email to