It seems they have also started playing with streaming algorithms in Spark and MLlib: https://databricks.com/blog/2015/01/28/introducing-streaming-k-means-in-spark-1-2.html
I wonder how well their mini-batch programming model fits traditional streaming algorithms. Also, I guess the concept of state carried across the stream does not fit the RDD abstraction very well. Interesting to read nevertheless.

Cheers,
-- Gianmarco
