I am new to Spark Streaming. I was running the "kafka_wordcount" example with a local Kafka and Spark instance. It was very easy to set up and get going :)

I tried running both the Scala and Python versions of the word count example. The Python version seems to be extremely slow; sometimes it has delays of more than a couple of minutes. The Scala version, on the other hand, seems to be much better. I am running on a Windows machine.

I am trying to understand what causes the slowness in Python streaming. Is there anything I am missing? For real-time streaming analysis, should I prefer Scala?
--
View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Performance-Python-streaming-v-s-Scala-streaming-tp24415.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.