Spark Streaming - batchDuration for streaming

alJune Wed, 17 Sep 2014 11:47:09 -0700

Hi esteemed Sparkers,
I have some doubts about the StreamingContext batchDuration parameter.
I have already read the excellent explanation by Tathagata Das of the difference
between batch and window duration.
But the issue still seems somewhat confusing to me. For example, without any window,
Spark Streaming acts as an implicit window whose length is batchDuration.
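
To make concrete what I mean by "implicit window", here is a rough sketch in plain Python. It is not Spark code: the function name micro_batches and the sample events are made up for illustration, and it only mimics the idea of cutting a stream into intervals of batchDuration, under the assumption that this is how the batching behaves.

```python
from collections import defaultdict


def micro_batches(events, batch_duration):
    """Group (timestamp_seconds, payload) events into consecutive intervals.

    Hypothetical helper: it imitates the batch-interval idea only and does
    not reproduce Spark's scheduler. Intervals with no events are skipped
    here for brevity.
    """
    batches = defaultdict(list)
    for ts, payload in events:
        # An event belongs to the interval [k * d, (k + 1) * d) containing it.
        batches[int(ts // batch_duration)].append(payload)
    return [batches[k] for k in sorted(batches)]


# Made-up events: (arrival time in seconds, payload).
events = [(0.5, "a"), (1.2, "b"), (2.1, "c"), (4.9, "d")]
print(micro_batches(events, batch_duration=2))
# prints [['a', 'b'], ['c'], ['d']]
```

With batch_duration=2 every processing step sees only the events of one 2-second interval, which is why it looks to me like a window even though no window operation is defined.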
So,
1. What is a simple strategy for choosing the correct batchDuration value?
  My use case is a Flume spooling directory source for new text files =>
Spark Streaming analytics app.
  And one more question:
2. What is the reason for batching the stream?
  I have used BigInsights Streams, and it has no such "batch" strategy, since "the
stream is a continuous flow". You run Streams with one of many source
connectors to capture the input data flow (stream). If you need to, you can work with
the data in a window frame. But in any case, your Streams app listens to the
stream continuously.
Perhaps I don't understand some important and powerful characteristic of
the Spark Streaming architecture.
Thanks in advance.





