The difference between streaming and micro-batching is about the ordering of messages.
> Spark Streaming guarantees ordered processing of RDDs in one DStream. Since
> each RDD is processed in parallel, there is not order guaranteed within the
> RDD. This is a tradeoff design Spark made. If you want to
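To make the quoted claim concrete, here is a minimal sketch in plain Python (not Spark code). It assumes a simplified model in which batches are handled strictly one after another while the records inside a batch run on a thread pool. The names `process_batches` and `_work` are made up for this illustration.

```python
import random
import time
from concurrent.futures import ThreadPoolExecutor, as_completed


def _work(record, delay):
    # Hypothetical per-record task: sleep to mimic uneven processing time.
    time.sleep(delay)
    return record


def process_batches(batches, workers=4, seed=0):
    """Run batches sequentially; run the records of each batch in parallel.

    Returns a list of (batch_index, record) pairs in completion order.
    """
    rng = random.Random(seed)
    completed = []
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for batch_index, batch in enumerate(batches):
            futures = [
                pool.submit(_work, record, rng.uniform(0, 0.01))
                for record in batch
            ]
            # The next batch starts only after every record of this one is done,
            # so order across batches is kept. Inside the batch, records finish
            # in whatever order the threads happen to complete.
            for future in as_completed(futures):
                completed.append((batch_index, future.result()))
    return completed


if __name__ == "__main__":
    for batch_index, record in process_batches([[1, 2, 3], [4, 5, 6], [7, 8, 9]]):
        print(batch_index, record)
```

In this model the batch indices in the output never go backwards, but the records within one batch may appear in any order from run to run.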
Mini-batch or near real time: processing frames within 500 ms or more.
Real time: processing frames in 5-10 ms.
The main difference is processing velocity, I think.
Apache Spark Streaming is mini-batch, not true real time.
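A rough way to see where the latency gap comes from, sketched in plain Python under simplifying assumptions: an event that arrives during a batch interval is only handled once that interval closes, whereas a per-event engine handles it on arrival. The function names, the fixed `processing_ms` cost, and the absence of any queueing are assumptions of this sketch, not Spark behavior.

```python
import math


def micro_batch_latencies(arrivals_ms, interval_ms, processing_ms=0.0):
    """Latency of each event when work starts only at the end of its batch interval."""
    latencies = []
    for t in arrivals_ms:
        # End of the interval that contains arrival time t.
        boundary = (math.floor(t / interval_ms) + 1) * interval_ms
        latencies.append(boundary + processing_ms - t)
    return latencies


def per_event_latencies(arrivals_ms, processing_ms=0.0):
    """Latency of each event when it is handled as soon as it arrives."""
    return [processing_ms for _ in arrivals_ms]


if __name__ == "__main__":
    arrivals = [0, 100, 499, 500]
    print(micro_batch_latencies(arrivals, interval_ms=500))
    print(per_event_latencies(arrivals, processing_ms=5))
```

With a 500 ms interval, events arriving at 0, 100 and 499 ms wait 500, 400 and 1 ms respectively. In this model the worst-case wait in the micro-batch case is one full interval plus the processing cost, while the per-event case pays only the processing cost.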
Alonso Isidoro Roman
about.me/alonso.isidoro.roman
I understand the difference between fraud detection and fraud prevention in
general but I am not interested in the semantic war on what these terms
precisely mean. I am more interested in understanding the difference
between mini-batch and real-time streaming from a CS perspective.
On Tue, Sep
Replace mini-batch with micro-batching and do a search again. What is your
understanding of fraud detection?
Spark Streaming can be used for risk calculation and fraud detection
(including stopping fraud from going through, for example credit card
fraud) effectively "in practice". It can even be used
What is the difference between mini-batch and real-time streaming in practice
(not theory)? In theory, I understand mini-batch is something that batches
within a given time frame, whereas real-time streaming is more like doing
something as the data arrives, but my biggest question is why not have mini