Hello sir,
I'm currently working on a project where i would've to detect anomalies in
real time streaming data pushing data from kafka into apache spark. I chose
to go with streaming kmeans clustering algorithm, but I couldn't find much
about it. Do you think it is a suitable algorithm to go
n/0.10/comparisons/spark-streaming.html>
> On Sep 27, 2016, at 2:12 PM, kant kodali <kanth...@gmail.com> wrote:
>
> What is the difference between mini-batch vs real time streaming in practice
> (not theory)? In theory, I understand mini batch is something that batches in
&
am not interested in the semantic war on what these terms
> precisely mean. I am more interested in understanding the difference
> between mini-batch vs real time streaming from CS perspective.
>
>
>
> On Tue, Sep 27, 2016 12:54 AM, Mich Talebzadeh mich.talebza...@gmail.com
> wrote:
>
I understand the difference between fraud detection and fraud prevention in
general but I am not interested in the semantic war on what these terms
precisely mean. I am more interested in understanding the difference
between mini-batch vs real time streaming from CS perspective.
On Tue, Sep
uch loss, damage or destruction.
On 27 September 2016 at 08:12, kant kodali <kanth...@gmail.com> wrote:
> What is the difference between mini-batch vs real time streaming in
> practice (not theory)? In theory, I understand mini batch is something that
> batches in the given time frame wher
What is the difference between mini-batch vs real time streaming in practice
(not theory)? In theory, I understand mini batch is something that batches in
the given time frame whereas real time streaming is more like do something as
the data arrives but my biggest question is why not have mini
the spark tutorial shows that we can create a stream that reads new files
from a directory.
that seems to have some lag time, as we have to write the data to file first
and then wait until spark stream picks it up.
what is the best way to implement REAL 'REAL-TIME' streaming for analysis
the data to file
first
and then wait until spark stream picks it up.
what is the best way to implement REAL 'REAL-TIME' streaming for analysis
in
real time? for example, like streaming videos, sounds, images, etc
continuously?
thanks!
--
View this message in context:
http://apache-spark
time. thanks again.
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/real-time-streaming-tp17526p17528.html
Sent from the Apache Spark User List mailing list archive at Nabble.com