Re: RE: Spark or Storm

2015-06-19 Thread Tathagata Das
; Ashish Soni asoni.le...@gmail.com; ayan guha guha.a...@gmail.com; user@spark.apache.org; Sateesh Kavuri sateesh.kav...@gmail.com; Spark Enthusiast sparkenthusi...@yahoo.in; Sabarish Sasidharan sabarish.sasidha...@manthan.com *Subject:* RE: RE: Spark or Storm My question is not directly related

Re: RE: Spark or Storm

2015-06-19 Thread Enno Shioji
; sparkenthusi...@yahoo.in; sabarish.sasidha...@manthan.com *Subject:* Re: RE: Spark or Storm That general description is accurate, but not really a specific issue of the direct steam. It applies to anything consuming from kafka (or, as Matei already said, any streaming system really). You can't

Re: RE: Spark or Storm

2015-06-19 Thread Ashish Soni
...@gmail.com; ayan guha guha.a...@gmail.com; user@spark.apache.org; Sateesh Kavuri sateesh.kav...@gmail.com; Spark Enthusiast sparkenthusi...@yahoo.in; Sabarish Sasidharan sabarish.sasidha...@manthan.com *Subject:* RE: RE: Spark or Storm My question is not directly related: about the exactly-once

RE: RE: Spark or Storm

2015-06-19 Thread Haopu Wang
: prajod.vettiyat...@wipro.com; Cody Koeninger; bit1...@163.com; Jordan Pilat; Will Briggs; Ashish Soni; ayan guha; user@spark.apache.org; Sateesh Kavuri; Spark Enthusiast; Sabarish Sasidharan Subject: Re: RE: Spark or Storm Fair enough, on second thought, just saying that it should be idempotent is indeed

Re: RE: Spark or Storm

2015-06-19 Thread bit1...@163.com
; Sabarish Sasidharan Subject: RE: RE: Spark or Storm My question is not directly related: about the exactly-once semantic, the document (copied below) said spark streaming gives exactly-once semantic, but actually from my test result, with check-point enabled, the application always re-process

Re: RE: Spark or Storm

2015-06-19 Thread Tathagata Das
; sabarish.sasidha...@manthan.com *Subject:* Re: RE: Spark or Storm That general description is accurate, but not really a specific issue of the direct steam. It applies to anything consuming from kafka (or, as Matei already said, any streaming system really). You can't have exactly once semantics

Re: RE: Spark or Storm

2015-06-19 Thread Enno Shioji
...@yahoo.in; sabarish.sasidha...@manthan.com *Subject:* Re: RE: Spark or Storm That general description is accurate, but not really a specific issue of the direct steam. It applies to anything consuming from kafka (or, as Matei already said, any streaming system really). You can't have exactly

Re: RE: Spark or Storm

2015-06-19 Thread Cody Koeninger
...@gmail.com; Ashish Soni asoni.le...@gmail.com; ayan guha guha.a...@gmail.com; user@spark.apache.org; Sateesh Kavuri sateesh.kav...@gmail.com; Spark Enthusiast sparkenthusi...@yahoo.in; Sabarish Sasidharan sabarish.sasidha...@manthan.com *Subject:* RE: RE: Spark or Storm My question is not directly

Re: RE: Spark or Storm

2015-06-19 Thread Cody Koeninger
asoni.le...@gmail.com; ayan guha guha.a...@gmail.com; user@spark.apache.org; Sateesh Kavuri sateesh.kav...@gmail.com; Spark Enthusiast sparkenthusi...@yahoo.in; Sabarish Sasidharan sabarish.sasidha...@manthan.com *Subject:* RE: RE: Spark or Storm My question is not directly related: about

Re: RE: Spark or Storm

2015-06-18 Thread bit1...@163.com
...@gmail.com; eshi...@gmail.com CC: wrbri...@gmail.com; asoni.le...@gmail.com; guha.a...@gmail.com; user@spark.apache.org; sateesh.kav...@gmail.com; sparkenthusi...@yahoo.in; sabarish.sasidha...@manthan.com Subject: RE: Spark or Storm not being able to read from Kafka using multiple nodes Kafka is plenty

Re: RE: Spark or Storm

2015-06-18 Thread Cody Koeninger
...@wipro.com *Date:* 2015-06-18 16:56 *To:* jrpi...@gmail.com; eshi...@gmail.com *CC:* wrbri...@gmail.com; asoni.le...@gmail.com; guha.a...@gmail.com; user@spark.apache.org; sateesh.kav...@gmail.com; sparkenthusi...@yahoo.in; sabarish.sasidha...@manthan.com *Subject:* RE: Spark or Storm not being able

RE: RE: Spark or Storm

2015-06-18 Thread prajod.vettiyattil
(WT01 - BAS); jrpi...@gmail.com; eshi...@gmail.com; wrbri...@gmail.com; asoni.le...@gmail.com; ayan guha; user; sateesh.kav...@gmail.com; sparkenthusi...@yahoo.in; sabarish.sasidha...@manthan.com Subject: Re: RE: Spark or Storm That general description is accurate, but not really a specific

RE: Spark or Storm

2015-06-18 Thread prajod.vettiyattil
Sasidharan Subject: Re: Spark or Storm not being able to read from Kafka using multiple nodes Kafka is plenty capable of doing this, by clustering together multiple consumer instances into a consumer group. If your topic is sufficiently partitioned, the consumer group can consume the topic

Re: Spark or Storm

2015-06-17 Thread Spark Enthusiast
a good idea.  So in terms of options…. spark streaming, storm, samza, akka and others…  Storm is probably the easiest to pick up,  spark streaming / akka may give you more flexibility and akka would work for CEP.  Just my $0.02 On Jun 16, 2015, at 9:40 PM, Spark Enthusiast sparkenthusi...@yahoo.in

Re: Spark or Storm

2015-06-17 Thread Matei Zaharia
that can consequently only do map. I haven't really used it tho, so can't really comment how it compares to Spark/Storm. Maybe somebody else will be able to comment. On Wed, Jun 17, 2015 at 3:13 PM, ayan guha guha.a...@gmail.com mailto:guha.a...@gmail.com wrote: Thanks

Re: Spark or Storm

2015-06-17 Thread Enno Shioji
haven't really used it tho, so can't really comment how it compares to Spark/Storm. Maybe somebody else will be able to comment. On Wed, Jun 17, 2015 at 3:13 PM, ayan guha guha.a...@gmail.com wrote: Thanks for this. It's kcl based kinesis application. But because its just a Java application we

Re: Spark or Storm

2015-06-17 Thread Matei Zaharia
providing the coordination. My understanding is that it's like a naked Storm worker process that can consequently only do map. I haven't really used it tho, so can't really comment how it compares to Spark/Storm. Maybe somebody else will be able to comment. On Wed, Jun 17, 2015 at 3

Re: Spark or Storm

2015-06-17 Thread Jordan Pilat
still have the fault tolerance associated with clustering the consumers. OK JRP On Jun 17, 2015 1:27 AM, Enno Shioji eshi...@gmail.com wrote: We've evaluated Spark Streaming vs. Storm and ended up sticking with Storm. Some of the important draw backs are: Spark has no back pressure (receiver

RE: Spark or Storm

2015-06-17 Thread Evo Eftimov
Patterns especially suitable for streaming data From: Matei Zaharia [mailto:matei.zaha...@gmail.com] Sent: Wednesday, June 17, 2015 7:14 PM To: Enno Shioji Cc: Ashish Soni; ayan guha; Sabarish Sasidharan; Spark Enthusiast; Will Briggs; user; Sateesh Kavuri Subject: Re: Spark or Storm

Re: Spark or Storm

2015-06-17 Thread Tathagata Das
To add more information beyond what Matei said and answer the original question, here are other things to consider when comparing between Spark Streaming and Storm. * Unified programming model and semantics - Most occasions you have to process the same data again in batch jobs. If you have two

Re: Spark or Storm

2015-06-17 Thread Ashish Soni
. How do i maintain a shared state ( total amount , total min , total data etc ) so that i know how much i accumulated at any given point as events for same phone can go to any node / executor. Can some one please tell me how can i achieve this is spark as in storm i can have a bolt which can do

Re: Spark or Storm

2015-06-17 Thread Sabarish Sasidharan
wrote: Hi All, I am evaluating spark VS storm ( spark streaming ) and i am not able to see what is equivalent of Bolt in storm inside spark. Any help will be appreciated on this ? Thanks , Ashish - To unsubscribe, e-mail

Re: Spark or Storm

2015-06-17 Thread Enno Shioji
We've evaluated Spark Streaming vs. Storm and ended up sticking with Storm. Some of the important draw backs are: Spark has no back pressure (receiver rate limit can alleviate this to a certain point, but it's far from ideal) There is also no exactly-once semantics. (updateStateByKey can achieve

Re: Spark or Storm

2015-06-17 Thread Spark Enthusiast
Shioji eshi...@gmail.com wrote: We've evaluated Spark Streaming vs. Storm and ended up sticking with Storm. Some of the important draw backs are: Spark has no back pressure (receiver rate limit can alleviate this to a certain point, but it's far from ideal)There is also no exactly-once

Re: Spark or Storm

2015-06-17 Thread Enno Shioji
don't become out of sync, you may find the built-in join in Spark Streaming useful. Storm also has a join keyword but its semantics are different. Also, what do you mean by No Back Pressure ? So when a topology is overloaded, Storm is designed so that it will stop reading from the source. Spark

Re: Spark or Storm

2015-06-17 Thread ayan guha
, total min , total data etc ) so that i know how much i accumulated at any given point as events for same phone can go to any node / executor. Can some one please tell me how can i achieve this is spark as in storm i can have a bolt which can do this ? Thanks, On Wed, Jun 17, 2015 at 4:52

Re: Spark or Storm

2015-06-17 Thread Enno Shioji
) so that i know how much i accumulated at any given point as events for same phone can go to any node / executor. Can some one please tell me how can i achieve this is spark as in storm i can have a bolt which can do this ? Thanks, On Wed, Jun 17, 2015 at 4:52 AM, Enno Shioji eshi

Re: Spark or Storm

2015-06-17 Thread Ashish Soni
i accumulated at any given point as events for same phone can go to any node / executor. Can some one please tell me how can i achieve this is spark as in storm i can have a bolt which can do this ? Thanks, On Wed, Jun 17, 2015 at 4:52 AM, Enno Shioji eshi...@gmail.com wrote: I guess

Re: Spark or Storm

2015-06-17 Thread Ashish Soni
given point as events for same phone can go to any node / executor. Can some one please tell me how can i achieve this is spark as in storm i can have a bolt which can do this ? Thanks, On Wed, Jun 17, 2015 at 4:52 AM, Enno Shioji eshi...@gmail.com wrote: I guess both. In terms of syntax, I

Re: Spark or Storm

2015-06-17 Thread Enno Shioji
accumulated at any given point as events for same phone can go to any node / executor. Can some one please tell me how can i achieve this is spark as in storm i can have a bolt which can do this ? Thanks, On Wed, Jun 17, 2015 at 4:52 AM, Enno Shioji eshi...@gmail.com wrote: I guess both

Re: Spark or Storm

2015-06-17 Thread Enno Shioji
a shared state ( total amount , total min , total data etc ) so that i know how much i accumulated at any given point as events for same phone can go to any node / executor. Can some one please tell me how can i achieve this is spark as in storm i can have a bolt which can do this ? Thanks

Re: Spark or Storm

2015-06-17 Thread Enno Shioji
, total min , total data etc ) so that i know how much i accumulated at any given point as events for same phone can go to any node / executor. Can some one please tell me how can i achieve this is spark as in storm i can have a bolt which can do this ? Thanks, On Wed, Jun 17, 2015 at 4

Re: Spark or Storm

2015-06-17 Thread Enno Shioji
, total data etc ) so that i know how much i accumulated at any given point as events for same phone can go to any node / executor. Can some one please tell me how can i achieve this is spark as in storm i can have a bolt which can do this ? Thanks, On Wed, Jun 17, 2015 at 4:52 AM, Enno

Re: Spark or Storm

2015-06-17 Thread ayan guha
as events for same phone can go to any node / executor. Can some one please tell me how can i achieve this is spark as in storm i can have a bolt which can do this ? Thanks, On Wed, Jun 17, 2015 at 4:52 AM, Enno Shioji eshi...@gmail.com wrote: I guess both. In terms of syntax, I

Re: Spark or Storm

2015-06-17 Thread Enno Shioji
really comment how it compares to Spark/Storm. Maybe somebody else will be able to comment. On Wed, Jun 17, 2015 at 3:13 PM, ayan guha guha.a...@gmail.com wrote: Thanks for this. It's kcl based kinesis application. But because its just a Java application we are thinking to use spark on EMR

Re: Spark or Storm

2015-06-17 Thread Ashish Soni
the coordination. My understanding is that it's like a naked Storm worker process that can consequently only do map. I haven't really used it tho, so can't really comment how it compares to Spark/Storm. Maybe somebody else will be able to comment. On Wed, Jun 17, 2015 at 3:13 PM, ayan guha guha.a

Re: Spark or Storm

2015-06-17 Thread Enno Shioji
providing the coordination. My understanding is that it's like a naked Storm worker process that can consequently only do map. I haven't really used it tho, so can't really comment how it compares to Spark/Storm. Maybe somebody else will be able to comment. On Wed, Jun 17, 2015 at 3:13 PM, ayan

Re: Spark or Storm

2015-06-17 Thread Michael Segel
Actually the reverse. Spark Streaming is really a micro batch system where the smallest window is 1/2 a second (500ms). So for CEP, its not really a good idea. So in terms of options…. spark streaming, storm, samza, akka and others… Storm is probably the easiest to pick up, spark streaming

Spark or Storm

2015-06-16 Thread asoni . learn
Hi All, I am evaluating spark VS storm ( spark streaming ) and i am not able to see what is equivalent of Bolt in storm inside spark. Any help will be appreciated on this ? Thanks , Ashish - To unsubscribe, e-mail: user

Re: Spark or Storm

2015-06-16 Thread ayan guha
: Hi All, I am evaluating spark VS storm ( spark streaming ) and i am not able to see what is equivalent of Bolt in storm inside spark. Any help will be appreciated on this ? Thanks , Ashish - To unsubscribe, e-mail

Re: Spark or Storm

2015-06-16 Thread Spark Enthusiast
closer. If you can share your use case, we might be able to provide better guidance. Regards, Will On June 16, 2015, at 9:46 PM, asoni.le...@gmail.com wrote: Hi All, I am evaluating spark VS storm ( spark streaming  ) and i am not able to see what is equivalent of Bolt in storm inside spark

Re: Spark or Storm

2015-06-16 Thread Sateesh Kavuri
PM, asoni.le...@gmail.com wrote: Hi All, I am evaluating spark VS storm ( spark streaming ) and i am not able to see what is equivalent of Bolt in storm inside spark. Any help will be appreciated on this ? Thanks , Ashish

Re: Spark or Storm

2015-06-16 Thread Will Briggs
then, there are significant differences, but it's a bit closer. If you can share your use case, we might be able to provide better guidance. Regards, Will On June 16, 2015, at 9:46 PM, asoni.le...@gmail.com wrote: Hi All, I am evaluating spark VS storm ( spark streaming ) and i am not able to see what