Re: Kafka - streaming from multiple topics

2015-12-21 Thread Cody Koeninger
t;>>>>> Since those 3 topics contain completely different data on which I >>>>>> need to apply different kind of transformations, I am not sure joining >>>>>> them >>>>>> would be really efficient, unless you know something that I don&#

Re: Kafka - streaming from multiple topics

2015-12-21 Thread Neelesh
, unless you know something that I don't. >>>>> >>>>> As I really don't need any interaction between those streams, I think >>>>> I might end up running 3 different streaming apps instead of one. >>>>> >>>>> Tha

Re: Kafka - streaming from multiple topics

2015-12-21 Thread Cody Koeninger
;>> wrote: >>>> >>>>> Using spark.streaming.concurrentJobs for this probably isn't a good >>>>> idea, as it allows the next batch to start processing before current one >>>>> is >>>>> finished, which may have unintende

Re: Kafka - streaming from multiple topics

2015-12-20 Thread Neelesh
g before current one is >>>> finished, which may have unintended consequences. >>>> >>>> Why can't you use a single stream with all the topics you care about, >>>> or multiple streams if you're e.g. joining them? >>>> >>>> &g

Re: Kafka - streaming from multiple topics

2015-12-20 Thread Chris Fregly
;> >>>>> On Wed, Dec 16, 2015 at 3:00 PM, jpocalan wrote: >>>>> Nevermind, I found the answer to my questions. >>>>> The following spark configuration property will allow you to process >>>>> multiple KafkaDirectStream in parallel: >&

Re: Kafka - streaming from multiple topics

2015-12-19 Thread Neelesh
d, Dec 16, 2015 at 3:00 PM, jpocalan wrote: >>> >>>> Nevermind, I found the answer to my questions. >>>> The following spark configuration property will allow you to process >>>> multiple KafkaDirectStream in par

Re: Kafka - streaming from multiple topics

2015-12-17 Thread Cody Koeninger
>>> The following spark configuration property will allow you to process >>> multiple KafkaDirectStream in parallel: >>> --conf spark.streaming.concurrentJobs= >>> >>> >>> >>> >>&g

Re: Kafka - streaming from multiple topics

2015-12-17 Thread Jean-Pierre OCALAN
parallel: >> --conf spark.streaming.concurrentJobs= >> >> >> >> >> >> -- >> View this message in context: >> http://apache-spark-user-list.1001560.n3.nabble.com/Kafka-streaming-from-multiple-topics-tp8678p25723.html >> Sent from the

Re: Kafka - streaming from multiple topics

2015-12-17 Thread Cody Koeninger
> > > > > > -- > View this message in context: > http://apache-spark-user-list.1001560.n3.nabble.com/Kafka-streaming-from-multiple-topics-tp8678p25723.html > Sent from the Apache Spark User List mailing list archive at Nabble.com. > > -

Re: Kafka - streaming from multiple topics

2015-12-16 Thread jpocalan
-streaming-from-multiple-topics-tp8678p25723.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h

Re: Kafka - streaming from multiple topics

2015-12-03 Thread Cody Koeninger
t;> On Wed, Dec 2, 2015 at 2:13 PM, dutrow wrote: >> >>> I found the JIRA ticket: >>> https://issues.apache.org/jira/browse/SPARK-2388 >>> >>> It was marked as invalid. >>> >>>

Re: Kafka - streaming from multiple topics

2015-12-03 Thread Dan Dutrow
2015 at 2:13 PM, dutrow wrote: > >> I found the JIRA ticket: https://issues.apache.org/jira/browse/SPARK-2388 >> >> It was marked as invalid. >> >> >> >> -- >> View this message in context: >> http://apache-spark-user-list.1001560.n3.nabbl

Re: Kafka - streaming from multiple topics

2015-12-02 Thread dutrow
://kafka.apache.org/documentation.html#compaction I don't see a way around this without hacking KafkaStream and ConsumerIterator to return the topic name instead of the message key. -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Kafka-streaming-from-multiple-t

Re: Kafka - streaming from multiple topics

2015-12-02 Thread Dan Dutrow
> >> It was marked as invalid. >> >> >> >> -- >> View this message in context: >> http://apache-spark-user-list.1001560.n3.nabble.com/Kafka-streaming-from-multiple-topics-tp8678p25550.html

Re: Kafka - streaming from multiple topics

2015-12-02 Thread Cody Koeninger
s invalid. > > > > -- > View this message in context: > http://apache-spark-user-list.1001560.n3.nabble.com/Kafka-streaming-from-multiple-topics-tp8678p25550.html > Sent from the Apache Spark User List mail

Re: Kafka - streaming from multiple topics

2015-12-02 Thread dutrow
I found the JIRA ticket: https://issues.apache.org/jira/browse/SPARK-2388 It was marked as invalid. -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Kafka-streaming-from-multiple-topics-tp8678p25550.html Sent from the Apache Spark User List mailing list

Re: Kafka - streaming from multiple topics

2015-12-02 Thread dutrow
this issue? -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Kafka-streaming-from-multiple-topics-tp8678p25549.html Sent from the Apache Spark User List mailing list archive at Nabble.com. ---

Re: Kafka - streaming from multiple topics

2014-08-13 Thread maddenpj
Can you link to the JIRA issue? I'm having to work around this bug and it would be nice to monitor the JIRA so I can change my code when it's fixed. -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Kafka-streaming-from-multiple-topics-tp8678p12053

Re: Kafka - streaming from multiple topics

2014-07-07 Thread Sergey Malov
ursday, July 3, 2014 at 9:41 PM To: "user@spark.apache.org<mailto:user@spark.apache.org>" mailto:user@spark.apache.org>> Subject: Re: Kafka - streaming from multiple topics Sergey, On Fri, Jul 4, 2014 at 1:06 AM, Sergey Malov mailto:sma...@collective.com>> wrote: On t

Re: Kafka - streaming from multiple topics

2014-07-03 Thread Tobias Pfeiffer
Sergey, On Fri, Jul 4, 2014 at 1:06 AM, Sergey Malov wrote: > > On the other hand, under the hood KafkaInputDStream which is create with > this KafkaUtils call, calls ConsumerConnector.createMessageStream which > returns a Map[String, List[KafkaStream] keyed by topic. It is, however, not > expo

Re: Kafka - streaming from multiple topics

2014-07-03 Thread Sergey Malov
...@preferred.jp>> Reply-To: "user@spark.apache.org<mailto:user@spark.apache.org>" mailto:user@spark.apache.org>> Date: Wednesday, July 2, 2014 at 9:54 PM To: "user@spark.apache.org<mailto:user@spark.apache.org>" mailto:user@spark.apache.org>> S

Re: Kafka - streaming from multiple topics

2014-07-02 Thread Tobias Pfeiffer
Sergey, you might actually consider using two streams, like * val stream1 = KafkaUtils.createStream(ssc,"localhost:2181","logs", Map("retarget" -> 2))* * val stream2 = KafkaUtils.createStream(ssc,"localhost:2181","logs", Map("datapair" -> 2))* to achieve what you want. This has the addit

Kafka - streaming from multiple topics

2014-07-02 Thread Sergey Malov
HI, I would like to set up streaming from Kafka cluster, reading multiple topics and then processing each of the differently. So, I’d create a stream val stream = KafkaUtils.createStream(ssc,"localhost:2181","logs", Map("retarget" -> 2,"datapair" -> 2)) And then based on whether it’s “ret