Thank you! :)

2015-08-10 19:58 GMT+02:00 Cody Koeninger <c...@koeninger.org>:

> There's no long-running receiver pushing blocks of messages, so
> blockInterval isn't relevant.
>
> Batch interval is what matters.
>
> On Mon, Aug 10, 2015 at 12:52 PM, allonsy <luke1...@gmail.com> wrote:
>
>> Hi everyone,
>>
>> I recently started using the new Kafka direct approach.
>>
>> Now, as far as I understood, each Kafka partition /is/ an RDD partition
>> that
>> will be processed by a single core.
>> What I don't understand is the relation between those partitions and the
>> blocks generated every blockInterval.
>>
>> For example, assume:
>>
>> 1000ms batch interval
>> 16 topic partitions (total of 16 cores available)
>>
>> Moreover, we have that the blockInterval is set to 200ms.
>>
>> What am I actually dividing by the blockInterval value in such a scenario?
>> I'd like to tune this value but I cannot understand what it stands for.
>>
>> I hope I made myself clear,
>>
>> thank you all! :)
>>
>>
>>
>>
>> --
>> View this message in context:
>> http://apache-spark-user-list.1001560.n3.nabble.com/Kafka-direct-approach-blockInterval-and-topic-partitions-tp24197.html
>> Sent from the Apache Spark User List mailing list archive at Nabble.com.
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
>> For additional commands, e-mail: user-h...@spark.apache.org
>>
>>
>

Reply via email to