Hi everyone,

I recently started using the new Kafka direct approach.

As far as I understand, each Kafka partition /is/ an RDD partition that
will be processed by a single core.
What I don't understand is the relationship between those partitions and the
blocks generated every blockInterval (spark.streaming.blockInterval).

For example, assume:

1000ms batch interval
16 topic partitions (total of 16 cores available)

In addition, blockInterval is set to 200ms.
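
To make the numbers concrete, here is a small sketch of the arithmetic as I
currently understand it. It assumes the receiver-based model described in the
docs, where each receiver cuts one block per blockInterval and each block
becomes a task; whether any of that carries over to the direct approach is
exactly what I am unsure about. The variable names are made up for
illustration and are not Spark settings.

```python
# Numbers taken from the example above; names are hypothetical.
batch_interval_ms = 1000
block_interval_ms = 200
topic_partitions = 16

# Receiver-based streams (assumption: one block per blockInterval per receiver):
# each receiver contributes this many blocks, and hence tasks, to every batch.
blocks_per_receiver_per_batch = batch_interval_ms // block_interval_ms

# Direct approach, as far as I understand it: one RDD partition per Kafka
# partition, independent of the block interval.
direct_rdd_partitions_per_batch = topic_partitions

print("blocks per receiver per batch:", blocks_per_receiver_per_batch)
print("RDD partitions per batch (direct):", direct_rdd_partitions_per_batch)
```

So with a receiver I would expect 1000 / 200 = 5 blocks per batch, but I do not
see where that division fits when the partitioning already comes from Kafka.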

What is actually being divided by the blockInterval value in such a scenario?
I'd like to tune this value, but I cannot work out what it controls here.

I hope I made myself clear,

thank you all! :)