Re: How to set Timeout for KafkaConsumer.poll()

Jason Gustafson Thu, 28 Jan 2016 13:30:13 -0800

Hey Yifan,

As far as how the consumer works internally, there's not a big difference
between using a long timeout or a short timeout. Which you choose really
depends on the needs of your application. Typically people use a short
timeout in order to be able to break from the loop with a boolean flag, but
you might also do so if you have some periodic task to execute. I usually
prefer using a long timeout and breaking from the loop using the wakeup()
API.


-Jason

On Thu, Jan 28, 2016 at 1:18 PM, Yifan Ying <[email protected]> wrote:

> Hi All,
>
> I was using the new Kafka Consumer to fetch messages in this way:
>
> while (true) {
>     ConsumerRecords<Object, T> records =
> kafkaConsumer.poll(Long.MAX_VALUE);
>     // do nothing if records are empty
>     ....
> }
>
> Then I realized that blocking until new messages fetched might be a little
> overhead. So I looked into the KafkaConsumer code to figure out get a
> reasonable timeout.
>
> do {
>     Map<TopicPartition, List<ConsumerRecord<K, V>>> records =
> pollOnce(remaining);
>     if (!records.isEmpty()) {
>         // if data is available, then return it, but first send off the
>         // next round of fetches to enable pipelining while the user is
>         // handling the fetched records.
>         fetcher.initFetches(metadata.fetch());
>         client.poll(0);
>         return new ConsumerRecords<>(records);
>     }
>
>     long elapsed = time.milliseconds() - start;
>     remaining = timeout - elapsed;
> } while (remaining > 0);
>
> It seems that even if I set a much lower timeout, like 1000ms, my code will
> still keep fetching messages, as I use while(true) and the code won't do
> anything with an empty message set. So the only difference between a high
> timeout and a low one is that the code is looping in the while loop I wrote
> or the one in poll(). But in terms of connections to Kafka, setting a low
> or high timeout won't affect much in my case.
>
> I might misunderstand the code completely. Anyone is able to shed some
> light on this topic?
>
> Thanks.
>
> --
> Yifan
>

Re: How to set Timeout for KafkaConsumer.poll()

Reply via email to