Since I reused the same consumer group to consume the messages after step 6, no data loss occurred. But if I create a new consumer group, the new consumer will certainly suffer data loss.
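Whether a brand-new consumer group loses data here also depends on the old high-level consumer's auto.offset.reset setting, since a new group has no committed offset to resume from. A minimal sketch of the relevant 0.8.x consumer properties (the group name is made up for illustration):

# consumer.properties (0.8.x high-level consumer)
zookeeper.connect=localhost:2181
# A new group.id has no committed offset in ZooKeeper, so the consumer
# falls back to auto.offset.reset:
#   smallest -> start from the earliest offset still on the broker
#               (reads the 1000 messages that survived truncation)
#   largest  -> start from the log end (skips everything already written)
group.id=new-benchmark-group
auto.offset.reset=smallest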
I am more concerned about whether it is acceptable behavior for Kafka that an out-of-sync broker can be elected as the leader for a partition. Is there any mechanism built into Kafka to ensure that only an in-sync broker can be chosen as leader? If not, what is the best practice for restarting brokers when some of the replicas are out of sync? (A config sketch addressing this follows at the end of the thread.)

On Tue, Mar 3, 2015 at 2:35 AM, Jiangjie Qin <j...@linkedin.com.invalid> wrote:

> In this case you have data loss. In step 6, when broker 1 comes up, it
> becomes the leader and has log end offset 1000. When broker 0 comes up, it
> becomes follower and will truncate its log to 1000, i.e. 1000 messages
> were lost. Next time the consumer starts, its offset will be reset to
> either the smallest or the largest depending on the setting.
>
> Jiangjie (Becket) Qin
>
> On 3/2/15, 9:32 AM, "Stuart Reynolds" <s...@stureynolds.com> wrote:
>
> >Each topic has earliest and latest offsets (per partition).
> >Each consumer group has a current offset (per topic, partition pair).
> >
> >I see -1 for the current offsets of new consumer groups that haven't yet
> >committed an offset. I think it means that the offsets for that
> >consumer group are undefined.
> >
> >Is it possible you generated new consumer groups when you restarted your
> >broker?
> >
> >On Mon, Mar 2, 2015 at 3:15 AM, tao xiao <xiaotao...@gmail.com> wrote:
> >> Hi team,
> >>
> >> I have 2 brokers (0 and 1) serving a topic mm-benchmark-test. I ran some
> >> tests on the two brokers to verify how the leader gets elected. Here are
> >> the steps:
> >>
> >> 1. Started both brokers.
> >> 2. Created a topic with partitions=1 and replication-factor=2. Broker 1
> >> was elected as leader.
> >> 3. Sent 1000 messages to the topic and consumed them with a high-level
> >> consumer using ZooKeeper as the offset storage.
> >> 4. Shut down broker 1; broker 0 was elected as leader.
> >> 5. Sent another 1000 messages to the topic and consumed again.
> >> 6. Completely shut down broker 0 and then started broker 1. Broker 1
> >> became the leader again.
> >> 7. Started broker 0 and ran ConsumerOffsetChecker, which showed negative
> >> lag (-1000 in my case).
> >>
> >> I think this is because the consumed offset stored in ZooKeeper was 2000
> >> while the log size retrieved from the leader (broker 1), which was
> >> missing the 1000 messages from step 5, was only 1000; hence the reported
> >> lag was 1000 - 2000 = -1000.
> >>
> >> Is this a bug or expected behavior?
> >>
> >> --
> >> Regards,
> >> Tao

--
Regards,
Tao
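Regarding the question at the top of this thread: Kafka does have a switch for this. Unclean leader election can be disabled so that only replicas still in the ISR are eligible to become leader; the trade-off is availability, since the partition stays offline until an in-sync replica comes back. A minimal sketch of the broker settings, assuming Kafka 0.8.2 (where unclean election is enabled by default):

# server.properties (set on each broker)
# Only replicas in the ISR may be elected leader. With this set, in step 6
# the partition would stay offline until broker 0 returned, instead of
# electing the stale broker 1 and truncating broker 0's log.
unclean.leader.election.enable=false

# Move leadership off a broker cleanly before it stops.
controlled.shutdown.enable=true

As a restart practice, it also helps to confirm every replica is back in the ISR before taking down the next broker, e.g.:

bin/kafka-topics.sh --describe --zookeeper localhost:2181 --topic mm-benchmark-test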
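As for the negative lag in step 7: ConsumerOffsetChecker reports lag as the broker's log size minus the group's committed offset, which after the truncation gives 1000 - 2000 = -1000. A sketch of the invocation (the group name is a placeholder; --zkconnect is the 0.8.x flag):

# Lag = log size on the leader - committed consumer offset
bin/kafka-run-class.sh kafka.tools.ConsumerOffsetChecker \
  --zkconnect localhost:2181 \
  --group my-benchmark-group \
  --topic mm-benchmark-test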