https://bugzilla.wikimedia.org/show_bug.cgi?id=72550

christ...@quelltextlich.at changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
            Summary|analytics1021 getting kiked |analytics1021 getting
                   |out of kafka partition      |kicked out of kafka
                   |leader role on 2014-10-27   |partition leader role on
                   |~07:12                      |2014-10-27 ~07:12

--- Comment #2 from christ...@quelltextlich.at ---
This bug is still missing the numbers of lost messages when
analytics1021 lost it's partition leader role.

For the text cluster, it only affected
  amssq34
  amssq53.esams.wikimedia.org
  amssq56.esams.wikimedia.org
  cp4008.ulsfo.wmnet
. The affected period was 2014-10-27T07:12:29/2014-10-27T07:12:32, and
in total 100 messages got lost, which is <<1 second worth of data for
text.

For the upload cluster, it affected all caches in that clustel except
for cp4015 .
The affected period was 2014-10-27T07:12:29/2014-10-27T07:12:46, and
in total ~51K messages got lost, which is <2 second worth of data for
upload.

When analytics1021 lost its partition leader role, bits, mobile, and
text already had the ACK fix. upload hadn't. So seeing the lost
messages on upload is expected.

It is also expected to see no loss on bits, and mobile.

However, I had expected to see no loss on text, as it already had the
ACK fix. It's strange to see exactly 100 lost messages on text.
100 is a suspiciously nice number.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to