Hi all

We had a Kafka broker failure (too many open files, stupid), and now the 
partitions on that broker will no longer become part of the ISR set. It's been 
a few days (organizational issues), and we have significant amounts of data on 
the ISR partitions.

In order to make the partitions on the broker become part of the ISR set again, 
should I:

* increase `replica.lag.time.max.ms` on the broker to the number of ms that the 
partitions are behind. I can guesstimate the value to about 7 days, or should I 
measure it somehow?
* stop the broker and wipe files (which ones?) and then restart it. Should I 
also do stuff on zookeeper ?

Is there any _official_ information on how to deal with this situation?

Thanks for helping!

Reply via email to