Hi All, In the past my team was using burrow by linkedin for monitoring consumer group lag. We discovered it crashed a lot and we found ourselves constantly restarting it.
As a fix, a co-worker has created his own solution that I would like to share with you https://github.com/zalando-incubator/remora It effectively just exposes the output of kafka-consume-group --describe --group <group> over http. Hope this project is useful for you, all feedback is welcome :) Ian.