[ceph-users] Re: Mgr stability

2019-08-15 Thread Reed Dier
I had already disabled prometheus plugin (again, only using for the rbd stats), but will also remove the rbd pool from the rbd_support module, as well as disable the rbd_support module. It seems slightly more stable so far, but still not rock solid as it was before. Thanks, Reed > On Aug 15,

[ceph-users] Re: Mgr stability

2019-08-15 Thread Mykola Golub
On Wed, Aug 14, 2019 at 12:12:36PM -0500, Reed Dier wrote: > My main metrics source is the influx plugin, but I enabled the > prometheus plugin to get access to the per-rbd image metrics. I may > disable prometheus and see if that yields better stability, until > possibly the influx plugin gets

[ceph-users] Re: Mgr stability

2019-08-14 Thread Reed Dier
Thanks for that insight. My main metrics source is the influx plugin, but I enabled the prometheus plugin to get access to the per-rbd image metrics. I may disable prometheus and see if that yields better stability, until possibly the influx plugin gets updated to support those metric exports.

[ceph-users] Re: Mgr stability

2019-08-14 Thread shubjero
I'm having a similar issue with ceph-mgr stability problems since upgrading from 13.2.5 to 13.2.6. I have isolated the crashing to the prometheus module being enabled and notice much better stability when the prometheus module is NOT enabled. No more failovers, however I do notice that even with