I had already disabled prometheus plugin (again, only using for the rbd stats),
but will also remove the rbd pool from the rbd_support module, as well as
disable the rbd_support module.
It seems slightly more stable so far, but still not rock solid as it was before.
Thanks,
Reed
> On Aug 15,
On Wed, Aug 14, 2019 at 12:12:36PM -0500, Reed Dier wrote:
> My main metrics source is the influx plugin, but I enabled the
> prometheus plugin to get access to the per-rbd image metrics. I may
> disable prometheus and see if that yields better stability, until
> possibly the influx plugin gets
Thanks for that insight.
My main metrics source is the influx plugin, but I enabled the prometheus
plugin to get access to the per-rbd image metrics.
I may disable prometheus and see if that yields better stability, until
possibly the influx plugin gets updated to support those metric exports.
I'm having a similar issue with ceph-mgr stability problems since
upgrading from 13.2.5 to 13.2.6. I have isolated the crashing to the
prometheus module being enabled and notice much better stability when
the prometheus module is NOT enabled. No more failovers, however I do
notice that even with