Hello Josh,
just wanted to confirm that setting bluefs_buffered_io immediately
helped hotfix the problem. I've also updated to 14.2.22 and we'll
discuss adding more NVME modules to move OSD databases out of spinners
to prevent further occurances
thanks a lot for your time!
with best regards
Hello Josh,
>
> Was there PG movement (backfill) happening in this cluster recently?
> Do the OSDs have stray PGs (e.g. 'ceph daemon osd.NN perf dump | grep
> numpg_stray' - run this against an affected OSD from the housing
> node)?
yes, some nodes have stray pgs (1..5) shell I do something
Hello Eugen,
thank you for you reply. Yes, restarting all OSDs, monitors, also
increasing osd_map_cache_size to 5000 (this helped us in case
of problem with not pruning OSD maps). none of this helped..
with best regards
nik
On Wed, Nov 03, 2021 at 11:41:28AM +, Eugen Block wrote:
> Hi,
>
Hi,
I don't have an explanation but I remember having a similar issue a
year ago or so. IIRC a simple OSD restart fixed that, so I never got
to the bottom of it. Have you tried to restart OSD daemons?
Zitat von Nikola Ciprich :
Hello fellow ceph users,
I'm trying to catch ghost here..