Robin,
Thank you very much for your quick response. It was really helpful.
The issue has been successfully resolved.
Root cause:
Podman (by default) allows 2048 threads per container, while (for some
reason) Ceph had a thread pool size of 8K configured for the RGWs. Changing
this parameter brought the cluster back to a normal state.
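
For anyone hitting the same thing, here is a minimal sketch of the sanity
check that would have caught the mismatch. It assumes cgroup v2, where the
container's task limit is readable at /sys/fs/cgroup/pids.max; the helper
names are hypothetical, and the thread pool size has to be taken from your
own RGW configuration (presumably the rgw_thread_pool_size option).

```python
from pathlib import Path
from typing import Optional


def read_pids_max(path: str = "/sys/fs/cgroup/pids.max") -> Optional[int]:
    """Return the cgroup v2 task limit, or None if unlimited or unreadable.

    The default path is an assumption (cgroup v2, read from inside the
    container); adjust it for your environment.
    """
    try:
        raw = Path(path).read_text().strip()
    except OSError:
        return None
    # cgroup v2 writes the literal string "max" when there is no limit.
    return None if raw == "max" else int(raw)


def pool_fits(thread_pool_size: int, pids_limit: Optional[int]) -> bool:
    """True if the thread pool alone stays below the task limit."""
    return pids_limit is None or thread_pool_size < pids_limit


if __name__ == "__main__":
    limit = read_pids_max()
    # 8192 stands in for the 8K pool size mentioned above.
    print("pids limit:", limit, "| 8K pool fits:", pool_fits(8192, limit))
```

With the numbers from this incident, pool_fits(8192, 2048) is False, so the
pool could never be fully populated inside the container.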
On Sat, Feb 10, 2024 at 10:05:02AM -0500, Vladimir Sigunov wrote:
> Hello Community!
> I would appreciate any help/suggestions with the massive RGW outage we are
> facing.
> The cluster's overall status is acceptable (HEALTH_WARN because of some pgs
> not scrubbed in time), and the cluster is