Thanks Enrico,

We are only syncing metadata between sites, so I don't think that bug will be 
the cause of our issues.

I have been able to delete ~30k objects without causing the RGW to stop 
processing.


Thanks
Iain
________________________________
From: Enrico Bocchi <enrico.boc...@cern.ch>
Sent: 22 May 2024 13:48
To: Iain Stott <iain.st...@thg.com>; ceph-users@ceph.io <ceph-users@ceph.io>
Subject: Re: [ceph-users] Reef RGWs stop processing requests

CAUTION: This email originates from outside THG

Hi Iain,

Can you check if it relates to this? --
https://tracker.ceph.com/issues/63373<https://tracker.ceph.com/issues/63373>
There is a bug when bulk deleting objects, causing the RGWs to deadlock.

Cheers,
Enrico


On 5/17/24 11:24, Iain Stott wrote:
> Hi,
>
> We are running 3 clusters in multisite. All 3 were running Quincy 17.2.6 and 
> using cephadm. We upgraded one of the secondary sites to Reef 18.2.1 a couple 
> of weeks ago and were planning on doing the rest shortly afterwards.
>
> We run 3 RGW daemons on separate physical hosts behind an external HAProxy HA 
> pair for each cluster.
>
> Since we upgrade to Reef we have had issues with the RGWs stopping processing 
> requests. We can see that they don't crash as they still have entries in the 
> logs about syncing, but as far as request processing goes, they just stop. 
> While debugging this we have 1 of the 3 RGWs running a Quincy image, and this 
> has never had an issue where it stops processing requests. Any Reef 
> containers we deploy have always stopped within 48Hrs of being deployed. We 
> have tried Reef versions 18.2.1, 18.2.2 and 18.1.3 and all exhibit the same 
> issue. We are running podman 4.6.1 on Centos 8 with kernel 
> 4.18.0-513.24.1.el8_9.x86_64.
>
> We have enabled debug logs for the RGWs but we have been unable to find 
> anything in them that would shed light on the cause.
>
> We are just wondering if anyone had any ideas on what could be causing this 
> or how to debug it further?
>
> Thanks
> Iain
>
> Iain Stott
> OpenStack Engineer
> iain.st...@thg.com
> [THG Ingenuity Logo]<https://www.thg.com<https://www.thg.com>>
> www.thg.com<http://www.thg.com><https://www.thg.com/<https://www.thg.com/>>
> [LinkedIn]<https://www.linkedin.com/company/thgplc/?originalSubdomain=uk<https://www.linkedin.com/company/thgplc/?originalSubdomain=uk>>
>  [Instagram] <https://www.instagram.com/thg<https://www.instagram.com/thg>> 
> [X] <https://twitter.com/thgplc?lang=en<https://twitter.com/thgplc?lang=en>>
> _______________________________________________
> ceph-users mailing list -- ceph-users@ceph.io
> To unsubscribe send an email to ceph-users-le...@ceph.io

--
Enrico Bocchi
CERN European Laboratory for Particle Physics
IT - Storage & Data Management - General Storage Services
Mailbox: G20500 - Office: 31-2-010
1211 Genève 23
Switzerland
_______________________________________________
ceph-users mailing list -- ceph-users@ceph.io
To unsubscribe send an email to ceph-users-le...@ceph.io

Reply via email to