[ceph-users] Re: What is client request_load_avg? Troubleshooting MDS issues on Luminous

2022-08-22 Thread Chris Smart
On Mon, 2022-08-22 at 11:13 +, Frank Schilder wrote: > Hi Chris. > > > Interestingly, when duration gets long and performance gets bad ... > > This observation is likely due to MDS and client cache. My experience > with ceph's cache implementations is that, well, they seem not that > great.

[ceph-users] Re: What is client request_load_avg? Troubleshooting MDS issues on Luminous

2022-08-22 Thread Chris Smart
On Mon, 2022-08-22 at 16:42 +0200, Stefan Kooman wrote: > On 8/21/22 04:41, Chris Smart wrote: > > > OK, so basically sounds like I should stick with filestore, upgrade > > the > > cluster to Pacific to inherit the newer settings, then do the > > conversion to bluestore which will avoid manual

[ceph-users] Re: binary file cannot execute in cephfs directory

2022-08-22 Thread zxcs
In case someone is missing the picture, here is the text copied below: ld@***ceph dir**$ ls -lrth total 13M -rwxr-xr-x 1 ld ld 13M Nov 29 2021 cmake-3.22 lrwxrwxrwx 1 ld ld 10 Jul 26 10:03 cmake -> cmake-3.22 -rwxrwxr-x 1 ld ld 25 Aug 19 15:52 test.sh ld@***ceph dir**$ ./cmake-3.22 bash:

[ceph-users] binary file cannot execute in cephfs directory

2022-08-22 Thread zxcs
Hi experts, We are using CephFS 15.2.13. After mounting Ceph on one node, we copied a binary into the Ceph directory (see below; cmake-3.22 is a binary), but running `./cmake-3.22` reports permission denied. Why? The file has the “x” permission, and “ld” is the binary file's owner. Could anyone

[ceph-users] Re: Problem adding secondary realm to rados-gw

2022-08-22 Thread Matt Dunavant
Never mind. I found that the issue was that the zone still existed in my period from some old testing. Running 'radosgw-admin zonegroup remove --rgw-zonegroup=us --rgw-zone=chicago-data' from the original cluster, updating the period, then re-doing the steps to add a secondary zone on the 2nd cluster

[ceph-users] Re: Problem adding secondary realm to rados-gw

2022-08-22 Thread Matt Dunavant
Thanks, that got me past that issue. However I'm running into another in which it's not finding the zone after creating and then updating the period: #radosgw-admin period update --commit --realm-id=26e78bc5-714c-4993-a4bd-b07918bd223a 2022-08-22T13:45:56.195-0400 7f7183538c80 1 Cannot find

[ceph-users] Re: Problem adding secondary realm to rados-gw

2022-08-22 Thread Casey Bodley
On Mon, Aug 22, 2022 at 12:37 PM Matt Dunavant wrote: > > Hello, > > > I'm trying to add a secondary realm to my ceph cluster but I'm getting the > following error after running a 'radosgw-admin realm pull --rgw-realm=$REALM > --url=http://URL:80 --access-key=$KEY --secret=$SECRET': > > >

[ceph-users] Problem adding secondary realm to rados-gw

2022-08-22 Thread Matt Dunavant
Hello, I'm trying to add a secondary realm to my ceph cluster but I'm getting the following error after running a 'radosgw-admin realm pull --rgw-realm=$REALM --url=http://URL:80 --access-key=$KEY --secret=$SECRET': request failed: (5) Input/output error Nothing on google seems to help

[ceph-users] OSDs crush - Since Pacific

2022-08-22 Thread Wissem MIMOUNA
Dear All, After updating our Ceph cluster from Octopus to Pacific, we got a lot of slow_ops on many OSDs (which caused the cluster to become very slow). We did our investigation and searched the ceph-users list, and we found that rebuilding all OSDs can improve (or fix) the issue (we

[ceph-users] Re: rbd-mirror stops replaying journal on primary cluster

2022-08-22 Thread Eugen Block
Hi, IIRC the rbd mirror journals will grow if the sync stops working, which seems to be the case here. Does the primary cluster experience any high load when the replay stops? How is the connection between the two sites, and is the link saturated? Does the rbd-mirror log reveal anything

[ceph-users] Re: Ceph Octopus RGW 15.2.17 - files not available in rados while still in bucket index

2022-08-22 Thread Boris
Good morning Istvan, sadly no, it’s not fixed. I just have an idea of what might trigger the problem and how I can try to mitigate it. I still don’t know what these errors are and why they happen. I refuse to think that RGW “loses” data when OSDs become unstable. Have a good start in the