Hi,

A while ago I reported various stability issues with the MDS; most of them have been addressed and fixes will be available in upcoming patch releases. However, there also seem to be problems on the client side, which I have not reported so far.

Note: This report is in part inspired by a previous mail to the list about CephFS deletion performance (http://lists.ceph.com/pipermail/ceph-users-ceph.com/2019-September/036842.html), but since I am not quite sure if we are actually talking about the very same issue, I decided to start a new thread.

I tried copying 70TB of data (mostly small files, e.g., about 1TB of Git repositories) using parallel rsync jobs. I did this first with fpsync, but after a while the client started locking up with ever-increasing load. No IO to or from the mount was possible any more and `sync` hung indefinitely. Even after force-unmounting the FS, kernel processes kept using 100% of about half my CPU cores. Remounting the FS was not possible until I forcefully rebooted the entire node.
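For context, the fpsync run was of this general shape (job count, per-job limits, rsync options and paths here are illustrative placeholders, not my exact values):

    fpsync -n 16 -f 2000 -s 8G -o "-a --hard-links" /data/source/ /mnt/cephfs/target/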

I then tried parsyncfp, which is more considerate regarding load, and I was able to sync the whole tree without issues after setting `vm.dirty_background_bytes` and `vm.dirty_bytes` via `sysctl` to 1GB and 4GB (the defaults of 10% and 20% of total RAM are way too much for a machine with 128GB of memory and write-heavy workloads). Right now, I am running another single rsync pass, since the parallel versions cannot do `--delete`. To ensure this one doesn't lock up my system either, I use the same sysctl settings and periodically run `sync` in the background; a rough sketch of the setup is below. So far the job has been running for a day with an average 15-minute load of 2.5 on a 32-thread machine.
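Concretely, the workaround amounts to something along these lines (the paths and the 60s flush interval are placeholders; the dirty limits are the 1GB/4GB values mentioned above):

    # 1 GB background threshold, 4 GB hard limit for dirty pages
    sysctl -w vm.dirty_background_bytes=1073741824
    sysctl -w vm.dirty_bytes=4294967296

    # flush dirty pages periodically so writeback never piles up
    while true; do sync; sleep 60; done &

    # final single-threaded cleanup pass; the parallel wrappers cannot do --delete
    rsync -a --delete /data/source/ /mnt/cephfs/target/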

I am not entirely sure whether this is a general kernel bug or a cephfs bug. It may be possible to provoke similar issues with other kernel-space remote file systems such as NFS (I have seen that in the past), but in my experience it is much more of an issue with the cephfs kernel driver.

I am using Nautilus 14.2.3 and a single MDS with tuned recall and cache-trimming settings to avoid the cache inflation issues caused by the housekeeping thread not being able to keep up (fixed in upcoming releases). Switching to multiple MDSs does not seem to have any impact on the problem.
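For completeness, the recall/trim tuning I am referring to is done via settings of this kind (the values shown are only placeholders to illustrate the knobs, not the ones I actually run with):

    ceph config set mds mds_recall_max_caps 10000
    ceph config set mds mds_recall_max_decay_rate 1.0
    ceph config set mds mds_cache_trim_threshold 131072
    ceph config set mds mds_cache_trim_decay_rate 0.5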

Cheers
Janek