Re: [ceph-users] cephfs/ceph-fuse: mds0: Client XXX:XXXfailingtorespondto capability release

Burkhard Linke Wed, 14 Sep 2016 07:07:03 -0700

Hi,

My cluster is back to HEALTH_OK, the involved host has been restartedby the user. But I will debug some more on the host when i see thisissue again next time.
PS: For completeness, i've stated that this issue was often seen in mycurrent Jewel environment, I meant to say that this issue comes upsometimes (so not so often). But the times when i *do* have thisissue, it blocks some I/O for clients as a consequence.

That's why I assume that the root cause might be a bug in ceph-fuse.There's support for page cache in ceph-fuse (not sure whether it isactive by default), and afaik it has to keep the capabilities around aslong as the corresponding file is still in the cache. If another clientswants to access the file, the mds might need to revoke the capabilitesfor cached files (e.g. if one client wants to overwrite a file that hasbeen read by another client before). The client has to wait until it isable to acquire the capabilities, resulting in blocked I/O.

We had similar problems in the past with ceph-fuse, especially if pagecache support was active. We have switched to kernel based cephfs in themeantime (with it's own pro and cons).


Regards,
Burkhard
_______________________________________________
ceph-users mailing list
ceph-users@lists.ceph.com
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

Re: [ceph-users] cephfs/ceph-fuse: mds0: Client XXX:XXXfailingtorespondto capability release

Reply via email to