Hi Dan,
I don't have a solution to the problem; I can only second that we've
also been seeing strange problems when more than one node accesses the
same file in CephFS and at least one of them opens it for writing. I've
tried verbose logging on the client (fuse), and it seems that the fuse
client sometimes sends a cap request to the MDS and does not get a
response. It also looks like there is some 5-second polling interval
that sometimes (but not always) saves the day, so the client continues
after a roughly 5-second delay. This does not happen when multiple
processes open the file for reading, but it does when processes open it
for writing (even if they never write to the file and only read
afterwards). I have some earlier mailing list messages from a week or
two ago describing what we see in more detail (including log outputs).
I think the issue has something to do with cap requests being lost or
miscommunicated between the client and the MDS.
Andras
On 04/13/2017 01:41 PM, Dan van der Ster wrote:
Dear ceph-*,
A couple weeks ago I wrote this simple tool to measure the round-trip
latency of a shared filesystem.
https://github.com/dvanders/fsping
In our case, the tool is to be run from two clients who mount the same
CephFS.
First, start the server (a.k.a. the ping reflector) on one machine in
a CephFS directory:
./fsping --server
Then, from another client machine and in the same directory, start the
fsping client (aka the ping emitter):
./fsping --prefix <prefix from the server above>
The idea is that the "client" writes a syn file, the reflector notices
it, and writes an ack file. The time for the client to notice the ack
file is what I call the rtt.
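For anyone curious about the mechanism, the syn/ack round trip can be
sketched roughly like this. This is a minimal stand-in, not the actual
fsping code: a thread plays the reflector (in real use it is a separate
process on a second client mounting the same filesystem), a temporary
directory stands in for the CephFS mount, and the fixed "syn"/"ack"
filenames are my own choice (fsping uses a --prefix):

```python
import os
import tempfile
import threading
import time

def reflector(dirpath, poll=0.001, timeout=10.0):
    """Poll for the syn file, then answer by writing the ack file.

    In real use this runs on a second client mounting the same shared
    filesystem; here a thread stands in for it.
    """
    syn = os.path.join(dirpath, "syn")
    ack = os.path.join(dirpath, "ack")
    deadline = time.monotonic() + timeout
    while not os.path.exists(syn):
        if time.monotonic() > deadline:
            return  # give up if no syn ever arrives
        time.sleep(poll)
    with open(ack, "w") as f:
        f.write("ack")

def emit_once(dirpath, poll=0.001, timeout=10.0):
    """Write the syn file and time how long the ack takes to appear."""
    syn = os.path.join(dirpath, "syn")
    ack = os.path.join(dirpath, "ack")
    start = time.monotonic()
    with open(syn, "w") as f:
        f.write("syn")
    # The rtt is the time from writing syn to observing ack.
    while not os.path.exists(ack):
        if time.monotonic() - start > timeout:
            raise TimeoutError("no ack within timeout")
        time.sleep(poll)
    return time.monotonic() - start

if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as d:
        t = threading.Thread(target=reflector, args=(d,))
        t.start()
        rtt = emit_once(d)
        t.join()
        print("rtt: %.3f ms" % (rtt * 1000))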
And the output looks like normal ping, so that's neat. (The README.md
shows a working example)
Anyway, two weeks ago when I wrote this, it was working very well on
my CephFS clusters (running 10.2.5, IIRC). I was seeing ~20ms rtt for
small files, which is more or less what I was expecting on my test
cluster.
But when I run fsping today, it misbehaves in one of two ways:
1. Most of the time it just hangs, both on the reflector and on the
emitter. The fsping processes are stuck in some uninterruptible state
-- only an MDS failover breaks them out. I tried with and without
fuse_disable_pagecache -- no big difference.
2. When I increase the fsping --size to 512kB, it works a bit more
reliably. But there is a weird bimodal distribution with most
"packets" having 20-30ms rtt, some ~20% having ~5-6 seconds rtt, and
some ~5% taking ~10-11s. I suspected the mds_tick_interval -- but
decreasing that didn't help.
In summary, if someone is curious, please give this tool a try on your
CephFS cluster -- let me know if it's working or not (and what rtt you
can achieve with which configuration).
And perhaps a dev could explain why it is not working with the latest
jewel ceph-fuse / ceph MDSes?
Best Regards,
Dan
_______________________________________________
ceph-users mailing list
ceph-users@lists.ceph.com
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com