On 17-09-16 01:36 AM, Gregory Farnum wrote:
I got the chance to discuss this a bit with Patrick at the Open Source Summit Wednesday (good to see you!).

So the idea in the previously-referenced CDM talk essentially involves changing the way we distribute snap deletion instructions from a "deleted_snaps" member in the OSDMap to a "deleting_snaps" member that gets trimmed once the OSDs report to the manager that they've finished removing that snapid. This should entirely resolve the CPU burn they're seeing during OSDMap processing on the nodes, as it shrinks the intersection operation down from "all the snaps" to merely "the snaps not-done-deleting".
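
To make the intersection-cost argument concrete, here's a simplified sketch. This is my own illustration, not actual Ceph code: a std::set stands in for Ceph's interval_set<snapid_t>, and all the names are made up.

#include <cstdint>
#include <iostream>
#include <set>

using snapid_t = uint64_t;
using snap_set = std::set<snapid_t>;

// Today's per-epoch work: intersect the pool's full deleted_snaps set
// with the snaps a PG still holds -- the cost grows with "all snaps
// ever deleted", i.e. with the lifetime of the cluster.
snap_set snaps_to_trim(const snap_set& pool_deleted,
                       const snap_set& pg_snaps) {
  snap_set out;
  for (snapid_t s : pg_snaps)
    if (pool_deleted.count(s))
      out.insert(s);
  return out;
}

// With a "deleting_snaps" member, the manager drops a snap from the
// published set once every OSD reports it fully removed, so the set fed
// to snaps_to_trim() stays proportional to the in-flight deletions only.
void on_all_osds_reported_done(snap_set& pool_deleting, snapid_t s) {
  pool_deleting.erase(s);
}

int main() {
  snap_set deleting = {4, 7};   // only the snaps not-done-deleting
  snap_set pg_snaps = {1, 4, 9};
  for (snapid_t s : snaps_to_trim(deleting, pg_snaps))
    std::cout << "trim snap " << s << "\n";  // prints: trim snap 4
  on_all_osds_reported_done(deleting, 4);    // published set shrinks again
}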

The other reason we maintain the full set of deleted snaps is to prevent client operations from re-creating deleted snapshots: we filter all client IO that includes snaps against the deleted_snaps set in the PG. Apparently this is also big enough in RAM to be a real (but much smaller) problem.
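
Roughly what that filtering looks like, again as a made-up sketch rather than the real PG code (Ceph's actual types and signatures differ):

#include <algorithm>
#include <cstdint>
#include <set>
#include <vector>

using snapid_t = uint64_t;

// The client's view of the snapshot state, sent along with each write.
struct SnapContext {
  snapid_t seq = 0;             // newest snap the client knows about
  std::vector<snapid_t> snaps;  // snaps the write would clone into
};

// Drop any snapid the cluster has already deleted so the write cannot
// re-create it; this membership test is why the whole deleted_snaps set
// has to sit in RAM on the OSD today.
void filter_snapc(SnapContext& snapc, const std::set<snapid_t>& deleted) {
  snapc.snaps.erase(
      std::remove_if(snapc.snaps.begin(), snapc.snaps.end(),
                     [&](snapid_t s) { return deleted.count(s) != 0; }),
      snapc.snaps.end());
}

int main() {
  SnapContext snapc{10, {9, 6, 3}};
  filter_snapc(snapc, /*deleted=*/{6});
  // snapc.snaps is now {9, 3}: the write can no longer touch snap 6.
}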

Unfortunately eliminating that is a lot harder, and a permanent fix will involve changing the client protocol in ways nobody has quite figured out how to do. But Patrick did suggest storing the full set of deleted snaps on-disk and only keeping in-memory the set which covers snapids in the range we've actually *seen* from clients. I haven't gone through the code, but that seems perfectly feasible; the hard part will be working out the rules for when you have to go to disk to read a larger part of the deleted_snaps set.
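
A minimal sketch of how that in-memory window might work. Everything here (the class, the load_range_from_disk() helper, the fault-in-on-miss policy) is my own assumption, not anything Patrick or the code specifies:

#include <algorithm>
#include <cstdint>
#include <set>

using snapid_t = uint64_t;

class DeletedSnapsCache {
  std::set<snapid_t> cached;  // deleted snapids within [lo, hi] only
  snapid_t lo = 0, hi = 0;    // window of snapids the cache covers

  // Hypothetical stub: a real implementation would read the deleted
  // snapids in [from, to] from the PG's on-disk store into 'cached'.
  void load_range_from_disk(snapid_t from, snapid_t to) {
    (void)from; (void)to;
  }

public:
  // Membership test used when filtering client IO. Hits inside the
  // cached window are pure RAM; a snapid outside it forces a disk read
  // and widens the window -- deciding when (and how far) to widen is
  // exactly the hard part mentioned above.
  bool is_deleted(snapid_t s) {
    if (s < lo || s > hi) {
      snapid_t new_lo = std::min(lo, s);
      snapid_t new_hi = std::max(hi, s);
      load_range_from_disk(new_lo, new_hi);
      lo = new_lo;
      hi = new_hi;
    }
    return cached.count(s) != 0;
  }
};

int main() {
  DeletedSnapsCache cache;
  cache.is_deleted(42);  // outside the window: faults in from disk first
}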

PRs are of course welcome! ;)

There you go: https://github.com/ceph/ceph/pull/17493

We are hitting the limitations of the current implementation: we have over 9,000 removed-snap intervals, with the snap counter over 650,000. In our particular case, this shows up as a bad CPU usage spike every few minutes, and it's only going to get worse as we accumulate more snapshots over time. My PR halves that spike, and the change is small enough to be backported to both Jewel and Luminous without breaking too much at once. It's not a final solution, but it should make life a bit more tolerable until an actual, working solution is in place.

--
Piotr Dałek
piotr.da...@corp.ovh.com
https://www.ovh.com/us/
