Hi there

I'm currently evaluating Ceph and have started filling my cluster for the first time. After filling it to about 75%, it reported some OSDs as "near-full". After some investigation I found that the PGs are not distributed evenly across the OSDs.

My Setup:
* Two Hosts with 45 Disks each --> 90 OSDs
* Only one newly created pool with 4500 PGs and a replica size of 2 --> 9000 PG replicas across 90 OSDs, i.e. about 100 PGs per OSD

What I found was that one OSD had only 72 PGs while another had 123 [1]. That means that, if I did the math correctly, I can only fill the cluster to about 81% (100/123), because that's the point at which the fullest OSD is completely full [2].
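For reference, here is the back-of-the-envelope calculation behind that 81% figure. It assumes equal-sized OSDs and that every PG holds roughly the same amount of data:

    #!/usr/bin/env python
    # Usable capacity is limited by the fullest OSD: once it hits
    # 100%, the cluster cannot safely take more writes, even though
    # the average OSD is only (mean / max) full.
    num_pgs, replica_size, num_osds = 4500, 2, 90
    mean_pgs = num_pgs * replica_size / float(num_osds)   # 100.0
    max_pgs = 123  # fullest OSD as reported by my pgdump script [1]
    print("usable fraction: %.1f%%" % (100.0 * mean_pgs / max_pgs))
    # -> usable fraction: 81.3%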

I did some experimenting and found that if I add another pool with 4500 PGs, each OSD ends up with exactly double the number of PGs it had with one pool, so the placement is deterministic and this is not an accident (I tried it multiple times). On another test cluster with 4 hosts and 15 disks each, the distribution was similarly bad. I also tried this on a different cluster and got very similar results.
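To get a feeling for whether such a spread is simply what uniform hashing produces, I compared against a toy model. To be clear, the sketch below is not CRUSH or rjenkins; it just picks one OSD per host uniformly at random (matching my two-host, replica-2 layout), so it only shows the statistical baseline:

    #!/usr/bin/env python
    # Toy baseline: place 4500 PGs with one replica on each of two
    # hosts (45 OSDs per host), choosing OSDs uniformly at random.
    # NOT CRUSH -- just a statistical reference point.
    import random
    counts = [0] * 90
    for pg in range(4500):
        counts[random.randrange(45)] += 1       # host 1: OSDs 0-44
        counts[45 + random.randrange(45)] += 1  # host 2: OSDs 45-89
    print("min %d / max %d / mean %.1f"
          % (min(counts), max(counts), sum(counts) / 90.0))
    # Typical run: min around 75, max around 125 -- a spread
    # comparable to the 72..123 I am seeing.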

To me it looks like the rjenkins algorithm is not distributing the PGs as evenly as it, in my opinion, should.
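For completeness, the spread an ideal uniform hash would produce can also be estimated analytically. This is a plain binomial model, an assumption on my part rather than anything CRUSH guarantees:

    #!/usr/bin/env python
    # Binomial model: each of a host's 4500 PG replicas lands on any
    # given one of its 45 OSDs with probability p = 1/45.
    import math
    n, p = 4500, 1.0 / 45
    mean = n * p                          # 100.0
    sigma = math.sqrt(n * p * (1 - p))    # ~9.9
    print("mean %.0f, stddev %.1f" % (mean, sigma))
    # Across 90 OSDs, extremes of roughly mean +/- 2.5 sigma
    # (about 75..125 PGs) would not be unusual.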

Am I doing anything wrong?
Is this behaviour to be expected?
Can I do something about it?

Thank you very much in advance
Niklas

P.S.: I did ask on ceph-users before:
http://comments.gmane.org/gmane.comp.file-systems.ceph.user/4317
http://comments.gmane.org/gmane.comp.file-systems.ceph.user/4496

[1] I built a small script that parses pgdump and prints the number of PGs on each OSD: http://pastebin.com/5ZVqhy5M
[2] I know I should not fill my cluster completely, but I'm talking about theory here, and adding a safety margin only makes the problem worse.