I'm trying to debug low performance on an NVMe-based Ceph cluster.
I have 24 NVMe drives across 4 servers, plenty of CPU, and the cluster is
perfectly balanced; no scrubbing or replication traffic at the moment,
1024 PGs for 24 OSDs / 17 TB.
I expect to see decent performance (~250k IOPS total, 50% read/write, a few
hundred volumes capped by IOPS, pre-warmed). I see about half of that.
I looked at drive utilization: it's about 70% (per atop), but I noticed
that the in-flight count for the drives is basically around 1. That means
that at any given time only one request is being processed. This matches
the OSD count / 3 / latency formula, and with one request in flight each
NVMe delivers about 10% of its spec (Intel, DC grade).
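To make the back-of-the-envelope math explicit, here is a minimal sketch of the latency-bound model I mean (the function name and the example numbers are illustrative, not measured values):

```python
# Rough latency-bound IOPS model: with an effective queue depth of ~1 per
# OSD, each 3x-replicated write occupies `replicas` OSDs for one latency
# period, so total client IOPS is capped at num_osds / (replicas * latency).
def latency_bound_iops(num_osds: int, replicas: int, latency_s: float) -> float:
    return num_osds / (replicas * latency_s)

# e.g. 24 OSDs, 3x replication, 200 us per-op latency (assumed, not measured):
print(latency_bound_iops(24, 3, 0.0002))
```

With numbers in that ballpark the ceiling lands well below the drives' rated throughput, which is consistent with what I observe.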
I looked at the OSDs (they are loaded uniformly, so any of them shows the
same results).
I see "op_wip": 10-27, but in-flight value is about 0-2, mostly around 1.
I can't get away from the feeling that the OSD is somehow processing
operations (almost) sequentially. I have already played with
osd_op_num_threads_per_shard (4), osd_op_num_shards (8), set the mclock
profile to high_client_ops, and ms_async_op_threads 24, but I can't get
more in-flight IOs.
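Inverting the same toy model gives a feel for the per-drive queue depth that the target would require (again illustrative numbers and a hypothetical helper, not a Ceph API):

```python
# Per-OSD queue depth needed to sustain target_iops under the toy model:
# target_iops * replicas ops/sec spread across num_osds OSDs, with each
# op occupying the drive for latency_s seconds.
def required_queue_depth(target_iops: float, replicas: int,
                         num_osds: int, latency_s: float) -> float:
    return target_iops * replicas * latency_s / num_osds

# e.g. 250k IOPS at 3x replication, 24 OSDs, 200 us latency (assumed):
print(required_queue_depth(250_000, 3, 24, 0.0002))
```

So to hit the target each drive would need several requests in flight concurrently, not the ~1 I'm seeing.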
I feel I'm missing something. How can I make Ceph send more requests to the
underlying NVMe drives?
_______________________________________________
ceph-users mailing list -- [email protected]
To unsubscribe send an email to [email protected]