Re: [ceph-users] KPIs for Ceph/OSD client latency / deepscrub latency overhead

2018-07-12 Thread Paul Emmerich
2018-07-12 8:37 GMT+02:00 Marc Schöchlin:
>
> In a first step I would just like to have two simple KPIs which describe
> an average/aggregated write/read latency of these statistics.
>
> Are there tools/other functionalities which provide this in a simple way?
> It's one of the main KPI our manag

Re: [ceph-users] KPIs for Ceph/OSD client latency / deepscrub latency overhead

2018-07-11 Thread Marc Schöchlin
Hello Paul, thanks for your response/hints. I discovered the following tool in the ceph source repository:
https://github.com/ceph/ceph/blob/master/src/tools/histogram_dump.py
The tool provides output based on the statistics mentioned by you:
# ceph daemon osd.24 perf histogram dump|grep -P "op_.

Re: [ceph-users] KPIs for Ceph/OSD client latency / deepscrub latency overhead

2018-07-11 Thread Paul Emmerich
Hi, from experience: commit/apply_latency are not good metrics. The only good thing about them is that they are really easy to track, but we have found them to be almost completely useless in the real world. We track the op_*_latency metrics from perf dump and have found them to be very helpful, they
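
For illustration only, here is a minimal Python sketch of how an average read/write latency could be derived from two successive perf dump samples. It is not taken from the thread. It assumes the JSON from `ceph daemon osd.N perf dump` has an "osd" section in which op_r_latency and op_w_latency each carry a cumulative "avgcount" (number of ops) and "sum" (total latency, assumed to be in seconds). The helper name mean_latency_ms and the sample values are hypothetical.

```python
import json


def mean_latency_ms(prev, curr, counter):
    """Average latency in ms of the ops completed between two samples.

    prev/curr are parsed perf dump documents; counter is a key such as
    "op_r_latency". Returns None when no ops completed in the interval
    or the counters went backwards (for example after an OSD restart).
    """
    p = prev["osd"][counter]
    c = curr["osd"][counter]
    ops = c["avgcount"] - p["avgcount"]
    if ops <= 0:
        return None
    # "sum" is assumed to be cumulative seconds, so convert the delta to ms.
    return (c["sum"] - p["sum"]) * 1000.0 / ops


# Hypothetical samples, trimmed to the fields the sketch relies on.
prev = json.loads(
    '{"osd": {"op_r_latency": {"avgcount": 1000, "sum": 2.0},'
    ' "op_w_latency": {"avgcount": 500, "sum": 5.0}}}'
)
curr = json.loads(
    '{"osd": {"op_r_latency": {"avgcount": 1500, "sum": 3.0},'
    ' "op_w_latency": {"avgcount": 700, "sum": 8.0}}}'
)

read_ms = mean_latency_ms(prev, curr, "op_r_latency")
write_ms = mean_latency_ms(prev, curr, "op_w_latency")
print("read: %.1f ms, write: %.1f ms" % (read_ms, write_ms))
```

Working on the difference between two samples rather than on the raw counters gives the latency for the sampling interval instead of an average since the OSD started. Aggregating across OSDs would mean summing the deltas of "sum" and "avgcount" over all OSDs before dividing.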

[ceph-users] KPIs for Ceph/OSD client latency / deepscrub latency overhead

2018-07-11 Thread Marc Schöchlin
Hello ceph-users and ceph-devel list, we went into production with our shiny new Luminous (12.2.5) cluster. This cluster runs SSD- and HDD-based OSD pools. To ensure the service quality of the cluster and to have a baseline for client latency optimization (i.e. in the area of deepscrub optimization)