[ceph-users] Re: HeartbeatMap FAILED assert(0 == "hit suicide timeout")

2019-10-11 Thread 潘东元
I‘m pretty sure, this issue here is that there is a communication issue between the osds. logged over and over again report initiating reconnect. I looked at my network,and have dropped packets,this is probably the tcp queue full at osd daemon listen port. My cluster had 21 nodes, with 5 osds on

[ceph-users] Re: HeartbeatMap FAILED assert(0 == "hit suicide timeout")

2019-10-10 Thread Janne Johansson
Den tors 10 okt. 2019 kl 15:12 skrev 潘东元 : > hi all, > my osd hit suicide timeout. > > common/HeartbeatMap.cc: 79: FAILED assert(0 == "hit suicide timeout") > > ceph version 0.80.7 (6c0127fcb58008793d3c8b62d925bc91963672a3) > > can you give some advice on troubleshooting? > It is a very