Re: osd suicide timeout

2015-07-13 Thread Gregory Farnum
On Fri, Jul 10, 2015 at 10:45 PM, Deneau, Tom tom.den...@amd.com wrote: I have an osd log file from an osd that hit a suicide timeout (with the previous 1 events logged). (On this node I have also seen this suicide timeout happen once before and also a sync_entry timeout. I can see

RE: osd suicide timeout

2015-07-13 Thread Deneau, Tom
To: Deneau, Tom Cc: ceph-devel Subject: Re: osd suicide timeout On Fri, Jul 10, 2015 at 10:45 PM, Deneau, Tom tom.den...@amd.com wrote: I have an osd log file from an osd that hit a suicide timeout (with the previous 1 events logged). (On this node I have also seen this suicide timeout

Re: osd suicide timeout

2015-07-13 Thread Gregory Farnum
[mailto:g...@gregs42.com] Sent: Monday, July 13, 2015 5:07 AM To: Deneau, Tom Cc: ceph-devel Subject: Re: osd suicide timeout On Fri, Jul 10, 2015 at 10:45 PM, Deneau, Tom tom.den...@amd.com wrote: I have an osd log file from an osd that hit a suicide timeout (with the previous 1 events

RE: osd suicide timeout

2015-07-13 Thread Deneau, Tom
- From: Gregory Farnum [mailto:g...@gregs42.com] Sent: Monday, July 13, 2015 11:45 AM To: Deneau, Tom Cc: ceph-devel Subject: Re: osd suicide timeout heartbeat_map reset_timeout 'OSD::osd_op_tp thread 0x3ff6eb0efd0' had suicide timed out after 150 So that's the OSD's op thread, which

Re: osd suicide timeout

2015-07-13 Thread huang jun
Subject: Re: osd suicide timeout heartbeat_map reset_timeout 'OSD::osd_op_tp thread 0x3ff6eb0efd0' had suicide timed out after 150 So that's the OSD's op thread, which is the one that does most of the work. You often see the FileStore::op_tp when it's the disk or filesystem breaking, but I do

osd suicide timeout

2015-07-10 Thread Deneau, Tom
I have an osd log file from an osd that hit a suicide timeout (with the previous 1 events logged). (On this node I have also seen this suicide timeout happen once before and also a sync_entry timeout. I can see that 6 minutes or so before that osd died, other osds on the same node were