Hi Sage,

Here is the requested dump file.

http://85.214.49.87/ceph/foo.txt.bz2

-martin


On 15.01.2012 06:52, Sage Weil wrote:
Hi Martin-

On Sat, 14 Jan 2012, Sage Weil wrote:
Hi Martin-

On Sat, 14 Jan 2012, Martin Mailand wrote:

Hi
one of four OSDs died during the update to v0.40 with an assertion:
os/FileStore.cc: 2438: FAILED assert(0 == "unexpected error")
Even after a complete shutdown of the cluster and a new start with all OSDs at
the same version, this OSD did not start.

The OSD log is attached.

It's trying to replay a transaction that appears to be invalid because the
.2 clone is smaller than it thinks.  Is this the first time the OSD
crashed, or did it crash once, and you cranked up logs and generated
this one?  If you have the previous log, that would be helpful... it
should have a similar transaction dump but a different stack trace.

I pushed a wip-osd-dump-journal branch to git that will make

        ceph-osd -i <whatever> --dump-journal > /tmp/foo.txt

dump the contents of your entire osd journal (sans data) to a text file.
Do you mind sending that along as well?  I'd like to see what is in the
journal _after_ the event that is failing (if anything).

Thanks!
sage



Also, are any of the 6 patches on top of 0.40 related to the filestore or
osd?

Thanks!
sage

--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

