Vladimir,

Also, my colleague sent this to me after my first email.  This is roughly the 
plan I intend to follow … if the LOG file gives us a direct pointer to a file 
corruption:

https://gist.github.com/gburd/b88aee6da7fee81dc036

However, your crash point suggests we might have to do a bit more work to 
isolate the bad input file.  But I would be happy to be wrong and the above 
work as is.

Matthew


On Jun 12, 2013, at 5:13 PM, Matthew Von-Maszewski <matth...@basho.com> wrote:

> Vladimir,
> 
> I asked around the Basho chat room and you have a crash that has never been 
> seen.  This should be interesting.
> 
> The crash is happening during a compaction, specifically during the creation 
> of the bloom filter for a new .sst file.  Maybe we can isolate the old file 
> that feeding this compaction and move it out of the way for further debugging 
> … and get you running while the debugging happens off-line.
> 
> Would you tar/zip the following files (changing the paths as appropriate for 
> your system):
> 
> tar -czf vladimir_LOGs.tgz /var/lib/riak/leveldb/*/LOG*
> and your app.config file.
> 
> I will see if I can determine where the bad input file resides and help you 
> get back running.  Then we can decide how to look deeper for root cause.
> 
> Matthew
> 
> 
> On Jun 12, 2013, at 4:02 PM, Vladimir Shabanov <vshaban...@gmail.com> wrote:
> 
>> Hello,
>> 
>> I have a cluster of 8 Riak-1.3.1 nodes. Recently one of my nodes silently 
>> crashed. Nothing unusual was reported in logs.
>> 
>> When I've tried to start my node again it worked for few seconds and 
>> silently crashed again. I've run 'riak console' and seen "Segmentation 
>> fault".
>> 
>> gdb with dumped core shows:
>> 
>> Program terminated with signal 11, Segmentation fault.
>> #0  0x00007f162547fa30 in MurmurHash64A(void const*, int, unsigned int) ()
>>    from /tank/riak-1.3.1/lib/eleveldb-1.3.0/priv/eleveldb.so
>> 
>> Backtrace shows that it happens somewhere in LevelDB compaction.
>> 
>> (gdb) bt
>> #0  0x00007f162547fa30 in MurmurHash64A(void const*, int, unsigned int) ()
>>    from /tank/riak-1.3.1/lib/eleveldb-1.3.0/priv/eleveldb.so
>> #1  0x00007f162547833c in leveldb::(anonymous 
>> namespace)::BloomFilterPolicy2::CreateFilter(leveldb::Slice const*, int, 
>> std::string*) const ()
>>    from /tank/riak-1.3.1/lib/eleveldb-1.3.0/priv/eleveldb.so
>> #2  0x00007f162548382d in leveldb::FilterBlockBuilder::GenerateFilter() ()
>>    from /tank/riak-1.3.1/lib/eleveldb-1.3.0/priv/eleveldb.so
>> #3  0x00007f1625483a58 in leveldb::FilterBlockBuilder::StartBlock(unsigned 
>> long) ()
>>    from /tank/riak-1.3.1/lib/eleveldb-1.3.0/priv/eleveldb.so
>> #4  0x00007f1625475175 in leveldb::TableBuilder::Flush() ()
>>    from /tank/riak-1.3.1/lib/eleveldb-1.3.0/priv/eleveldb.so
>> #5  0x00007f1625475395 in leveldb::TableBuilder::Add(leveldb::Slice const&, 
>> leveldb::Slice const&) () from 
>> /tank/riak-1.3.1/lib/eleveldb-1.3.0/priv/eleveldb.so
>> #6  0x00007f162545b561 in 
>> leveldb::DBImpl::DoCompactionWork(leveldb::DBImpl::CompactionState*) () from 
>> /tank/riak-1.3.1/lib/eleveldb-1.3.0/priv/eleveldb.so
>> #7  0x00007f162545bd3b in leveldb::DBImpl::BackgroundCompaction() ()
>>    from /tank/riak-1.3.1/lib/eleveldb-1.3.0/priv/eleveldb.so
>> #8  0x00007f162545ca5d in leveldb::DBImpl::BackgroundCall() ()
>>    from /tank/riak-1.3.1/lib/eleveldb-1.3.0/priv/eleveldb.so
>> #9  0x00007f162547bb38 in leveldb::(anonymous 
>> namespace)::PosixEnv::BGThreadWrapper(void*) () from 
>> /tank/riak-1.3.1/lib/eleveldb-1.3.0/priv/eleveldb.so
>> #10 0x00007f163366ab50 in start_thread () from 
>> /lib/x86_64-linux-gnu/libpthread.so.0
>> #11 0x00007f16331aca7d in clone () from /lib/x86_64-linux-gnu/libc.so.6
>> #12 0x0000000000000000 in ?? ()
>> 
>> gdb output in gist
>> https://gist.github.com/vshabanov/5768546
>> 
>> Why it's happening and how to bring the node back to life?
>> _______________________________________________
>> riak-users mailing list
>> riak-users@lists.basho.com
>> http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
> 

_______________________________________________
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com

Reply via email to