1) Regarding the DB update errors, I think there is a wrong 'size' value somewhere in the DB
that prevents the accounting table from being built correctly.
It might be a NULL value for the size of an entry.

First, you can check that there are no such records in ACCT_STAT:
select * from ACCT_STAT where size is NULL;

And in ENTRIES too (this query takes much longer...):
select * from ENTRIES where size is NULL;
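
If you only need to know whether such rows exist, counting them avoids printing every matching row. Here is a minimal sketch of that variant; the helper name null_size_query and the mysql user and database names in the comment are placeholders I made up, not values from your setup:

```shell
#!/bin/sh
# Print a query that counts the rows of a table whose size is NULL.
# COUNT(*) returns a single number instead of listing every row.
null_size_query() {
    printf 'SELECT COUNT(*) FROM %s WHERE size IS NULL;\n' "$1"
}

# Show the two checks; to run one, pipe it into the mysql client.
# The user and database names below are placeholders (assumptions):
#   null_size_query ENTRIES | mysql -u rbh_user -p rbh_db
null_size_query ACCT_STAT
null_size_query ENTRIES
```

A result of 0 for both tables would rule out the NULL size hypothesis.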

2) Regarding the segfault, I need more of the stack trace
(for example, it can be obtained by typing 'where' in gdb).
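
Since the process is multi-threaded, a backtrace of every thread is likely more useful than the top frame alone. Here is a rough sketch of how it could be collected non-interactively; the function name rbh_backtrace and the core file path are hypothetical, so adjust them to wherever your cores are written:

```shell
#!/bin/sh
# Dump the backtrace of every thread recorded in a core file.
# $1: binary that produced the core, $2: core file (path is site-specific).
rbh_backtrace() {
    bin=$1
    core=$2
    if [ ! -r "$core" ]; then
        echo "cannot read core file: $core" >&2
        return 1
    fi
    # -batch: run the given commands, then exit; -ex: command to execute.
    gdb -batch -ex "thread apply all bt" "$bin" "$core"
}

# Example (hypothetical core path):
#   rbh_backtrace /usr/sbin/robinhood /path/to/core > rbh_backtrace.txt 2>&1
```

The resulting text file can then be attached to a reply on the list.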

regards,

On 12/31/13 19:10, Daniel Milroy wrote:

Hello,

After completing an initial scan to populate the robinhood database, I've run into some problems running the application as a daemon. The robinhood host is running robinhood 2.4.3, and Lustre client 2.4.2 on RHEL 6.4. Robinhood will segfault and dump core after running for between 1 and 3 days. Nothing is written to /var/log/messages or dmesg. The core files vary in size from 22MB-17GB and contain the following information:

[New Thread 30924]
[New Thread 30931]
[New Thread 30929]
[New Thread 30932]
[New Thread 30934]
[New Thread 30930]
[New Thread 30926]
[New Thread 30928]
[New Thread 30937]
[New Thread 30935]
[New Thread 30933]
Reading symbols from /usr/lib64/libmysqlclient.so.18...done.
Loaded symbols for /usr/lib64/libmysqlclient.so.18
Reading symbols from /lib64/libpthread.so.0...(no debugging symbols found)...done.
[Thread debugging using libthread_db enabled]
Loaded symbols for /lib64/libpthread.so.0
Reading symbols from /lib64/libm.so.6...(no debugging symbols found)...done.
Loaded symbols for /lib64/libm.so.6
Reading symbols from /lib64/librt.so.1...(no debugging symbols found)...done.
Loaded symbols for /lib64/librt.so.1
Reading symbols from /lib64/libdl.so.2...(no debugging symbols found)...done.
Loaded symbols for /lib64/libdl.so.2
Reading symbols from /usr/lib64/liblustreapi.so...done.
Loaded symbols for /usr/lib64/liblustreapi.so
Reading symbols from /lib64/libc.so.6...(no debugging symbols found)...done.
Loaded symbols for /lib64/libc.so.6
Reading symbols from /usr/lib64/libstdc++.so.6...(no debugging symbols found)...done.
Loaded symbols for /usr/lib64/libstdc++.so.6
Reading symbols from /lib64/libgcc_s.so.1...(no debugging symbols found)...done.
Loaded symbols for /lib64/libgcc_s.so.1
Reading symbols from /lib64/ld-linux-x86-64.so.2...(no debugging symbols found)...done.
Loaded symbols for /lib64/ld-linux-x86-64.so.2
Reading symbols from /lib64/libnss_files.so.2...(no debugging symbols found)...done.
Loaded symbols for /lib64/libnss_files.so.2
Reading symbols from /lib64/libnss_sss.so.2...(no debugging symbols found)...done.
Loaded symbols for /lib64/libnss_sss.so.2
Core was generated by `/usr/sbin/robinhood -d -f /etc/robinhood.d/tmpfs/robinhood_janus.conf -p /var/r'.
Program terminated with signal 6, Aborted.
#0  0x00007f218c4e18e5 in raise () from /lib64/libc.so.6

On a possibly related note, there are thousands of the following entries in robinhood.log:

2013/12/31 10:53:37 robinhood@jana[2716/8]: ListMgr | Error 1 executing query 'UPDATE ENTRIES SET owner='user12',gr_name='user12grp',size=9026,blocks=24,last_access=1388512416,last_mod=1388512416,type='file',md_update=1388512417,whitelisted=0,release_class='@default@',rel_cl_update=1388512417 WHERE id='0x210a18513:0x19e15:0x0'': Out of range value for column 'sz1K' at row 1
2013/12/31 10:53:37 robinhood@jana[2716/8]: ListMgr | Unhandled error 1264: default conversion to DB_REQUEST_FAILED
2013/12/31 10:53:37 robinhood@jana[2716/8]: EntryProc | Error 7 performing database operation.
2013/12/31 10:53:37 robinhood@jana[2716/7]: ListMgr | Unhandled error 1264: default conversion to DB_REQUEST_FAILED

I followed the instructions in "Re: [robinhood-devel] RBH DB Fail - Out of range value for column 'sz1k'" and dropped the ACCT_STAT table. After rebuilding the table the errors continue, and now include several other columns. Would upgrading to robinhood 2.5.0 be advisable?

Please let me know if I can provide additional information that may be helpful to the diagnosis of these problems.

Thank you in advance,

Dan Milroy



------------------------------------------------------------------------------
Rapidly troubleshoot problems before they affect your business. Most IT
organizations don't have a clear picture of how application performance
affects their revenue. With AppDynamics, you get 100% visibility into your
Java,.NET, & PHP application. Start your 15-day FREE TRIAL of AppDynamics Pro!
http://pubads.g.doubleclick.net/gampad/clk?id=84349831&iu=/4140/ostg.clktrk


_______________________________________________
robinhood-support mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/robinhood-support
