All,

While upgrading our Lustre file system to Lustre 2.7, I also upgraded 
robinhood to the newly released 2.5.5. I downloaded the tar file and 
compiled it locally, because the pre-built RPMs on SourceForge depend 
on lustre-modules, whereas the RPM at our site provides 
lustre-client-modules.

The RPM installed fine and the server is running Lustre 2.7. However, 
with the same configuration that previously ran fine (on 2.5.4), the new 
version segfaults on startup (called as robinhood --read-log). I'm 
not sure how to debug this further, so any pointers are welcome. 
strace wasn't helpful in determining where it crashes, and the log isn't 
much clearer. With normal options, the following are the only lines 
in the logfile:

<snip>
2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/1] CheckFS | '/mnt/lustre03' matches mount point '/mnt/lustre03', type=lustre, fs=cs04r-sc-mds03-01-10ge@tcp:cs04r-sc-mds03-02-10ge@tcp:/lustre03
2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/2] SigHdlr | Signals SIGTERM and SIGINT (daemon shutdown) are ready to be used
2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/2] SigHdlr | Signal SIGHUP (config reloading) is ready to be used
2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/2] SigHdlr | Signal SIGUSR1 (stats dump) is ready to be used
2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/1] EntryProc | No class defined in policies, disabling file class matching.
2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/1] EntryProc | No class defined in policies, disabling dir class matching.
2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/1] Main | Daemon started (running modules: log_reader)
2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/3] ChangeLog | LU-1331 is fixed in this version of Lustre.
</snip>

With --log-level=DEBUG, there are quite a few lines like the following 
before the output just stops:

<snip>
2015/06/15 18:46:19 robinhood@cs04r-sc-serv-92[110019/11] ChangeLog | MDT0000: 3435143616 14SATTR 1434376871.895757017 0x14 t=[0x20001026e:0x86f9:0x0]
2015/06/15 18:46:19 robinhood@cs04r-sc-serv-92[110019/11] ChangeLog | MDT0000: 3435143617 08RENME 1434376871.897757063 0x1 t=[0x20000fa28:0xb3ef:0x0] p=[0xecedd2c:0x52385992:0x0] LineScan$py.class s=[0x20001026e:0x86f9:0x0] sp=[0xecedd2c:0x52385992:0x0] .LineScan$py.class.3FMhca
</snip>
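
For anyone who wants to sift through the DEBUG output, here is a minimal 
Python sketch that pulls the fields out of ChangeLog lines like the two 
quoted here. The field layout is only inferred from those two records, 
and the helper name parse_changelog_record is hypothetical (it is not 
part of robinhood or Lustre). It assumes each record sits on a single 
line, as it does in the log file.

```python
import re

# Layout assumed from the two DEBUG records quoted in this mail:
#   MDTxxxx: <record id> <NN><TYPE> <epoch.nanos> <flags> t=[fid] p=[fid] name ...
RECORD_RE = re.compile(
    r"(?P<mdt>MDT[0-9a-fA-F]+):\s+"
    r"(?P<index>\d+)\s+"
    r"(?P<type_num>\d{2})(?P<type_name>[A-Z0-9_]+)\s+"
    r"(?P<time>\d+\.\d+)\s+"
    r"(?P<flags>0x[0-9a-fA-F]+)\s+"
    r"(?P<rest>.*)$"
)

# FID fields seen in the sample: t= (target), p= (parent), s= and sp=.
FID_RE = re.compile(r"\b(?P<key>t|p|s|sp)=\[(?P<fid>[^\]]+)\]")


def parse_changelog_record(line):
    """Return a dict of record fields, or None if the line is not a record."""
    m = RECORD_RE.search(line)
    if m is None:
        return None
    rec = m.groupdict()
    rest = rec.pop("rest")
    rec["index"] = int(rec["index"])
    # Collect every key=[fid] pair that follows the flags field.
    rec["fids"] = {f.group("key"): f.group("fid") for f in FID_RE.finditer(rest)}
    return rec


def last_record(lines):
    """Return the last line in `lines` that parses as a changelog record."""
    last = None
    for line in lines:
        rec = parse_changelog_record(line)
        if rec is not None:
            last = rec
    return last
```

Feeding the DEBUG log through last_record() gives the last changelog 
record that was logged before the output stops. That record is not 
necessarily the one that triggers the crash, but it narrows down where 
to look.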


Cheers,
Frederik
-- 
Frederik Ferner
Senior Computer Systems Administrator (storage) phone: +44 1235 77 8624
Diamond Light Source Ltd.                       mob:   +44 7917 08 5110

Duty Sys Admin can be reached at x8596


