All, while upgrading our Lustre file system to Lustre 2.7, I also upgraded robinhood to the newly released 2.5.5. I did download the tar file and compiled it locally as the pre-built rpms on sourceforge have a dependency on lustre-modules but on our site the rpm provides lustre-client-modules.
The RPM installed fine, the server is running Lustre 2.7 but with the same configuration that previously was running fine (on 2.5.4) the new version now segfaults on startup (called as robinhood --read-log). I'm currently not sure how to debug this further. Any pointers welcome, strace wasn't helpful in determining where it crashes, the log isn't that clear either, with normal options the following are the only lines in the logfile: <snip> 2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/1] CheckFS | '/mnt/lustre03' matches mount point '/mnt/lustre03', type=lustre, fs=cs04r-sc-mds03-01-10ge@tcp:cs04r-sc-mds03-02-10ge@tcp:/lustre03 2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/2] SigHdlr | Signals SIGTERM and SIGINT (daemon shutdown) are ready to be used 2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/2] SigHdlr | Signal SIGHUP (config reloading) is ready to be used 2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/2] SigHdlr | Signal SIGUSR1 (stats dump) is ready to be used 2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/1] EntryProc | No class defined in policies, disabling file class matching. 2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/1] EntryProc | No class defined in policies, disabling dir class matching. 2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/1] Main | Daemon started (running modules: log_reader) 2015/06/15 18:27:36 robinhood@cs04r-sc-serv-92[103117/3] ChangeLog | LU-1331 is fixed in this version of Lustre. </snip> With --log-level=DEBUG there are quite a few lines like this following before it just stops: <snip> 2015/06/15 18:46:19 robinhood@cs04r-sc-serv-92[110019/11] ChangeLog | MDT0000: 3435143616 14SATTR 1434376871.895757017 0x14 t=[0x20001026e:0x86f9:0x0] 2015/06/15 18:46:19 robinhood@cs04r-sc-serv-92[110019/11] ChangeLog | MDT0000: 3435143617 08RENME 1434376871.897757063 0x1 t=[0x20000fa28:0xb3ef:0x0] p=[0xecedd2c:0x52385992:0x0] LineScan$py.class s=[0x20001026e:0x86f9:0x0] sp=[0xecedd2c:0x52385992:0x0] .LineScan$py.class.3FMhca </snip> Cheers, Frederik -- Frederik Ferner Senior Computer Systems Administrator (storage) phone: +44 1235 77 8624 Diamond Light Source Ltd. mob: +44 7917 08 5110 Duty Sys Admin can be reached at x8596 (Apologies in advance for the lines below. Some bits are a legal requirement and I have no control over them.) -- This e-mail and any attachments may contain confidential, copyright and or privileged material, and are for the use of the intended addressee only. If you are not the intended addressee or an authorised recipient of the addressee please notify us of receipt by returning the e-mail and do not use, copy, retain, distribute or disclose the information in or attached to the e-mail. Any opinions expressed within this e-mail are those of the individual and not necessarily of Diamond Light Source Ltd. Diamond Light Source Ltd. cannot guarantee that this e-mail or any attachments are free from viruses and we cannot accept liability for any damage which you may sustain as a result of software viruses which may be transmitted in or with the message. Diamond Light Source Limited (company no. 4375679). Registered in England and Wales with its registered office at Diamond House, Harwell Science and Innovation Campus, Didcot, Oxfordshire, OX11 0DE, United Kingdom ------------------------------------------------------------------------------ _______________________________________________ robinhood-support mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/robinhood-support
