On 08/23/16 19:25, Jessica Otey wrote:

Our file system is mounted with the "user_xattr" option. Although it took FOREVER to do this readlog (a few times as long as the initial scan did), it did complete successfully. Can I assume, then, that these errors are not some sort of misconfiguration but rather they reflect changes in the filesystem?
This error suggest there are existing entries without "trusted.link" attribute. I think this may occur for still existing entries that have no longer a path in Lustre, in particular if the file has been opened and deleted - but is still opened. An advice from Lustre expert could help to confirm that.

Perhaps it took so long because it was run so long after a file system scan?
Yes of course. The longer you wait to dequeue changelogs, the more history of changes robinhood has to process to resync its DB.


What I'm trying to understand better is the relationship between scanning a file system and using the changelogs feature.

1) If changelogs is in fact working, does that mean that I *never* need to do a file system scan again? Or is it still a good idea to do occasional, full-system scans? If so, at what interval?
There may be some changes that are not reported by changelogs, in particular in case of client or server crash or brutal shutdown. So it is a good idea to rescan occasionally (with a interval of several months), depending if you had many of client or servers crashes.

2) With changelogs in the picture, is there a way to indicate how up-to-date the information issued by robinhood reports is? Does one just assume it is real-time at that point, and thus the date of last scan (as reported by rbh-report -a) becomes irrelevant as an indicator of data freshness?
Robinhood regularly dump changelogs stats in its log.
The "last read record time" indicates the timestamp of the last changelog robinhood processed.
If it is too far from the log line time, robinhood is late.

2016/08/25 *11:29:04* [945/2] STATS | last read record time = 2016/08/25 *11:28:31*.600763

Also, you can see the number of queued changelog records that robinhood has to process on the MDS
(difference between the current index and the cl1 index).

# cat /proc/fs/lustre/mdd/fsname-MDT0000/changelog_users
current index: *1425**7232*
ID    index
cl1 **1425*7022*

HTH
Thomas


Thanks,
Jessica

To troubleshoot this you can:
- check this entry path:
    lfs fid2path fsname 0x93ca92:0x709d8371:0x0
- try to get this attribute from command line:
    getfattr -n trusted.link <path_to_entry>
- verify that the filesystem is mounted with the "user_xattr" option.

Regards,
Thomas


------------------------------------------------------------------------------
_______________________________________________
robinhood-support mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/robinhood-support

Reply via email to