On Tue, May 10, 2011 at 09:25:45PM -0400, Bruce Furber wrote:
| A file will be skipped if it has unprintable chars near the beginning.
| Does the user running the Indexer have read access to the files?

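Both of those suggestions can be checked across a whole tree with a short script. What follows is only a rough sketch in Python, not OpenGrok's actual detection logic: the 512-byte window, the set of bytes counted as text, and the name find_suspect_files are all assumptions made for illustration.

```python
import os

# Bytes treated as ordinary text here (an assumption, not OpenGrok's rule):
# common control characters (BEL, BS, TAB, LF, FF, CR, ESC) plus everything
# from 0x20 upward except DEL (0x7f).
TEXT_BYTES = bytes(({7, 8, 9, 10, 12, 13, 27} | set(range(0x20, 0x100))) - {0x7F})


def find_suspect_files(root, head_bytes=512):
    """Yield (path, reason) for files under root that cannot be read or
    that contain non-text bytes within their first head_bytes bytes."""
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in sorted(filenames):
            path = os.path.join(dirpath, name)
            try:
                with open(path, "rb") as fh:
                    head = fh.read(head_bytes)
            except OSError as exc:
                # Covers permission problems, dangling symlinks, and so on.
                yield path, "unreadable: " + str(exc)
                continue
            # Delete every "text" byte; anything left over is suspicious.
            if head.translate(None, TEXT_BYTES):
                yield path, "non-text bytes near start"
```

Run it as the same user that runs the Indexer and loop over find_suspect_files("/path/to/project"), printing each pair. An empty result suggests that neither unprintable characters nor permissions account for the skipped files.
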
Will the file be silently skipped, or will the indexer simply switch to
the FileAnalyzer?

In any event, neither of these seems to be the cause: the files in
question do not generally appear to have any special characters or weird
encodings, and they are all world-readable.

Looking at the logs, this one project had 26k files skipped out of 28k --
almost all of them.

So I moved all projects but this one out of the source directory, erased
the indexes and re-indexed. This time, only four files were skipped --
and they were all dot files, which I imagine are supposed to be skipped.

I wonder if it's some sort of de-duplication? Does OpenGrok try not to
index identical files? I don't think that would explain everything, but
it might explain some of it.

-- 
Doug McLaren, dou...@frenzied.us
_______________________________________________
opengrok-discuss mailing list
opengrok-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/opengrok-discuss