On Tue, May 10, 2011 at 09:25:45PM -0400, Bruce Furber wrote:

|    A file will be skipped if it has unprintable chars near the beginning.
|    Does the user running the Indexer have read access to the files?

Will the file be silently skipped, or will the indexer simply switch
to the FileAnalyzer?

In any event, this doesn't seem to be it, as the files in question do
not generally appear to have any special characters or weird encodings.
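
A quick way to confirm this across all 26k files, rather than by eye, is to scan the start of each file for control bytes. The sketch below is only an approximation: it is not OpenGrok's actual detection logic, and the 512-byte window, the set of allowed control characters, and the function names are all assumptions made for illustration.

```python
import sys

# Assumption: "unprintable" means an ASCII control byte other than
# tab, line feed, form feed, or carriage return. OpenGrok's real
# heuristic may differ.
ALLOWED_CONTROL = {0x09, 0x0A, 0x0C, 0x0D}


def unprintable_offsets(data, limit=512):
    """Return offsets of suspicious control bytes within the first `limit` bytes."""
    return [
        i
        for i, b in enumerate(data[:limit])
        if (b < 0x20 and b not in ALLOWED_CONTROL) or b == 0x7F
    ]


def check_file(path, limit=512):
    """Read the first `limit` bytes of a file and report suspicious offsets."""
    with open(path, "rb") as f:
        return unprintable_offsets(f.read(limit), limit)


if __name__ == "__main__":
    # Usage: python check_prefix.py FILE [FILE ...]
    for path in sys.argv[1:]:
        hits = check_file(path)
        if hits:
            print(f"{path}: control bytes at offsets {hits[:10]}")
```

Files that print nothing have a clean prefix under these assumptions, which would rule out the unprintable-character explanation for them.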

The permissions are also globally readable.

Looking at the logs, this one project had 26k files skipped out of 28k
-- almost all of them.  So I moved all projects but this one out of
the source directory, erased the indexes and re-indexed.

This time, only four files were skipped -- and they were all dot
files, which I imagine are supposed to be skipped.

I wonder if it's some sort of de-duplication?  Does OpenGrok try not
to index identical files?  I don't think that would explain
everything, but it might explain some of it.
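
One way to test the de-duplication guess would be to count how many files under the source directory have byte-identical contents. The sketch below groups files by SHA-256 digest. It says nothing about what OpenGrok does internally; it only shows whether the number of duplicates is anywhere near the number of skipped files. The function names and the command-line usage are hypothetical.

```python
import hashlib
import os
import sys
from collections import defaultdict


def file_digest(path, chunk_size=65536):
    """Return the SHA-256 hex digest of a file's contents."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def duplicate_groups(root):
    """Map each digest to the sorted paths sharing it, keeping only digests seen more than once."""
    by_digest = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            # Skip symlinks and anything that is not a regular file.
            if os.path.isfile(path) and not os.path.islink(path):
                by_digest[file_digest(path)].append(path)
    return {d: sorted(p) for d, p in by_digest.items() if len(p) > 1}


if __name__ == "__main__":
    # Usage: python find_dupes.py SOURCE_ROOT (defaults to the current directory)
    root = sys.argv[1] if len(sys.argv) > 1 else "."
    groups = duplicate_groups(root)
    redundant = sum(len(paths) - 1 for paths in groups.values())
    print(f"{len(groups)} duplicate groups, {redundant} redundant files")
```

If the redundant count is far below the 26k skipped files, de-duplication alone cannot account for the skips.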

-- 
Doug McLaren, dou...@frenzied.us
_______________________________________________
opengrok-discuss mailing list
opengrok-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/opengrok-discuss
