> I appear to have the pdf parser working partly. Some pdf files are > indexed but most are not. I am using xpdf and doc2html programmes. > Is there a reason why this should happen?
Assuming you're talking about ht://Dig 3.1.x: The reason that gets most people is that any document bigger than max_doc_size will not be retrieved completely, and thus not indexed completely. See http://www.htdig.org/attrs.html#max_doc_size cu, Martin -- | Martin Vorlaender VMS & WNT programmer OpenVMS is today | work: [EMAIL PROTECTED] what Microsoft wants | http://www.pdv-systeme.de/users/martinv/ Windows NT 8.0 to be! | home: [EMAIL PROTECTED] ------------------------------------------------------- This SF.NET email is sponsored by: SourceForge Enterprise Edition + IBM + LinuxWorld = Something 2 See! http://www.vasoftware.com _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

