RE: [htdig] pdf parser

Martin Vorlaender Mon, 13 Jan 2003 00:42:53 -0800

> I appear to have the pdf parser working partly. Some pdf files are
> indexed but most are not. I am using xpdf and doc2html programmes.
> Is there a reason why this should happen?


Assuming you're talking about ht://Dig 3.1.x:

The reason that gets most people is that any document bigger than
max_doc_size will not be retrieved completely, and thus not indexed
completely.

See http://www.htdig.org/attrs.html#max_doc_size

cu,
  Martin
-- 
                       | Martin Vorlaender         VMS & WNT programmer
 OpenVMS is today      | work: [EMAIL PROTECTED]
 what Microsoft wants  |       http://www.pdv-systeme.de/users/martinv/
 Windows NT 8.0 to be! | home: [EMAIL PROTECTED]



-------------------------------------------------------
This SF.NET email is sponsored by:
SourceForge Enterprise Edition + IBM + LinuxWorld = Something 2 See!
http://www.vasoftware.com
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

RE: [htdig] pdf parser

Reply via email to