It sounds as though no text is being extracted, do you get any 
warning/error messages?  The documents may be encrypted, or may have 
the extraction of text forbidden.

What method of converting the PDF documents are you using? The 
doc2html.pl converter script can log each file it converts and tell 
you how many bytes it is sending back to htdig.  


On Fri, 4 May 2001 12:31:51 +0200 
[EMAIL PROTECTED] wrote:

> Hi,
> I'm indexing some PDF-documents without some probs. (the maxdocsize is 5MB!)
> But when I run htmerge there is the following error:
> 'deleted no excerpt'.
> I think that the PDF-files were indexed first, but the update in the db does not
> work.
> What ist to do?
> 
> 
> Uli Rebmann
> [EMAIL PROTECTED]
> 
> 
> 

----------------------
David Adams
[EMAIL PROTECTED]


_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to