Re: Webboard: Titles incorrect for pdf files

Alexander Barkov Thu, 26 Apr 2001 10:55:39 -0700

Richard Wall wrote:
> 
> ----- Original Message -----
> From: "Richard Wall" <[EMAIL PROTECTED]>
> 
> > Yeah, it works really well. Infact it accepts a third argument, the URL of
> > the page so I've modified your shell script as follows, using the $UDM_URL
> > environment variable set by mnogosearch...
> 
> Actually, I've discovered a problem. When indexing certain pdf documents,
> the doc2html perl script hangs and uses 100% processor resources.
> 
> It always gets stuck at the same place...
> "confident that the automotive sector can"
> 
> But I can't understand why.
> 
> Alexander, could you try indexing this document with doc2html.pl...
> 
> http://elkie.coventry-id.co.uk/~richard/wb58.pdf
> 
> to see if you get the same problem.

pdfinfo called from doc2html does not return anything to
stdout. It warns about bad format to stderr:

/usr/home/bar > pdfinfo wb58.txt 
Error: May not be a PDF file (continuing anyway)
Error (0): PDF file is damaged - attempting to reconstruct xref table...
Error: Couldn't find trailer dictionary
Error: Couldn't read xref table

  So, doc2html seems to wait for pdfinfo output forever.
___________________________________________
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]

Re: Webboard: Titles incorrect for pdf files

Reply via email to