Richard Wall wrote:
>
> ----- Original Message -----
> From: "Richard Wall" <[EMAIL PROTECTED]>
>
> > Yeah, it works really well. Infact it accepts a third argument, the URL of
> > the page so I've modified your shell script as follows, using the $UDM_URL
> > environment variable set by mnogosearch...
>
> Actually, I've discovered a problem. When indexing certain pdf documents,
> the doc2html perl script hangs and uses 100% processor resources.
>
> It always gets stuck at the same place...
> "confident that the automotive sector can"
>
> But I can't understand why.
>
> Alexander, could you try indexing this document with doc2html.pl...
>
> http://elkie.coventry-id.co.uk/~richard/wb58.pdf
>
> to see if you get the same problem.
pdfinfo called from doc2html does not return anything to
stdout. It warns about bad format to stderr:
/usr/home/bar > pdfinfo wb58.txt
Error: May not be a PDF file (continuing anyway)
Error (0): PDF file is damaged - attempting to reconstruct xref table...
Error: Couldn't find trailer dictionary
Error: Couldn't read xref table
So, doc2html seems to wait for pdfinfo output forever.
___________________________________________
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]