> I run XPDF version .80 on a BSDI 4.0 box. I use htDig 3.1.1 in
> conjunction with XPDF to index the pdf documents. This morning I got
> several of the folowing error messages from my cron:
> ______________________________________________________________________________
> Error (1024): PDF version 1.3 -- xpdf supports version 1.2 (continuing anyway)
> Error (0): PDF file is damaged - attempting to reconstruct xref table...
> Error: Top-level pages object is wrong type (null)
> Error: Couldn't read page catalog
> ______________________________________________________________________________
>
>
> They result from the following documents that someone unleashed yesterday;)
> ______________________________________________________________________________
> http://www.ccsf.cc.ca.us/Services/Human_Resources/jobs/pdf/A99034.pdf
I checked this first one. Xpdf 0.80 doesn't have any trouble
displaying it (on my Linux box). It's just scanned images, one per
page, so pdftotext isn't going to get anything.
I'm planning to add PDF 1.3 support, but it doesn't look like there are
too many major differences, so xpdf 0.80 should do ok for now.
As for the 'file is damaged' error, maybe you got a bad file download?
For example, I've seen cases where (flaky) web servers die in the middle
of a transfer, with no visible error. Your error message is consistent
with what I'd expect for a truncated PDF file.
- Derek
------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the SUBJECT of the message.