Your external_parsers statement is incorrect, and you may also be truncating the PDF file. See FAQ 4.9: http://www.search.soton.ac.uk/htdig/FAQ.html#q4.9 and FAQ 5.2: http://www.search.soton.ac.uk/htdig/FAQ.html#q5.2
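
As a rough sketch of what the two FAQ entries point towards (the FAQs themselves are the authority; check them against your ht://Dig version): doc2html.pl emits HTML, so it has to be declared as a converter using the "->text/html" form rather than as a plain external parser. The "External parser error in line:<HTML>" messages are consistent with htdig receiving HTML where it expected parser records. The "PDF file is damaged" errors are what you would typically see if htdig cut the download off at max_doc_size. The size value below is only an illustrative assumption; choose one larger than your biggest PDF.

```
# htdig.conf (sketch, assuming a version of ht://Dig that supports converters)

# Declare doc2html.pl as a converter from PDF to HTML
external_parsers: application/pdf->text/html /usr/local/bin/doc2html.pl

# Raise the per-document size limit so large PDFs are fetched whole
# (illustrative value, not a recommendation)
max_doc_size: 5000000
```

After changing the configuration, rerun htdig on the same URL and check whether both kinds of error disappear.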
David Adams
Corporate Information Services
Information Systems Services
University of Southampton

----- Original Message -----
From: "Anne Durand" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Thursday, March 27, 2003 3:01 PM
Subject: [htdig] indexing pdf files

> Hello
> When I run on command line
> doc2html.pl /full/path/to/sample/Maison_Guiette.pdf "application/pdf" url
> I don't get any error and the parsing looks ok.
>
> The htdig.conf file contains
> external_parsers: application/pdf /usr/local/bin/doc2html.pl
>
> When I run htdig, I get the following errors :
> Error (0): PDF file is damaged - attempting to reconstruct xref table...
> Error: Top-level pages object is wrong type (null)
> Error: Couldn't read page catalog
> External parser error in line:<HTML>
> URL: http://www.archi.fr/UIA/htmEdifices/DOCOMOMO/Belgium/Maison_Guiette.pdf
> External parser error in line:<HEAD>
> ....
>
> Thanks for any suggestion
> @nne

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

