Hi, Anne Durand said : > When I run on command line > doc2html.pl /full/path/to/sample/Maison_Guiette.pdf "application/pdf" > url I don't get any error and the parsing looks ok. > > The htdig.conf file contains > external_parsers: application/pdf /usr/local/bin/doc2html.pl > > When I run htdig, I get the following errors : > Error (0): PDF file is damaged - attempting to reconstruct xref table... > Error: Top-level pages object is wrong type (null) > Error: Couldn't read page catalog > External parser error in line:<HTML> > URL: > http://www.archi.fr/UIA/htmEdifices/DOCOMOMO/Belgium/Maison_Guiette.pdf > External parser error in line:<HEAD> > ....
I think this is a FAQ. max_doc_size defaults to 100000. Your PDF file is over 300000 bytes ! OK, it's FAQ #5.2 : <http://www.htdig.org/FAQ.html#q5.2> Regards, Olivier -- Olivier Korn. Strasbourg, France. ------------------------------------------------------- This SF.net email is sponsored by: The Definitive IT and Networking Event. Be There! NetWorld+Interop Las Vegas 2003 -- Register today! http://ads.sourceforge.net/cgi-bin/redirect.pl?keyn0001en _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

