When I run "htdig -i" I get this error for every single .pdf file:
Error: Page count in top-level pages object is incorrect
I'm using external parsers to search trough pdf, word and excel files.
--- in htdig.conf ---
external_parsers: application/pdf->text/html /opt/www/bin/doc2html.pl \
application/msword->text/html /opt/www/bin/doc2html.pl \
application/msexcel->text/html /opt/www/bin/doc2html.pl
---------------------
----- in doc2html.pl ------
my $PDF2HTML = '/opt/www/bin/pdf2html.pl';
...
# Adobe PDF file using Perl script
if ($PDF2HTML) {
$mime_type = "application/pdf";
$cmd = $PDF2HTML;
# Replace default title (if used) with filename:
$cmdl = "$cmd $Input $mime_type $name";
$magic = '%PDF-|\0PDF CARO\001\000\377';
&store_html_method('PDF (pdf2html)',$cmd,$cmdl,$mime_type,$magic);
}
-----------------------------------------------------------------------------
I've upgraded to the latest version of xpdf (xpdf-1.01) and still get the
error.
Any idea where to look to fix the problem?
Thanks
Peter
-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html