Hi,
I'm using htdig 3.1.5 with SuSE Linux 7.1.
1) I downloaded "conv_doc.pl" and modified the paths inside.
2) I downloaded the sources of xpdf, compiled and installed them.
3) I put the lines
external_parsers: application/msword->text/html /usr/local/bin/conv_doc.pl \
                  application/postscript->text/html /usr/local/bin/conv_doc.pl \
                  application/pdf->text/html /usr/local/bin/conv_doc.pl
to htdig.conf.
After starting rundig with -vvv, I got the following messages:
.
.
.
Header line: Accept-Ranges: bytes
Header line: Content-Length: 56171
Header line: Connection: close
Header line: Content-Type: application/pdf
Header line:
returnStatus = 0
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 7019 from document
Read a total of 56171 bytes
sh: /usr/local/bin/conv_doc.pl: No such file or directory
size = 56171
pick: 127.0.0.1, # servers = 1
htmerge: Sorting...
htmerge: Merging...
0/http://127.0.0.1/
Deleted, no excerpt: 1/http://127.0.0.1/isdn_faq.pdf
But the file is there:
predator:/usr/local/bin # ll
total 7709
drwxr-xr-x 2 root root 202 Sep 18 14:58 .
drwxr-xr-x 18 root root 362 Aug 9 11:49 ..
-rwxr-xr-x 1 root root 7826 Sep 18 14:35 conv_doc.pl
-rwxr-xr-x 1 root root 1044958 Sep 18 13:04 pdfimages
-rwxr-xr-x 1 root root 1041560 Sep 18 13:04 pdfinfo
-rwxr-xr-x 1 root root 1423345 Sep 18 13:04 pdftopbm
-rwxr-xr-x 1 root root 1141186 Sep 18 13:04 pdftops
-rwxr-xr-x 1 root root 1080095 Sep 18 13:04 pdftotext
-rwxr-xr-x 1 root root 2140598 Sep 18 13:04 xpdf
predator:/usr/local/bin #
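One check that may be worth running here, offered only as a sketch: a shell can report "No such file or directory" for a script that does exist when the interpreter named on the script's "#!" line is missing, or when that line ends in a carriage return (DOS line endings). Whether that is the cause in this case is an assumption, not something the log confirms. The helper name check_shebang below is hypothetical and not part of ht://Dig.

```shell
#!/bin/sh
# Hypothetical helper (not part of ht://Dig): report whether the interpreter
# named on a script's "#!" line exists and is executable.
check_shebang() {
    script=$1
    if [ ! -r "$script" ]; then
        echo "cannot read $script"
        return 2
    fi

    # Look only at the first line of the script.
    first_line=$(head -n 1 "$script")
    case $first_line in
        '#!'*) ;;
        *) echo "no #! line"; return 2 ;;
    esac

    # A trailing carriage return becomes part of the interpreter name,
    # so the lookup fails even though the path looks right.
    cr=$(printf '\r')
    case $first_line in
        *"$cr") echo "carriage return at end of #! line"; return 1 ;;
    esac

    # Strip "#!" plus optional blanks, then drop any interpreter arguments.
    interp=$(printf '%s\n' "$first_line" |
        sed 's/^#![[:space:]]*//; s/[[:space:]].*$//')

    if [ -x "$interp" ]; then
        echo "interpreter ok: $interp"
        return 0
    fi
    echo "interpreter missing: $interp"
    return 1
}

# Example call (path taken from the listing above):
#   check_shebang /usr/local/bin/conv_doc.pl
```

If this prints "interpreter missing", the first line of conv_doc.pl probably points at a perl path that does not exist on this machine, and comparing it with the output of "which perl" would show the path to use instead.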
Any idea?
Mike