i'm trying to get ht://dig configured and working. but for the life of
me i can't get it to index my pdf and djvu documents.

i'm running debian woody so i thought the default configuration would
work for at least the pdfs. here's the relevent parts of my config file:


max_doc_size:     9999999

external_parsers: application/msword /usr/share/htdig/parse_doc.pl \
                  application/postscript /usr/share/htdig/parse_doc.pl \
                  application/pdf /usr/share/htdig/parse_doc.pl \
                  application/djvu->text/plain /usr/local/bin/djvutxt

debian_pdf_parser: xpdf

WHEN I RUN:

rundig -i -v

I GET THIS:

New server: localhost, 80
0:0:0:http://localhost/files/: ++++-++++ size = 1340
1:1:1:http://localhost/files/?N=D: +***-**** size = 1340
2:2:1:http://localhost/files/?M=A: *+**-**** size = 1340
3:3:1:http://localhost/files/?S=A: **+*-**** size = 1340
4:4:1:http://localhost/files/?D=A: ***+-**** size = 1340
5:5:1:http://localhost/files/test2.djvu:  not HTML
6:6:1:http://localhost/files/text1.djvu:  not HTML
7:7:1:http://localhost/files/tty.pdf:  not found
8:8:1:http://localhost/files/word.rhtml:  size = 796
9:9:2:http://localhost/files/?N=A: ****-**** size = 1340
10:10:2:http://localhost/files/?M=D: ****-**** size = 1340
11:11:2:http://localhost/files/?S=D: ****-**** size = 1340
12:12:2:http://localhost/files/?D=D: ****-**** size = 1340
htmerge: Sorting...
htmerge: Removing doc #5
htmerge: Removing doc #6
htmerge: Removing doc #7
htmerge: Merging...

Deleted, no excerpt: 5/http://localhost/files/test2.djvu
Deleted, no excerpt: 6/http://localhost/files/text1.djvu
Deleted, no excerpt: 7/http://localhost/files/tty.pdf
htmerge: 10

WHAT AM I DOING WRONG? IS THERE SOMETHING I HAVE TO DO TO GET MY CONFIG
FILE TO REGISTER EACH TIME I CHANGE IT? PLEASE HELP. THANKS.

-- 
tom sawyer, aka transami
[EMAIL PROTECTED]



-------------------------------------------------------
This sf.net email is sponsored by: Influence the future 
of Java(TM) technology. Join the Java Community 
Process(SM) (JCP(SM)) program now. 
http://ads.sourceforge.net/cgi-bin/redirect.pl?sunm0002en

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to