Try running it with rundig -i -vvv
as we need to see what MIME-type your server gives for file of type .djvu (By the way what is a .djvu file?) You don't seem to have any .pdf files to be indexed. I second Adrian Bolzan's recommendation thet you move from parse_doc.pl to doc2html.pl -- David Adams Computing Services Southampton University ----- Original Message ----- From: "Tom Sawyer" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Sent: Thursday, October 24, 2002 3:53 AM Subject: [htdig] pdf and djvu indexing problems > i'm trying to get ht://dig configured and working. but for the life of > me i can't get it to index my pdf and djvu documents. > > i'm running debian woody so i thought the default configuration would > work for at least the pdfs. here's the relevent parts of my config file: > > > max_doc_size: 9999999 > > external_parsers: application/msword /usr/share/htdig/parse_doc.pl \ > application/postscript /usr/share/htdig/parse_doc.pl \ > application/pdf /usr/share/htdig/parse_doc.pl \ > application/djvu->text/plain /usr/local/bin/djvutxt > > debian_pdf_parser: xpdf > > WHEN I RUN: > > rundig -i -v > > I GET THIS: > > New server: localhost, 80 > 0:0:0:http://localhost/files/: ++++-++++ size = 1340 > 1:1:1:http://localhost/files/?N=D: +***-**** size = 1340 > 2:2:1:http://localhost/files/?M=A: *+**-**** size = 1340 > 3:3:1:http://localhost/files/?S=A: **+*-**** size = 1340 > 4:4:1:http://localhost/files/?D=A: ***+-**** size = 1340 > 5:5:1:http://localhost/files/test2.djvu: not HTML > 6:6:1:http://localhost/files/text1.djvu: not HTML > 7:7:1:http://localhost/files/tty.pdf: not found > 8:8:1:http://localhost/files/word.rhtml: size = 796 > 9:9:2:http://localhost/files/?N=A: ****-**** size = 1340 > 10:10:2:http://localhost/files/?M=D: ****-**** size = 1340 > 11:11:2:http://localhost/files/?S=D: ****-**** size = 1340 > 12:12:2:http://localhost/files/?D=D: ****-**** size = 1340 > htmerge: Sorting... > htmerge: Removing doc #5 > htmerge: Removing doc #6 > htmerge: Removing doc #7 > htmerge: Merging... > > Deleted, no excerpt: 5/http://localhost/files/test2.djvu > Deleted, no excerpt: 6/http://localhost/files/text1.djvu > Deleted, no excerpt: 7/http://localhost/files/tty.pdf > htmerge: 10 > > WHAT AM I DOING WRONG? IS THERE SOMETHING I HAVE TO DO TO GET MY CONFIG > FILE TO REGISTER EACH TIME I CHANGE IT? PLEASE HELP. THANKS. > > -- > tom sawyer, aka transami > [EMAIL PROTECTED] > > > > ------------------------------------------------------- > This sf.net email is sponsored by: Influence the future > of Java(TM) technology. Join the Java Community > Process(SM) (JCP(SM)) program now. > http://ads.sourceforge.net/cgi-bin/redirect.pl?sunm0002en > > _______________________________________________ > htdig-general mailing list <[EMAIL PROTECTED]> > To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe > FAQ: http://htdig.sourceforge.net/FAQ.html > ------------------------------------------------------- This sf.net email is sponsored by: Influence the future of Java(TM) technology. Join the Java Community Process(SM) (JCP(SM)) program now. http://ad.doubleclick.net/clk;4729346;7592162;s?http://www.sun.com/javavote _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

