On Monday, January 20, 2003, at 10:11 AM, Jason Morse wrote:
However, the 'rundig -vvv' log is still mystifying. I get the exact samehtdig does not search a directory for the files it contains. In a normal configuration, the files that are indexed are those discovered by extracting links from other pages. Take a look at http://www.htdig.org/FAQ.html#q5.25
rundig results whether the target directory is empty or contains several
PDF files. If I add an HTML file to the directory along with the PDF
files, rundig indicates that the file is indexed, but does not mention the
PDFs. It seems that the PDFs are being completely ignored. Here is the
log after running on the directory with 1 HTML and 10 PDFs:
My guess is that htdig is finding the web page you added because you named it in a manner that your web server recognizes as an index file (e.g. index.html). In this case the '/' at the end of the URL is implicitly mapped to a particular file name, and you are not really giving htdig a directory, but instead a specific file from which to start. Try adding a link to each of your PDF's in the HTML file you added. That should result in htdig seeing the PDF files. If it more appropriate to your needs, there are also ways to specify explicit file lists to htdig and configure your web server to automatically generate links for files contained in a directory.
Jim
-------------------------------------------------------
This SF.NET email is sponsored by: FREE SSL Guide from Thawte
are you planning your Web Server Security? Click here to get a FREE
Thawte SSL guide and find the answers to all your SSL security issues.
http://ads.sourceforge.net/cgi-bin/redirect.pl?thaw0026en
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

