On Sun, 28 Mar 2004, Douglas Kline wrote:

> where htsearch finds them?  Also, why are some pages not reported by htsearch
> which should be reported?  The tests I ran suggest (but don't prove) that the
> key difference between reporting matching pages and not doing so is that the
> search terms which lead to reports are found in the top-level page of the URL
> given as argument to htdig while search terms which lead to the htsearch report
> of no matching pages when in fact there are some aren't in the top-level page.

There are quite a few reasons why a page might be excluded during
indexing and thus not show up later when running htsearch against
the databases. Many of those reasons are covered in the following
FAQ (as well as some of the other items linked to from this FAQ).

  http://www.htdig.org/FAQ.html#q5.27

Verbose output from the dig will often show you why a particular
document is being rejected.

Generally all documents linked from your starting URL, through
any number of hops, are indexed unless one of your configuration
settings explicitly excludes the document.

Jim


-------------------------------------------------------
This SF.Net email is sponsored by: IBM Linux Tutorials
Free Linux tutorial presented by Daniel Robbins, President and CEO of
GenToo technologies. Learn everything from fundamentals to system
administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click
_______________________________________________
ht://Dig general mailing list: <[EMAIL PROTECTED]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general

Reply via email to