On 3 Oct 2001 at 9:27, Gilles Detillieux wrote:

> According to Dan Langille:
> > On 1 Oct 2001 at 17:53, Geoff Hutchison wrote:
> > > On Mon, 1 Oct 2001, Dan Langille wrote:
> > > > redirect: http://www.unixathome.org/adsl/archives/2001_06/
> > > > 
> > > >    Rejected: URL not in the limits!
> > > 
> > > Right. This is what I suspected. In your config file, the
> > > limit_urls_to attribute is restricting the indexing from looking at
> > > these URLs. So it would help if you could post from your configuration
> > > things like:
> > > 
> > > limit_urls_to:
> > > exclude_urls:
> > > max_hop_count:
> > 
> > limit_urls_to:          ${start_url}
> > exclude_urls:           /cgi-bin/ .cgi /phorum/
> > max_hop_count: <== not found in config file.
> 
> That's the problem.  Your start_url is something like
> 
>    http://unixathome.org/
> 
> but the redirect gives http://www.unixathome.org/adsl/..., which doesn't
> match the pattern in limit_urls_to as it has simply taken on the value of
> start_url.  You should probably set the following in your htdig.conf:
> 
> limit_urls_to:        http://unixathome.org/ http://www.unixathome.org/
> server_aliases:       www.unixathome.org:80=unixathome.org:80
> 
> The limit_urls_to will allow URLs with or without the "www.", and the
> server_aliases will strip off the "www." to avoid getting duplicates in
> the database, with and without the "www." prefix.  If you prefer, you
> could also set limit_urls_to as...
> 
> limit_urls_to:        ${start_url} http://www.unixathome.org/
> 
> so any subsequent additions to start_url won't be excluded by
> limit_urls_to.

*click*

I understand now.  Thank you.
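
To check my own understanding, here is a minimal Python sketch of the
logic as I read it.  It is only a model, not ht://Dig's actual code: the
function names are made up, it assumes limit_urls_to patterns are matched
as plain substrings of the URL, and it assumes my start_url is
http://unixathome.org/ as Gilles guessed.

```python
from urllib.parse import urlsplit, urlunsplit

# Hypothetical helpers: a simplified model of the behaviour described
# above, not the real ht://Dig implementation.

def within_limits(url, limit_urls_to):
    """True if the URL contains at least one limit_urls_to pattern
    (assumes plain substring matching)."""
    return any(pattern in url for pattern in limit_urls_to)

def apply_server_alias(url, aliases):
    """Rewrite host:port using a map such as
    {"www.unixathome.org:80": "unixathome.org:80"}."""
    parts = urlsplit(url)
    port = parts.port or 80            # assume plain http when no port given
    target = aliases.get(f"{parts.hostname}:{port}")
    if target is None:
        return url                     # no alias for this server
    host, _, target_port = target.partition(":")
    # Leave the default port out of the rewritten URL.
    netloc = host if target_port in ("", "80") else f"{host}:{target_port}"
    return urlunsplit((parts.scheme, netloc, parts.path,
                       parts.query, parts.fragment))

start_url = "http://unixathome.org/"   # assumed value
redirect = "http://www.unixathome.org/adsl/archives/2001_06/"
aliases = {"www.unixathome.org:80": "unixathome.org:80"}

# Old setting: limit_urls_to is just ${start_url}, so the redirect fails.
print(within_limits(redirect, [start_url]))
# Suggested setting: both forms listed, so the redirect is accepted.
print(within_limits(redirect, [start_url, "http://www.unixathome.org/"]))
# The alias then folds the "www." form into one canonical URL.
print(apply_server_alias(redirect, aliases))
```

The first check fails because "http://unixathome.org/" never appears
inside the "www." URL, which is exactly the rejection I was seeing.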

FWIW: I'm planning an htdig article for my website.  I'll be sure to
include this.  Fine traps for young players.

The other main thing I want to cover is incremental indexing.  I've
seen it referenced in the list several times over the past few months.
It will be especially applicable to my website: write a new article;
add that article to the index.
-- 
Dan Langille
The FreeBSD Diary - http://freebsddiary.org/ - practical examples


_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html
