On Mon, 21 Sep 1998, Geoff Hutchison wrote:

> Date: Mon, 21 Sep 1998 23:13:12 -0400
> From: Geoff Hutchison <[EMAIL PROTECTED]>
> To: "Joe R. Jah" <[EMAIL PROTECTED]>
> Cc: [EMAIL PROTECTED]
> Subject: Re: htdig: Problems with using htdig -a
> 
> At 1:23 AM -0400 9/18/98, Joe R. Jah wrote:
> >I assume this increase in the size of the db files and the increase in the
> >reported number of documents will be cumulative over time if one uses this
> >workaround. It will probably increase the actual search time as well ;(
> 
> I'm not sure what's going on here. Perhaps you could export the ASCII
> database for the db with and without this behavior. I'd be interested to
> see if documents are being duplicated. Do you use "remove_bad_urls"?

Yes, documents are being duplicated, triplicated, and so on. That's why I use
the old "Excluding directories and duplicate URLs" patch.

And yes, I have the line

        remove_bad_urls:        true

in my htdig.conf file.
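
For anyone who wants to measure the duplication in an exported ASCII
database, counting how often each URL occurs is probably the quickest check.
What follows is only a minimal sketch: it assumes the export holds one record
per line with the URL as the first tab-separated field, which may not match
the real export format, and the names duplicate_urls and report are made up
for illustration.

```python
from collections import Counter


def duplicate_urls(lines):
    """Return {url: count} for every URL that occurs more than once.

    Assumption (not verified against the real export): one record per
    line, with the URL as the first tab-separated field.
    """
    counts = Counter()
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue  # skip blank lines
        url = line.split("\t", 1)[0]
        counts[url] += 1
    return {url: n for url, n in counts.items() if n > 1}


def report(path):
    """Print 'count url' for each duplicated URL in the file at path."""
    # latin-1 decodes any byte sequence, so odd characters cannot abort the scan
    with open(path, encoding="latin-1") as dump:
        for url, n in sorted(duplicate_urls(dump).items()):
            print(n, url)
```

Calling report() on the export made with the workaround and on the one made
without it, then comparing the two listings, should show whether the extra
documents are repeats of the same URLs.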

Joe

     _/   _/_/_/       _/              ____________    __o
     _/   _/   _/      _/         ______________     _-\<,_
 _/  _/   _/_/_/   _/  _/                     ......(_)/ (_)
  _/_/ oe _/   _/.  _/_/ ah        [EMAIL PROTECTED]


----------------------------------------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the body of the message.
