On Tue, 7 Dec 1999, Geoff Hutchison wrote:
> Date: Tue, 7 Dec 1999 11:07:01 -0600
> From: Geoff Hutchison <[EMAIL PROTECTED]>
> To: htdig3-dev <[EMAIL PROTECTED]>
> Subject: [htdig3-dev] Re: htdig-3.1.4 prerelease
>
> Hi,
>
> First off, many thanks to Gilles, who has been hard at work on
> cleaning up the 3-1-x tree! The results of that work are currently on
> the site, as a prerelease for the 3.1.4 release:
> <http://www.htdig.org/files/snapshots/htdig-3.1.4-prerelease.tar.gz>
>
> As I put in the htdoc/RELEASE.html file, I'd like to release this on
> Thursday evening my time (US Central, 12/9/99), so if people could
> take a look and make sure it compiles, etc. we'd both greatly
> appreciate it. Using CVS, it's the current state of the 3-1-x branch.
I downloaded and installed it on a BSDI 4.0 box; it compiled, but htsearch
dumped core. I followed the old BSDI/htdig fix:
. make clean
. Remove references to regex.o from htlib/Makefile
. rm htlib/regex.h
. make
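
For anyone repeating the steps above, here is a rough sketch of them as a
shell function. It is not part of the htdig distribution. It assumes that
regex.o appears in htlib/Makefile only as a whitespace-preceded entry in
object lists, which may not hold for every tree, so compare the edited
Makefile against the backup before building. The names strip_regex_obj
and apply_bsdi_fix, and the source path, are made up for illustration.

```shell
#!/bin/sh
# Sketch of the BSDI workaround as functions; nothing runs until
# apply_bsdi_fix is called explicitly.

# Filter: drop whitespace-preceded "regex.o" tokens from Makefile text on stdin.
# Assumption: regex.o only appears as an entry in object lists.
strip_regex_obj() {
    sed -e 's/[[:space:]]regex\.o//g'
}

# Apply the four steps inside an unpacked source tree (path is hypothetical).
apply_bsdi_fix() {
    srcdir=${1:?usage: apply_bsdi_fix /path/to/htdig-source}
    (
        cd "$srcdir" || exit 1
        make clean || exit 1
        cp htlib/Makefile htlib/Makefile.orig || exit 1   # keep a backup
        strip_regex_obj < htlib/Makefile.orig > htlib/Makefile || exit 1
        rm -f htlib/regex.h
        make
    )
}
```

Calling apply_bsdi_fix with the directory of the unpacked prerelease runs
all four steps in a subshell, so the caller's working directory is left
alone, and htlib/Makefile.orig keeps the unedited Makefile for comparison.
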
Everything worked, except that my old local duplicate suppressor patch:
ftp://sol.ccsf.cc.ca.us/htdig-patches/3.0.8b2/Retriever.cc.0
did not quite do its job.
Here are some stats:
____________________________________________________________________
3.1.4:
Start dig: Tue Dec 7 11:16:16 PST 1999
End dig: Tue Dec 7 11:30:37 PST 1999
-rw-r--r-- 1 jjah www 16614400 Dec 7 11:30 db.docdb
-rw-r--r-- 1 jjah www 418816 Dec 7 11:30 db.docs.index
-rw-r--r-- 1 jjah www 18819167 Dec 7 11:30 db.wordlist
-rw-r--r-- 1 jjah www 19718144 Dec 7 11:30 db.words.db
htdig: Run complete
htdig: 1 server seen:
htdig: www.ccsf.cc.ca.us:80 7069 documents
htmerge: Total word count: 88711
htmerge: Total documents: 3727
htmerge: Total doc db size (in K): 29409
3.1.3:
Start dig: Tue Dec 7 11:32:31 PST 1999
End dig: Tue Dec 7 11:47:09 PST 1999
-rw-r--r-- 1 jjah www 16571392 Dec 7 11:47 db.docdb
-rw-r--r-- 1 jjah www 416768 Dec 7 11:47 db.docs.index
-rw-r--r-- 1 jjah www 18734111 Dec 7 11:46 db.wordlist
-rw-r--r-- 1 jjah www 19638272 Dec 7 11:46 db.words.db
htdig: Run complete
htdig: 1 server seen:
htdig: www.ccsf.cc.ca.us:80 7077 documents
htmerge: Total word count: 88507
htmerge: Total documents: 3727
htmerge: Total doc db size (in K): 29409
_______________________________________________________________
As you can see, the database sizes do not vary much, but the results pages
point to the same URL MULTIPLE times in the 3.1.4 case; baffling ;-/
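
To put a number on the size comparison, here is a small sketch that
computes the percentage change of each database file from 3.1.3 to 3.1.4.
The byte counts are copied from the two listings above; the helper name
pct_diff is made up for illustration.

```shell
#!/bin/sh
# Percentage change from old to new, printed with two decimals.
pct_diff() {
    awk -v new="$1" -v old="$2" 'BEGIN { printf "%.2f\n", (new - old) * 100 / old }'
}

# File sizes in bytes, copied from the listings: name, 3.1.4 size, 3.1.3 size.
while read -r name new old; do
    printf '%-14s %s%%\n' "$name" "$(pct_diff "$new" "$old")"
done <<'EOF'
db.docdb       16614400 16571392
db.docs.index  418816   416768
db.wordlist    18819167 18734111
db.words.db    19718144 19638272
EOF
```

On these figures each 3.1.4 file is less than one percent larger than its
3.1.3 counterpart, so the repeated URLs in the results do not show up as
a noticeable difference in database size.
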
That reminds me; has the _promised_ duplicate suppression feature been
placed in 3.2.x yet?
Regards,
Joe
--
_/ _/_/_/ _/ ____________ __o
_/ _/ _/ _/ ______________ _-\<,_
_/ _/ _/_/_/ _/ _/ ......(_)/ (_)
_/_/ oe _/ _/. _/_/ ah [EMAIL PROTECTED]