Re: [htdig] word_list columns

1999-11-25 Thread Gilles Detillieux

According to Aaron Turner:
> there are 6 columns in the wordlist file.  Obviously col1 is the word.
> What are the others? (i, l, w, c a)

First field:indexed word (lower case)
i:  doc ID (to match up with records in db.docs.index)
l:  location of word in doc (0-1000, i.e. tenth of a percent units)
w:  weight of word in searches
c:  no. of occurrences of word in document, if > 1
a:  index into anchor list in db.docdb record, to indicate which
anchor name, if any, preceded this word

Fields are tab separated.  All of this info gets put into db.words.db by
htmerge, so htsearch doesn't actually look at db.wordlist.

-- 
Gilles R. Detillieux  E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre   WWW:http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You'll receive a message confirming the unsubscription.



[htdig] word_list columns

1999-11-25 Thread Aaron Turner


there are 6 columns in the wordlist file.  Obviously col1 is the word.
What are the others? (i, l, w, c a)

--
Aaron Turner, Core Developer   http://vodka.linuxkb.org/~aturner/
Linux Knowledge Base Organization  http://linuxkb.org/
Because world domination requires quality open documentation.
aka: [EMAIL PROTECTED], [EMAIL PROTECTED], [EMAIL PROTECTED]



To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You'll receive a message confirming the unsubscription.