Thanks! That' make sense :)
- Original Message -
From: "Ahmet Arslan"
To:
Sent: Tuesday, August 24, 2010 4:30 PM
Subject: Re: Why it's boosted up?
Then why short fields are boost up?
In other words longer documents are punished. Because they contain
possi
Thanks for your clear explanation! I got it :)
- Original Message -
From: "MitchK"
To:
Sent: Tuesday, August 24, 2010 3:37 PM
Subject: Re: Why it's boosted up?
Hi Scott,
(so shorter fields are automatically boosted up). "
The theory behind that is th
> Then why short fields are boost up?
In other words longer documents are punished. Because they contain possibly
many terms/words. If this mechanism does not exist, longer documents takes over
and pops up usually in the first page.
Hi Scott,
> (so shorter fields are automatically boosted up). "
>
The theory behind that is the following (in easy words):
Let's say you got two documents, each doc contains on 1 field (like it was
in my example).
Additionally we got a query that contains two words.
Let's say doc1 contains o
In Lucene's web page, there's a paragraph:
"Indexing time boosts are preprocessed for storage efficiency and written to
the directory (when writing the document) in a single byte (!) as follows:
For each field of a document, all boosts of that field (i.e. all boosts
under the same field name i