Re: NO_NORM and TOKENIZED

Michael McCandless Wed, 05 Mar 2008 01:07:09 -0800

Correct, they are logically orthogonal, and I agree the API issomewhat confusing since "NO_NORMS" is mixing up two things.

To get a tokenized field without norms you can create the field withIndex.TOKENIZED, and then call setOmitNorms(true).

Note that norms "spread" during merges, so, if you really wantNO_NORMS for a given field X then every doc in the index must haveits field X indexed with NO_NORMS. Ie, build a clean index if youdecide to turn off norms for field X.


Mike

Tobias Hill wrote:

Hi,

I am quite new to the Lucene API. I find the Field-constructor
unintuitive. Maybe I have misunderstood it. Let's find out...

It can be used either as:
new Field("field", "data", Store.NO, TOKENIZED)

or:
new Field("field", "data", Store.NO, NO_NORM)


As I understand it NO_NORM and TOKENIZED are not settings for
a one-dimensional behaviour - on the contrary they are rather
orthogonal.
I.e. it is quite likely that I would want _both_ TOKENIZED andNO_NORM.This is especially true for fields that are of approx. equal andshort length
over the doc-space.
- Am I right in my reasoning (which means that the API is a bitunclear)?
Or
- Have I misunderstood something fundamental about TOKENIZED andNO_NORM?
Thankful for any feedback on this,
Tobias

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: NO_NORM and TOKENIZED

Reply via email to