Re: Case insensitive StringField?

Jack Krupansky Tue, 21 May 2013 07:23:05 -0700

To be clear, analysis is not supported on StringField (or any non-tokenizedfield). But the good news is that by using the keyword tokenizer(KeywordTokenizer) on a TextField, you can get the same effect.

That will preserve the entire input as a single token. You may want toinclude filters to trim exterior white space and normalize interior whitespace.


-- Jack Krupansky

-----Original Message-----From: Shahak Nagiel

Sent: Tuesday, May 21, 2013 10:06 AM
To: java-user@lucene.apache.org
Subject: Case insensitive StringField?

It appears that StringField instances are treated as literals, even thoughmy analyzer lower-cases (on both write and read sides). So, for example, Ican match with a term query (e.g. "NEW YORK"), but only if the case matches.If I use a QueryParser (or MultiFieldQueryParser), it never works becausethese query values are lowercased and don't match.

I've found that using a TextField instead works, presumably because it'stokenized and processed correctly by the write analyzer. However, I wouldprefer that queries match against the entire/exact phrase ("NEW YORK"),rather than among the tokens ("NEW" or "YORK").


What's the solution here?

Thanks in advance.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Re: Case insensitive StringField?

Reply via email to