> > What if the query is '+strong force' and 'strong' is tokenized into
> > 'strong' and the alias 'tough'? Will the query parser convert it to
> > '+(strong OR tough) force'?
>
> If strong is tokenized to /strong tough/, then /+strong force/ will
> be tokenized to /+"strong tough" force/.
>
> While this may look weird to you, that's because this is a really
> bogus example. Tokenizers in Western languages generally don't expand
> a single token into more than one; this is more common in Asian
> languages, where a complicated ideograph is broken down into simpler
> ones. So a more comparable example in English would be to tokenize
> the word "HaagenDazs" into "ice cream". So then a search for
> /+HaagenDazs chocolate/ becomes /+"ice cream" chocolate/, which is
> a pretty reasonable-looking behavior.
I see, so this is intended for cases where a token is replaced with a
sequence of tokens that, taken together, represents a single meaning
similar to that of the original token.
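To check that I understand the behavior described above, here is a rough
sketch in Python rather than Lucene code. The EXPANSIONS table and the
analyze() and rewrite_query() functions are hypothetical names made up
for illustration; they are not part of the Lucene API, and the real query
parser may differ in details.

```python
# Hypothetical expansion table: one input token -> the token sequence the
# tokenizer emits for it. This is an illustration, not a Lucene API.
EXPANSIONS = {"haagendazs": ["ice", "cream"]}


def analyze(term):
    """Return the token sequence produced for a single query term."""
    term = term.lower()
    return EXPANSIONS.get(term, [term])


def rewrite_query(query):
    """Rewrite each clause; a multi-token expansion becomes a quoted phrase."""
    clauses = []
    for clause in query.split():
        prefix = ""
        if clause[0] in "+-":
            # Keep the required/prohibited marker attached to the clause.
            prefix, clause = clause[0], clause[1:]
        tokens = analyze(clause)
        if len(tokens) == 1:
            clauses.append(prefix + tokens[0])
        else:
            clauses.append(prefix + '"' + " ".join(tokens) + '"')
    return " ".join(clauses)


print(rewrite_query("+HaagenDazs chocolate"))  # +"ice cream" chocolate
```

The point is only that the expanded tokens stay together as one phrase
clause and inherit the '+' of the original term.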
Another case, which does not seem to be supported, is when a token is
replaced with a sequence of tokens, each representing an *alternative*
meaning. Here is an example:
'dog' -> 'dog', 'pet'
'cat' -> 'cat', 'pet'
'pet' -> 'pet'
When you search for 'pet' you also want to match documents with 'dog' and
'cat', but when you search for 'dog' you don't want to match 'cat' or 'pet'.
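One way to get this asymmetric matching, sketched here in Python with a
toy inverted index rather than with Lucene itself, is to apply the
expansion only while indexing and to leave query terms untouched.
INDEX_EXPANSIONS, build_index() and search() are hypothetical names used
only for this illustration; whether the current tokenizer and query
parser can be set up this way is exactly what I am asking about.

```python
from collections import defaultdict

# Hypothetical index-time expansion table, copied from the example above.
INDEX_EXPANSIONS = {
    "dog": ["dog", "pet"],
    "cat": ["cat", "pet"],
    "pet": ["pet"],
}


def build_index(docs):
    """Build a toy inverted index, adding the alternative terms per word."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for word in text.lower().split():
            for term in INDEX_EXPANSIONS.get(word, [word]):
                index[term].add(doc_id)
    return index


def search(index, term):
    """Look up a single term as typed; no expansion is applied to queries."""
    return sorted(index.get(term.lower(), set()))


docs = {1: "my dog barks", 2: "the cat sleeps", 3: "a pet shop"}
index = build_index(docs)

print(search(index, "pet"))  # [1, 2, 3]
print(search(index, "dog"))  # [1]
print(search(index, "cat"))  # [2]
```

Because 'pet' is added to the documents that mention 'dog' or 'cat', a
search for 'pet' finds them, while a search for 'dog' still matches only
the documents that actually contain 'dog'.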
> > The FAQ is aimed at programmers that embed Lucene in their
> > application and not at their end users.
>
> Who will still have to create documentation for _their_ (naive) users.
> Why not call attention to the issues that will undoubtedly bite their
> users? Why not give them something they can basically cut and paste
> from?
Great idea. Write a manual for end users and we
will include it in the FAQ or have it in a separate document.
Tal
_______________________________________________
Lucene-dev mailing list
[EMAIL PROTECTED]
http://lists.sourceforge.net/lists/listinfo/lucene-dev