On Sat, Sep 25, 2010 at 1:04 AM, Andy <angelf...@yahoo.com> wrote:

>
> But I thought specialized analyzers like CJKAnalyzer are designed for those
> languages, which don't use whitespace to separate words.
>

yes


>
> Isn't it up to the tokenizer, not the QueryParser, to decide how to split
> the query into tokens?
>

yes


> I'm really confused.
>

actually it sounds like you understand the situation perfectly!!


> If Solr's QueryParser will only split on whitespace no matter what then
> what is the point of using CJKAnalyzer?


> It sounds like Solr would be pretty useless for languages like CJK. Is
> there any work around for this? Any CJK sites using Solr?
>

if you do not want all queries to be phrasequeries, you should use:

<fieldType name="text" class="solr.TextField"
autoGeneratePhraseQueries="false">

then the lack of whitespace between words will not cause phrase queries. if
you use this option, phrase queries will only be caused if the user
explicitly puts terms in double quotes.

-- 
Robert Muir
rcm...@gmail.com

Reply via email to