On Thu, Mar 17, 2011 at 9:18 PM, abiratsis <abirat...@gmail.com> wrote:
>
> Basicaly I don't know what the best approach is for handling a multilingual
> case like mine e.g:should I create a seperate index for each language?
>

In this particular case (Greek, English), they use totally distinct
characters. so their terms will never conflate with each other, their
stemmers will never mess with the other language's text, etc.

So I would:
a. switch from LowerCaseFilter to GreekLowerCaseFilter... it
lowercases english the same way, don't worry.
b. add greek stopwords file to your stopfilter. stopfilterfactory can
take multiple file arguments... just separate them with a comma.
c. add the greek stemmer right after the porter stemmer.

then your field works fine for greek and english...

Reply via email to