sergiu gordea writes:
> Daan Hoogland wrote:
> 
> >Hi all,
> >
> >I am trying to create different indices using different Analyzer classes. I 
> >tried Standard, German, Russian, and CJK. They all produce exactly the 
> >same index file (md5-wise). There are over 280 pages, so I expected at 
> >least some differences.
> >
> >  
> >
> Take a look in the lucene source code... Maybe you will find the answer ...
> I asume that all the pages you indexed were written in English, 
> therefore is normal that german, russian and cjk analyzers to
> create identic indexex, but htey should be different  than english one 
> (StandardAnalyzer)
> 
The German analyzer definitely won't leave English text as it is, since it
does algorithmic stemming.
E.g. your text becomes
tak a look in the luc sourc cod mayb you will find the answ i asum tha all the pag you 
indexed wer writt in english therefor is normal tha germa russia and cjk analyx to 
crea identic indexex but htey should be diff tha english one standardanalyx
while the StandardAnalyzer does not stem at all and gives
take a look in the lucene source code maybe you will find the answer i asume that all 
the pages you indexed were written in english therefore is normal that german russian 
and cjk analyzers to create identic indexex but htey should be different than english 
one standardanalyzer

I'd rather suspect a problem in the indexing code, e.g. the chosen
analyzer never actually being used when the index is written.
So my advice is to check what each analyzer really produces.
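
To illustrate the kind of check I mean, here is a minimal, self-contained sketch. It does not use Lucene: plainTokens and stemmedTokens are hypothetical toy stand-ins for a non-stemming and a stemming analyzer, and crudeStem is a made-up suffix stripper, not the real German stemmer. With Lucene itself you would feed the same text through each analyzer's tokenStream and print the terms. If the token lists differ, the md5 sums should differ too.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

class AnalyzerCheck {

    // Toy stand-in for a non-stemming analyzer: lowercase, split on non-letters.
    static List<String> plainTokens(String text) {
        List<String> out = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^\\p{L}]+")) {
            if (!t.isEmpty()) {
                out.add(t);
            }
        }
        return out;
    }

    // Toy stand-in for a stemming analyzer (assumption: NOT Lucene's stemmer).
    static List<String> stemmedTokens(String text) {
        List<String> out = new ArrayList<>();
        for (String t : plainTokens(text)) {
            out.add(crudeStem(t));
        }
        return out;
    }

    // Hypothetical suffix stripper: removes the first matching suffix,
    // but only if more than two characters would remain.
    static String crudeStem(String t) {
        for (String suffix : new String[] {"en", "er", "e", "s"}) {
            if (t.length() > suffix.length() + 2 && t.endsWith(suffix)) {
                return t.substring(0, t.length() - suffix.length());
            }
        }
        return t;
    }

    // Hex md5 of the space-joined tokens, mirroring the md5 comparison of index files.
    static String md5(List<String> tokens) {
        try {
            MessageDigest md = MessageDigest.getInstance("MD5");
            byte[] digest = md.digest(String.join(" ", tokens).getBytes(StandardCharsets.UTF_8));
            StringBuilder sb = new StringBuilder();
            for (byte b : digest) {
                sb.append(String.format("%02x", b));
            }
            return sb.toString();
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        String text = "Take a look in the lucene source code";
        List<String> plain = plainTokens(text);
        List<String> stemmed = stemmedTokens(text);
        System.out.println(plain);
        System.out.println(stemmed);
        System.out.println(md5(plain).equals(md5(stemmed)) ? "same digest" : "different digest");
    }
}
```

If two really different analyzers still give you identical output in such a check, the analyzer you pass in is probably not the one being applied.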

Morus
