Hi Daniel
How can I implement a custom Russian factory with various tokenizers and filters?
Can you provide some code examples?
Regards,
Andrew
Daniel Alheiros wrote:
>
> Hi Andrew
>
> Yes, I saw that. As I'm not knowledgeable in Russian I had to infer it was
> adequate. But as you have much mor
Hi Otis
Yes, I have tried different languages. At least English, French, German and
Finnish.
part of query analyser:
example query: "списки arrondissement turvallisuuden" (Russian, French and
Finnish words)
results of analysis.jsp (in Solr admin):
org.apache.solr.analysis.WhitespaceTokenize
Hi
RussianAnalyzer produces Russian stemmed forms, but
SnowballPorterFilterFactory with language="Russian" leaves _all_ Russian
content unchanged.
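For readers following along, the two setups being compared could look roughly like the schema.xml sketch below. This is an illustration only, not the configuration from this message: the field type names `text_ru_analyzer` and `text_ru_snowball` are hypothetical, and the whitespace tokenizer and lowercase filter in the second variant are assumptions about a typical filter chain.

```xml
<!-- Hypothetical sketch for schema.xml; field type names are illustrative only. -->

<!-- Variant A: Lucene's RussianAnalyzer used as a single, fixed analyzer. -->
<fieldType name="text_ru_analyzer" class="solr.TextField">
  <analyzer class="org.apache.lucene.analysis.ru.RussianAnalyzer"/>
</fieldType>

<!-- Variant B: an explicit tokenizer/filter chain with the Snowball stemmer.
     The tokenizer and lowercase filter are assumed, not taken from the thread. -->
<fieldType name="text_ru_snowball" class="solr.TextField">
  <analyzer>
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.SnowballPorterFilterFactory" language="Russian"/>
  </analyzer>
</fieldType>
```

Running the same Russian word through both field types in analysis.jsp should make the difference in stemming output visible.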
hossman wrote:
>
>
> : Subject: Stemmer bug?
>
> can you elaborate on what exactly you view as a bug?
>
> if the issue is just that one of the e
Working config (with proper Russian stemming):
Non-working config (no Russian stemming):
Hi, Daniel
The stemmer in RussianAnalyzer works as expected. But this analyzer doesn't
allow any Solr customization: all stopwords are hardcoded, and there is no
support for a custom tokenizer or for synonyms.
RussianAnalyzer is similar to this scheme:
standard tokenizer
standard filter factory
word deli
other Analyzer?
>
> Regards,
> Daniel
>
>
> On 9/7/07 16:06, "Andrew Stromnov" <[EMAIL PROTECTED]> wrote:
>
>> списки arrondissement turvallisuuden
>
>
Tried on JDK1.6p2 on MS Vista and CentOS.
query analyser config:
...
...
Query: списки arrondissement turvallisuuden
Analysis.jsp result:
org.apache.solr.analysis.WhitespaceTokenizerFactory {}
term position 1 2 3
term text списки arrondissement turvallisuuden
org.a