Re: Problem with Russian stemmer in Solr 1.2

2007-07-17 Thread Andrew Stromnov
Hi Daniel How to implement custom Russian factory with various Tokenizers and Filters? Can you provide some code examples? Regards, Andrew Daniel Alheiros wrote: > > Hi Andrew > > Yes, I saw that. As I'm not knowledgeable in Russian I had to infer it was > adequate. But as you have much mor

Re: Stemmer bug?

2007-07-11 Thread Andrew Stromnov
Hi Otis Yes, I have tried different languages. At least English, French, German and Finnish. part of query analyser: example query: "списки arrondissement turvallisuuden" (russian, french and finnish words) results of analysis.jsp (in Solr admin): org.apache.solr.analysis.WhitespaceTokenize

Re: Stemmer bug?

2007-07-10 Thread Andrew Stromnov
Hi RussianAnalyzer produces russian stemmed forms, but SnowballPorterFilterFactory with language="Russian" leaves _all_ russian content unchanged. hossman wrote: > > > : Subject: Stemmer bug? > > can you elaborate on what exactly you view as a bug? > > if the issue is just that one of the e

Stemmer bug?

2007-07-10 Thread Andrew Stromnov
Working config (with proper russian stemming): Non-working config (no russian stemming): -- View this message in context: http://www.nabble.com/Problem-with-Russian-stemmer-in-Sol

Re: Problem with Russian stemmer in Solr 1.2

2007-07-09 Thread Andrew Stromnov
Hi, Daniel Stemmer in RussianAnalyser works as expected. But this analyser doesn't allow any Solr customization. All stopwords are hardcoded, no support for custom tokenizer, no synonym support. RussianAnalyser is similar to this scheme: standard tokenizer standard filter factory word deli

Re: Problem with Russian stemmer in Solr 1.2

2007-07-09 Thread Andrew Stromnov
other Analyzer? > > Regards, > Daniel > > > On 9/7/07 16:06, "Andrew Stromnov" <[EMAIL PROTECTED]> wrote: > >> списки arrondissement turvallisuuden > > > http://www.bbc.co.uk/ > This e-mail (and any attachments) is confidential and may co

Problem with Russian stemmer in Solr 1.2

2007-07-09 Thread Andrew Stromnov
Tried on JDK1.6p2 on MS Vista and CentOS. query analyser config: ... ... Query: списки arrondissement turvallisuuden Analysis.jsp result: org.apache.solr.analysis.WhitespaceTokenizerFactory {} term position 1 2 3 term text списки arrondissement turvallisuuden org.a