Strange behavior when searching with accents

2007-09-20 Thread Thierry Collogne
Hello, We are experiencing some strange behavior while searching with words containing accents. We are using two examples "rené" and "matthé" When we search for "rené" or for "rene", we get the same results, so that is ok. But when we search for "matthé" or for "matthe", we get two totally differ

Re: Strange behavior when searching with accents

2007-09-20 Thread Thorsten Scherler
On Thu, 2007-09-20 at 10:11 +0200, Thierry Collogne wrote: > Hello, > > We are experiencing some strange behavior while searching with words > containing accents. > We are using two examples "rené" and "matthé" > > When we search for "rené" or for "rene", we get the same results, so that is > ok.

Re: Strange behavior when searching with accents

2007-09-20 Thread Bertrand Delacretaz
On 9/20/07, Thierry Collogne <[EMAIL PROTECTED]> wrote: > ..when we search for "matthé" or for "matthe", we get two totally > different results The analyzer admin tool should help you find out what's happening, see http://wiki.apache.org/solr/FAQ#head-b25df8c8393bbcca28f1f344c432975002e29ca9

Re: Strange behavior when searching with accents

2007-09-20 Thread Thierry Collogne
We are using this schema definition I will take a look at the analyzer took. Thank you both for the quick response. O

Re: Strange behavior when searching with accents

2007-09-20 Thread Thierry Collogne
I have entered the the matthé term in the the analyzer, but as far as I understand, it should be ok. I have made some screenshots with the results. http://farm2.static.flickr.com/1407/1412619772_0b697789cd_o.jpg http://farm2.static.flickr.com/1245/1412619774_3351b287bc_o.jpg I find it strange th

Re: Strange behavior when searching with accents

2007-09-20 Thread Thorsten Scherler
On Thu, 2007-09-20 at 13:33 +0200, Thierry Collogne wrote: > We are using this schema definition > Thierry, try to move the solr.ISOLatin1AccentFilterFactory up the filter cue, like: ... ... for both indexing and query. This way you make sure that all accent are gone before you do further

Re: Strange behavior when searching with accents

2007-09-20 Thread Thorsten Scherler
On Thu, 2007-09-20 at 14:01 +0200, Thierry Collogne wrote: > I have entered the the matthé term in the the analyzer, but as far as I > understand, it should be ok. I have made some screenshots with the results. > > http://farm2.static.flickr.com/1407/1412619772_0b697789cd_o.jpg > > http://farm2.s

Re: Strange behavior when searching with accents

2007-09-20 Thread Bertrand Delacretaz
On 9/20/07, Thierry Collogne <[EMAIL PROTECTED]> wrote: > ...Thank you very much. Moving the up in the chain fixed it Yes, the problem was the EnglishPorterFilterFactory before the accents removal: the stemmer doesn't know about accents, so no stemming occured on "matthé" whereas "matthe" wa

Re: Strange behavior when searching with accents

2007-09-20 Thread Thierry Collogne
Thorsten, Thank you very much. Moving the up in the chain fixed it. On 20/09/2007, Thorsten Scherler <[EMAIL PROTECTED]> wrote: > > On Thu, 2007-09-20 at 14:01 +0200, Thierry Collogne wrote: > > I have entered the the matthé term in the the analyzer, but as far as I > > understand, it should be

Re: Strange behavior when searching with accents

2007-09-20 Thread Thierry Collogne
We are indexing both french and dutch. I will take a look at SnowballPorterFilterFactory later, but thanks for the advice. On 20/09/2007, Bertrand Delacretaz <[EMAIL PROTECTED]> wrote: > > On 9/20/07, Thierry Collogne <[EMAIL PROTECTED]> wrote: > > > ...Thank you very much. Moving the up in the c

Re: Strange behavior when searching with accents

2007-09-20 Thread Thorsten Scherler
On Thu, 2007-09-20 at 15:27 +0200, Bertrand Delacretaz wrote: > On 9/20/07, Thierry Collogne <[EMAIL PROTECTED]> wrote: > > > ...Thank you very much. Moving the up in the chain fixed it > > Yes, the problem was the EnglishPorterFilterFactory before the accents > removal: the stemmer doesn't

Re: Strange behavior when searching with accents

2007-09-20 Thread Bertrand Delacretaz
On 9/20/07, Thorsten Scherler <[EMAIL PROTECTED]> wrote: > ...Betrand, does the French Snowball work fine?... I've seen some weirdnesses, like "tennis" and "tenir" (means to hold) both stemmed to "ten", but in all of our (simple) tests it was ok. The application where we're using it does not requ

RE: Strange behavior when searching with accents

2007-09-20 Thread Lance Norskog
-Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Bertrand Delacretaz Sent: Thursday, September 20, 2007 8:11 AM To: solr-user@lucene.apache.org Subject: Re: Strange behavior when searching with accents On 9/20/07, Thorsten Scherler <[EMAIL PROTECTED]>

RE: Strange behavior when searching with accents

2007-09-21 Thread Thorsten Scherler
t: Thursday, September 20, 2007 8:11 AM > To: solr-user@lucene.apache.org > Subject: Re: Strange behavior when searching with accents > > On 9/20/07, Thorsten Scherler <[EMAIL PROTECTED]> > wrote: > > ...Betrand, does the French Snowball work fine?... > > I'v

RE: Strange behavior when searching with accents

2007-09-21 Thread Lance Norskog
f Bertrand Delacretaz > Sent: Thursday, September 20, 2007 8:11 AM > To: solr-user@lucene.apache.org > Subject: Re: Strange behavior when searching with accents > > On 9/20/07, Thorsten Scherler > <[EMAIL PROTECTED]> > wrote: > > ...Betrand, does the French Snowball