n obsolete.
Thanks a lot for your help!
best
Matthias
-----Original Message-----
From: Thomas Traeger [mailto:[EMAIL PROTECTED]
Sent: Wednesday, 26 September 2007 23:44
To: solr-user@lucene.apache.org
Subject: Re: Converting German special characters / umlaute
Try the SnowballPorterFil
On Thu, 2007-09-27 at 13:26 -0400, J.J. Larrea wrote:
> At 12:13 PM -0400 9/27/07, Steven Rowe wrote:
> >Chris Hostetter wrote:
...
> As for implementation, the first part could easily and flexibly be accomplished
> with the current PatternReplaceFilter, and I'm thinking the second could be
> done w
At 12:13 PM -0400 9/27/07, Steven Rowe wrote:
>Chris Hostetter wrote:
>> : is there an analyzer which automatically converts all German special
>> : characters to their specific dissected form, such as ü to ue and ä to
>> : ae, etc.?!
>>
>> See also the ISOLatin1TokenFilter which does this regardle
Accent transforms are language-specific, so an accent filter
should take an ISO language code as an argument.
Some examples:
* In French and English, a diaeresis is a hint to pronounce neighboring
vowels separately, as in coöp, naïve, or Noël.
* In German, ü transforms to ue.
* In Swedish, ö i
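To illustrate the idea of an accent filter that takes a language code, here is a minimal Python sketch. It is not Solr or Lucene code. The names `LANGUAGE_TRANSFORMS`, `strip_diacritics`, and `fold_accents` are made up for this example. Only the German ü to ue style mappings come from this thread; the ß to ss entry and the fallback of simply dropping diacritics for other languages are assumptions.

```python
import unicodedata

# Hypothetical per-language tables. The German umlaut entries follow the
# thread; the "ß" -> "ss" entry is an assumption added for illustration.
LANGUAGE_TRANSFORMS = {
    "de": {
        "ä": "ae", "ö": "oe", "ü": "ue",
        "Ä": "Ae", "Ö": "Oe", "Ü": "Ue",
        "ß": "ss",
    },
}


def strip_diacritics(text):
    """Drop combining marks after canonical decomposition (NFD)."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def fold_accents(text, lang):
    """Apply the language-specific table first, then strip what is left."""
    # Compose first so that "u" + combining diaeresis matches the "ü" key.
    text = unicodedata.normalize("NFC", text)
    table = LANGUAGE_TRANSFORMS.get(lang, {})
    replaced = "".join(table.get(ch, ch) for ch in text)
    return strip_diacritics(replaced)
```

With these assumptions, `fold_accents("Müller", "de")` gives `Mueller`, while `fold_accents("naïve", "fr")` gives `naive`, because no French table is defined and the fallback just removes the diaeresis.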
Chris Hostetter wrote:
> : is there an analyzer which automatically converts all German special
> : characters to their specific dissected form, such as ü to ue and ä to
> : ae, etc.?!
>
> See also the ISOLatin1TokenFilter which does this regardless of language.
Actually, ISOLatin1TokenFilter doe
: is there an analyzer which automatically converts all German special
: characters to their specific dissected form, such as ü to ue and ä to
: ae, etc.?!
See also the ISOLatin1TokenFilter which does this regardless of language.
: I also would like to have, that the search is always run against
Try the SnowballPorterFilterFactory described here:
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters
You should use the German2 variant that converts ä and ae to a, ö and oe
to o and so on. More details:
http://snowball.tartarus.org/algorithms/german2/stemmer.html
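As a rough illustration of why both spellings end up matching, the Python sketch below mimics only the umlaut handling described on that page: ae, oe, and ue (unless preceded by q) are first mapped to ä, ö, ü, and the umlauts are then removed. It is not the Snowball implementation. The function name `german2_style_fold` is made up for this example, and the sketch skips the actual stemming and the stemmer's special handling of u and y between vowels.

```python
import re


def german2_style_fold(word):
    """Approximate the net effect of German2's umlaut handling (no stemming)."""
    word = word.lower()
    word = word.replace("ß", "ss")
    # Map the digraph spellings onto umlauts ...
    word = word.replace("ae", "ä").replace("oe", "ö")
    # ... but leave "ue" alone when it follows "q" (as in "quelle").
    word = re.sub(r"(?<!q)ue", "ü", word)
    # Finally remove the umlauts, so "ä" and "ae" both end up as "a".
    return word.translate(str.maketrans("äöü", "aou"))
```

Under these assumptions, `german2_style_fold("Müller")` and `german2_style_fold("Mueller")` both return `muller`, so a query in either spelling would hit the same indexed term. The cost is that words with a genuine ae, oe, or ue sequence get folded too.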
Every document in