Hi,
we use a modified ISOLatin1AccentFilter bit to replace German accents by ae,
oe, ue and so on for that purpose.
In the code you will see a switch for the characters.
You need to change it from
case '\u00E4' : // small ä
output[outputPos++] = 'a';
output[outputPos++] = 'e';
break;
To something like this
case '\u00E4' : // small ä
output[outputPos++] = 'a';
break;
for the characters you want to replace.
Best Regards
Lutz Fechner
-----Original Message-----
From: Dyga, Adam [mailto:[email protected]]
Sent: Montag, 19. November 2012 10:47
To: [email protected]
Subject: German 'ue' -> 'u' conversion
Hello,
I have two questin regarding handling German umlauts in Lucene:
1. I'm trying to find a way to convert German Umlauts written as 'ue', 'ae',
etc to folded form 'u', 'a' and so on.
This is done by GermanAnalyzer (and German2StemFilter used by it), but
unfortunately it also does stemming which is very undesired in my case.
Is there any other filter that can do only the 'ua' -> 'u' conversion?
2. Is there any filter that does 'ü' -> 'ue' (NOT 'u') conversion? What I'm
trying to achieve is that word "über" should be found in the index whenever the
user searches for " über" or "ueber" , but NOT "uber".
Regards,
AD
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]