I'm learning how to index and search German text today, and I understand that vowels with umlauts are conventionally expanded into two ASCII characters, e.g. "für" -> "fuer". So people may search for the expanded form "fuer", they may search with the diacritic ("für"), or they may lazily search using the stripped form "fur".
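To make the three spellings concrete, here is a minimal sketch in plain Java. It uses no Lucene APIs, and the class and method names (UmlautForms, expand, strip, variants) are hypothetical, made up only for illustration. It assumes that only the umlauted vowels need the two-letter expansion and that the stripped form is simply the base letter with combining marks removed.

```java
import java.text.Normalizer;
import java.util.LinkedHashSet;
import java.util.Set;

// Hypothetical helper, not part of Lucene: it only illustrates the three spellings.
class UmlautForms {

    // Conventional German expansion: a-umlaut -> ae, o-umlaut -> oe, u-umlaut -> ue.
    // Unicode escapes are used so the source compiles regardless of file encoding.
    static String expand(String s) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            switch (c) {
                case '\u00e4': sb.append("ae"); break; // a-umlaut
                case '\u00f6': sb.append("oe"); break; // o-umlaut
                case '\u00fc': sb.append("ue"); break; // u-umlaut
                case '\u00c4': sb.append("Ae"); break; // A-umlaut
                case '\u00d6': sb.append("Oe"); break; // O-umlaut
                case '\u00dc': sb.append("Ue"); break; // U-umlaut
                default: sb.append(c);
            }
        }
        return sb.toString();
    }

    // Stripped form: decompose to base letter + combining mark (NFD),
    // then drop every combining mark.
    static String strip(String s) {
        String decomposed = Normalizer.normalize(s, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}", "");
    }

    // All distinct spellings a searcher might type: original, expanded, stripped.
    static Set<String> variants(String s) {
        Set<String> out = new LinkedHashSet<>();
        out.add(s);
        out.add(expand(s));
        out.add(strip(s));
        return out;
    }
}
```

For "für", variants returns the original plus "fuer" and "fur". For a word with no diacritics it returns just that word.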
My question: is there a standard CharFilter or TokenFilter that expands to both ASCII forms, for characters with umlauts and perhaps for other diacritics I'm unaware of in other languages that have similar multiple ASCII renderings?

-Mike