Because, as far as I understand, in German it's semantically the same to write über or ueber (although ueber is less often used). I guess this is not true only for personal names. Syntactically, "uber" is wrong but users sometimes search for this also.
On Sat, Nov 29, 2014 at 8:29 PM, Itamar Syn-Hershko <ita...@code972.com> wrote: > Why do you need it as ueber? what I'm usually doing is end up with [über, > uber] at the same position, possibly marking the first as being the > original. Seeing Jurgen's response, I seem to be on the right path... > > -- > > Itamar Syn-Hershko > http://code972.com | @synhershko <https://twitter.com/synhershko> > Freelance Developer & Consultant > Author of RavenDB in Action <http://manning.com/synhershko/> > > On Sat, Nov 29, 2014 at 9:21 PM, Krešimir Slugan < > kresimir.slu...@gmail.com> wrote: > >> Which token filter can I use to replace words like über with ueber? >> >> On Saturday, November 29, 2014 8:16:14 PM UTC+1, Itamar Syn-Hershko wrote: >>> >>> What I'm saying is don't use char_filter, and use the token filters >>> chain to achieve that >>> >>> -- >>> >>> Itamar Syn-Hershko >>> http://code972.com | @synhershko <https://twitter.com/synhershko> >>> Freelance Developer & Consultant >>> Author of RavenDB in Action <http://manning.com/synhershko/> >>> >>> On Sat, Nov 29, 2014 at 9:02 PM, Krešimir Slugan <kresimi...@gmail.com> >>> wrote: >>> >>>> Hi Itamar, >>>> >>>> I don't think this solves my problem. I'm aware that you can preserve >>>> original with ASCIIfolding but since char_filter is applied >>>> before ASCIIfolding then there would not be any umlauts to fold :) If I >>>> could apply char_filter on the end that would be ok, or preserve original >>>> with char_filter. >>>> >>>> Best, >>>> >>>> Kresimir >>>> >>>> On Saturday, November 29, 2014 5:41:11 PM UTC+1, Itamar Syn-Hershko >>>> wrote: >>>>> >>>>> You may find the approach I give in the end of this talk helpful: >>>>> https://skillsmatter.com/skillscasts/4968-approaches-to-multi-lingual- >>>>> text-search-with-elasticsearch-and-lucene >>>>> >>>>> -- >>>>> >>>>> Itamar Syn-Hershko >>>>> http://code972.com | @synhershko <https://twitter.com/synhershko> >>>>> Freelance Developer & Consultant >>>>> Author of RavenDB in Action <http://manning.com/synhershko/> >>>>> >>>>> On Tue, Nov 18, 2014 at 12:30 PM, Krešimir Slugan < >>>>> kresimi...@gmail.com> wrote: >>>>> >>>>>> Hi, >>>>>> >>>>>> To handle German language in search I have to be able to provide same >>>>>> results if user searches for e.g über, uber or ueber >>>>>> >>>>>> I would do this at the index time where I would have über in the >>>>>> data. But if I use just asciifolding filter I lose information that >>>>>> this was work with "umlaut" and I can't get ueber token. If I use >>>>>> char_fiter, it is applied before analysis and I would not be able to get >>>>>> uber. >>>>>> >>>>>> Is it possible to preserve original with char filter or apply it >>>>>> after the analysis? >>>>>> >>>>>> Cheers, >>>>>> >>>>>> Kresimir >>>>>> >>>>>> -- >>>>>> You received this message because you are subscribed to the Google >>>>>> Groups "elasticsearch" group. >>>>>> To unsubscribe from this group and stop receiving emails from it, >>>>>> send an email to elasticsearc...@googlegroups.com. >>>>>> To view this discussion on the web visit https://groups.google.com/d/ >>>>>> msgid/elasticsearch/f18f94bc-58e0-4bbf-a445-b45ba4db11f3%40goo >>>>>> glegroups.com >>>>>> <https://groups.google.com/d/msgid/elasticsearch/f18f94bc-58e0-4bbf-a445-b45ba4db11f3%40googlegroups.com?utm_medium=email&utm_source=footer> >>>>>> . >>>>>> For more options, visit https://groups.google.com/d/optout. >>>>>> >>>>> >>>>> -- >>>> You received this message because you are subscribed to the Google >>>> Groups "elasticsearch" group. >>>> To unsubscribe from this group and stop receiving emails from it, send >>>> an email to elasticsearc...@googlegroups.com. >>>> To view this discussion on the web visit https://groups.google.com/d/ >>>> msgid/elasticsearch/4d362cd4-21a4-486c-bf57-f2de5949f072% >>>> 40googlegroups.com >>>> <https://groups.google.com/d/msgid/elasticsearch/4d362cd4-21a4-486c-bf57-f2de5949f072%40googlegroups.com?utm_medium=email&utm_source=footer> >>>> . >>>> >>>> For more options, visit https://groups.google.com/d/optout. >>>> >>> >>> -- >> You received this message because you are subscribed to the Google Groups >> "elasticsearch" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to elasticsearch+unsubscr...@googlegroups.com. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/elasticsearch/8e3cc964-59fc-4be7-bb13-b1411a312ade%40googlegroups.com >> <https://groups.google.com/d/msgid/elasticsearch/8e3cc964-59fc-4be7-bb13-b1411a312ade%40googlegroups.com?utm_medium=email&utm_source=footer> >> . >> >> For more options, visit https://groups.google.com/d/optout. >> > > -- > You received this message because you are subscribed to a topic in the > Google Groups "elasticsearch" group. > To unsubscribe from this topic, visit > https://groups.google.com/d/topic/elasticsearch/B-JO9993Avo/unsubscribe. > To unsubscribe from this group and all its topics, send an email to > elasticsearch+unsubscr...@googlegroups.com. > To view this discussion on the web visit > https://groups.google.com/d/msgid/elasticsearch/CAHTr4ZuvKNq58xryBXJ5FLewOafWd0LvsaTADh%2BeYCtHGaRK2A%40mail.gmail.com > <https://groups.google.com/d/msgid/elasticsearch/CAHTr4ZuvKNq58xryBXJ5FLewOafWd0LvsaTADh%2BeYCtHGaRK2A%40mail.gmail.com?utm_medium=email&utm_source=footer> > . > > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "elasticsearch" group. To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAK4NRa%3DeXOeYcANXC71qvXLyK8RG%3D4L5ijbNXPO9bwdig3yD%2Bg%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.