Note that ISOLatin1AccentFilter converts accent characters only from ISO-8859-1 character set. Which means that if you need to convert accents of eastern European languages you need to write your own accent filter. wojtek
2008/7/16 Petite Abeille <[EMAIL PROTECTED]>: > > On Jul 16, 2008, at 10:58 AM, [EMAIL PROTECTED] wrote: > >> Simple example is Kraków search should also bring Krakow also in search >> results. > > As pointed out previously, you need to transliterate your input using > something like ISOLatinFilter or such. > > For example, searching for 'aaiun' should return 'Aaiún' and vis-versa: > > http://svr225.stepx.com:3388/search?q=aaiun > http://svr225.stepx.com:3388/el-aaiun > > Sean M. Burke's Unidecode provides an extensive transliteration of Unicode > into ASCII: > > http://interglacial.com/~sburke/tpj/as_html/tpj22.html > > E.g.: > > Москва́ Moskva > 北京 beijing > Ἀθηνᾶ Athena > 서울 seoul > 東京 dongjing > 京都市 jingdushi > नेपाल nepaal > > Cheers, > > -- > PA. > http://alt.textdrive.com/nanoki/ > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > >
