Chinese sorting

Nils Knappmeier Wed, 17 Dec 2014 04:41:15 -0800

Hi,

is there any implementation for a chinese collator in Lucene. I've seenthat there is a chinese analyzer which uses Hidden Markov Models. Butsorting seems to be an issue on its own and all my googling hasn't ledto any results yet.

I understand that this is not a trivial issue and I've read that thechinese tend to prefer other ordering than by name, since sorting ordersare so complicated that nobody wants to use them. But we will have tosort search results by name, even though the name is chinese (simplifiedchinese at the moment, but traditional may also appear later) andcurrenty chinese words seem to be ordered by their unicode-number, whichseems not to be the right order.


Thanks in advance for any suggestion,
 Nils

Chinese sorting

Reply via email to