Hi -

What's the current recommendation for searching/analyzing Korean?

The reference guide only lists CJK:
https://cwiki.apache.org/confluence/display/solr/Language+Analysis

I see a bunch of work was done on
https://issues.apache.org/jira/browse/LUCENE-4956, but it doesn't look like
that was ever committed - and the last comment was years ago.

There seem to be a few version of this in the wild, both more recent:
https://github.com/juncon/arirang.lucene-analyzer-5.0.0, and the original:
https://sourceforge.net/projects/lucenekorean/ but I'm not sure what's the
canonical source at this point.

I also see this: https://bitbucket.org/eunjeon/mecab-ko-lucene-analyzer

Suggestions?

Thanks,

Tom

Reply via email to