[ 
https://issues.apache.org/jira/browse/LUCENE-4956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13798686#comment-13798686
 ] 

SooMyung Lee edited comment on LUCENE-4956 at 10/18/13 1:06 AM:
----------------------------------------------------------------

Hi [~rcmuir],

Thank you for your comment. I can reconstitute the hanja-hangul mappings file 
by myself if we cannot find other sources with clear licenses. I can easily get 
hanja list that often appear in Korean sentence. after then I'll look up online 
dictionary. I can start with 3,000~4,000 hanjas that is most often appeared in 
Korean sentences.


was (Author: soomyung):
Hi [~rcmuir],

Thank you for your comment. I can reconstitute the hanja-hangul mappings file 
by myself if we cannot find other sources with clear licenses. I can easily get 
hanja list that often appear in Korean sentence. after then I'll look up online 
dictionary. I can start with 3,000~4,000 hanjas that is most often appeared 
Korean sentences.

> the korean analyzer that has a korean morphological analyzer and dictionaries
> -----------------------------------------------------------------------------
>
>                 Key: LUCENE-4956
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4956
>             Project: Lucene - Core
>          Issue Type: New Feature
>          Components: modules/analysis
>    Affects Versions: 4.2
>            Reporter: SooMyung Lee
>            Assignee: Christian Moen
>              Labels: newbie
>         Attachments: eval.patch, kr.analyzer.4x.tar, lucene-4956.patch, 
> lucene4956.patch, LUCENE-4956.patch
>
>
> Korean language has specific characteristic. When developing search service 
> with lucene & solr in korean, there are some problems in searching and 
> indexing. The korean analyer solved the problems with a korean morphological 
> anlyzer. It consists of a korean morphological analyzer, dictionaries, a 
> korean tokenizer and a korean filter. The korean anlyzer is made for lucene 
> and solr. If you develop a search service with lucene in korean, It is the 
> best idea to choose the korean analyzer.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to