[ 
https://issues.apache.org/jira/browse/LUCENE-4956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796648#comment-13796648
 ] 

Uwe Schindler commented on LUCENE-4956:
---------------------------------------

I committed a cleanup of most of the broken and slow resources stuff. It now 
*only* uses Class.getResourceAsStream. I also removed code from the FileUtils 
class (now named DictionaryResources) which was clearly code cloned from 
somewhere else.

The resource loading can be further improved:
- It should not be lazy (isn't thread safe), it should load all resources (like 
kuromoji) exactly one time into a singleton "holder" class.
- We should use WordListLoader and nuke the remaining stuff.
- There is very ineffective and slow code at some places, reloading the same 
file over and over again, just to do a lookup.

The code also has legal problems:
- Trie.java seems to be GPLed (thanks Robert). It seems to be just copied from 
GNUTella (the name says all). So its defeinitely not Apache Licensed

> the korean analyzer that has a korean morphological analyzer and dictionaries
> -----------------------------------------------------------------------------
>
>                 Key: LUCENE-4956
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4956
>             Project: Lucene - Core
>          Issue Type: New Feature
>          Components: modules/analysis
>    Affects Versions: 4.2
>            Reporter: SooMyung Lee
>            Assignee: Christian Moen
>              Labels: newbie
>         Attachments: kr.analyzer.4x.tar, lucene-4956.patch, lucene4956.patch, 
> LUCENE-4956.patch
>
>
> Korean language has specific characteristic. When developing search service 
> with lucene & solr in korean, there are some problems in searching and 
> indexing. The korean analyer solved the problems with a korean morphological 
> anlyzer. It consists of a korean morphological analyzer, dictionaries, a 
> korean tokenizer and a korean filter. The korean anlyzer is made for lucene 
> and solr. If you develop a search service with lucene in korean, It is the 
> best idea to choose the korean analyzer.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to