[ https://issues.apache.org/jira/browse/LUCENE-4956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796648#comment-13796648 ]
Uwe Schindler commented on LUCENE-4956:
---------------------------------------

I committed a cleanup of most of the broken and slow resource-loading code. It now *only* uses Class.getResourceAsStream. I also removed code from the FileUtils class (now named DictionaryResources) that was clearly cloned from somewhere else.

The resource loading can be further improved:
- It should not be lazy (the lazy loading isn't thread safe); it should load all resources (like kuromoji) exactly one time into a singleton "holder" class.
- We should use WordListLoader and nuke the remaining stuff.
- There is very inefficient and slow code in some places, reloading the same file over and over again just to do a lookup.

The code also has legal problems:
- Trie.java seems to be GPLed (thanks Robert). It seems to be just copied from GNUTella (the name says it all). So it's definitely not Apache licensed.

> the korean analyzer that has a korean morphological analyzer and dictionaries
> -----------------------------------------------------------------------------
>
>                 Key: LUCENE-4956
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4956
>             Project: Lucene - Core
>          Issue Type: New Feature
>          Components: modules/analysis
>    Affects Versions: 4.2
>            Reporter: SooMyung Lee
>            Assignee: Christian Moen
>              Labels: newbie
>         Attachments: kr.analyzer.4x.tar, lucene-4956.patch, lucene4956.patch,
>                      LUCENE-4956.patch
>
>
> The Korean language has specific characteristics. When developing a search
> service with Lucene and Solr in Korean, there are some problems in searching
> and indexing. The Korean analyzer solves these problems with a Korean
> morphological analyzer. It consists of a Korean morphological analyzer,
> dictionaries, a Korean tokenizer and a Korean filter. The Korean analyzer is
> made for Lucene and Solr. If you develop a search service with Lucene in
> Korean, it is the best idea to choose the Korean analyzer.
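To illustrate the singleton "holder" suggestion from the comment above, here is a minimal sketch, not the code from the patch. The class name DictionaryHolder, the resource name dictionary.txt, and the one-word-per-line format with "#" comments are all assumptions made for the example. The nested Holder class is initialized by the JVM exactly once, so the resource is read a single time through Class.getResourceAsStream and no explicit locking is needed.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.Collections;
import java.util.HashSet;
import java.util.Set;

/** Hypothetical example class; not the DictionaryResources class from the patch. */
public final class DictionaryHolder {
  // Hypothetical resource name, resolved relative to this class's package.
  private static final String RESOURCE = "dictionary.txt";

  private final Set<String> words;

  private DictionaryHolder(Set<String> words) {
    this.words = words;
  }

  /** Class initialization runs once and is thread safe, so INSTANCE is loaded exactly one time. */
  private static final class Holder {
    static final DictionaryHolder INSTANCE = load();
  }

  public static DictionaryHolder getInstance() {
    return Holder.INSTANCE;
  }

  public boolean contains(String word) {
    return words.contains(word);
  }

  private static DictionaryHolder load() {
    try (InputStream in = DictionaryHolder.class.getResourceAsStream(RESOURCE)) {
      if (in == null) {
        throw new IllegalStateException("Resource not found: " + RESOURCE);
      }
      return new DictionaryHolder(readWords(in));
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
  }

  /** Reads one word per line as UTF-8, skipping blank lines and lines starting with '#'. */
  public static Set<String> readWords(InputStream in) throws IOException {
    Set<String> result = new HashSet<>();
    BufferedReader reader =
        new BufferedReader(new InputStreamReader(in, StandardCharsets.UTF_8));
    String line;
    while ((line = reader.readLine()) != null) {
      line = line.trim();
      if (!line.isEmpty() && !line.startsWith("#")) {
        result.add(line);
      }
    }
    return Collections.unmodifiableSet(result);
  }
}
```

A caller would use DictionaryHolder.getInstance().contains(term) for every lookup, which avoids re-reading the same file each time. In Lucene itself, the parsing step would presumably be delegated to WordListLoader, as the comment proposes, rather than hand-written as in readWords here.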