[
https://issues.apache.org/jira/browse/LUCENE-4956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796648#comment-13796648
]
Uwe Schindler commented on LUCENE-4956:
---------------------------------------
I committed a cleanup of most of the broken and slow resources stuff. It now
*only* uses Class.getResourceAsStream. I also removed code from the FileUtils
class (now named DictionaryResources) which was clearly code cloned from
somewhere else.
The resource loading can be further improved:
- It should not be lazy (isn't thread safe), it should load all resources (like
kuromoji) exactly one time into a singleton "holder" class.
- We should use WordListLoader and nuke the remaining stuff.
- There is very ineffective and slow code at some places, reloading the same
file over and over again, just to do a lookup.
The code also has legal problems:
- Trie.java seems to be GPLed (thanks Robert). It seems to be just copied from
GNUTella (the name says all). So its defeinitely not Apache Licensed
> the korean analyzer that has a korean morphological analyzer and dictionaries
> -----------------------------------------------------------------------------
>
> Key: LUCENE-4956
> URL: https://issues.apache.org/jira/browse/LUCENE-4956
> Project: Lucene - Core
> Issue Type: New Feature
> Components: modules/analysis
> Affects Versions: 4.2
> Reporter: SooMyung Lee
> Assignee: Christian Moen
> Labels: newbie
> Attachments: kr.analyzer.4x.tar, lucene-4956.patch, lucene4956.patch,
> LUCENE-4956.patch
>
>
> Korean language has specific characteristic. When developing search service
> with lucene & solr in korean, there are some problems in searching and
> indexing. The korean analyer solved the problems with a korean morphological
> anlyzer. It consists of a korean morphological analyzer, dictionaries, a
> korean tokenizer and a korean filter. The korean anlyzer is made for lucene
> and solr. If you develop a search service with lucene in korean, It is the
> best idea to choose the korean analyzer.
--
This message was sent by Atlassian JIRA
(v6.1#6144)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]