> 1) KeywordTokenizerFactory seems to be a "tokenizer
> factory" while CJKTokenizer seems to be just a tokenizer.
> Are they the same type of things at all? 
> Could I just replace 
> <tokenizer class="solr.KeywordTokenizerFactory"/>
> with
> <tokenizer
> class="org.apache.lucene.analysis.cjk.CJKTokenizer"/>
> ??


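No, they are not interchangeable. The <tokenizer> element in schema.xml expects a TokenizerFactory, not a Lucene Tokenizer. You should use org.apache.solr.analysis.CJKTokenizerFactory instead.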
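
Something along these lines should work. This is only a sketch: the field type name "text_cjk" is made up for illustration, and it assumes your Solr version ships CJKTokenizerFactory (the "solr." prefix is shorthand for the org.apache.solr.analysis package):

<fieldType name="text_cjk" class="solr.TextField">
      <analyzer>
        <!-- factory class, not the Lucene CJKTokenizer itself -->
        <tokenizer class="solr.CJKTokenizerFactory"/>
      </analyzer>
</fieldType>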


> 2) I'm also interested in trying out SmartChineseAnalyzer
> (http://lucene.apache.org/java/2_9_0/api/contrib-smartcn/org/apache/lucene/analysis/cn/smart/SmartChineseAnalyzer.html)
> However SmartChineseAnalyzer doesn't offer a separate
> tokenizer. It's just an analyzer and that's it. How do I use
> it in Solr?

You can use a Lucene analyzer directly in Solr by naming its class on the <analyzer> element:

<fieldType name="chineese_text" class="solr.TextField">
      <analyzer class="org.apache.lucene.analysis.cn.smart.SmartChineseAnalyzer"/>
</fieldType>
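
A field can then refer to that type by name. A minimal sketch, where the field name "content_zh" and the indexed/stored settings are assumptions for illustration only; the type attribute has to match the fieldType name exactly as declared above:

<!-- hypothetical field; type must equal the fieldType name as spelled in the declaration -->
<field name="content_zh" type="chineese_text" indexed="true" stored="true"/>

Note that when the analyzer is given as a class like this, it is used as a whole, so as far as I know you cannot combine it with <tokenizer> or <filter> children on the same <analyzer> element.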



      
