> 1) KeywordTokenizerFactory seems to be a "tokenizer > factory" while CJKTokenizer seems to be just a tokenizer. > Are they the same type of things at all? > Could I just replace > <tokenizer class="solr.KeywordTokenizerFactory"/> > with > <tokenizer > class="org.apache.lucene.analysis.cjk.CJKTokenizer"/> > ??
You should use org.apache.solr.analysis.CJKTokenizerFactory instead. > 2) I'm also interested in trying out SmartChineseAnalyzer > (http://lucene.apache.org/java/2_9_0/api/contrib-smartcn/org/apache/lucene/analysis/cn/smart/SmartChineseAnalyzer.html) > However SmartChineseAnalyzer doesn't offer a separate > tokenizer. It's just an analyzer and that's it. How do I use > it in Solr? You can use lucene analyzer directly in solr: <fieldType name="chineese_text" class="solr.TextField"> <analyzer class="org.apache.lucene.analysis.cn.smart.SmartChineseAnalyzer"/> </fieldType>