yangjd created HIVEMALL-122:
-------------------------------

             Summary: Added tokenize_cn UDF based upon SmartChineseAnalyzer
                 Key: HIVEMALL-122
                 URL: https://issues.apache.org/jira/browse/HIVEMALL-122
             Project: Hivemall
          Issue Type: New Feature
            Reporter: yangjd


Support word segmentation for Simplified Chinese text based upon 
[org.apache.lucene.analysis.cn.smart.SmartChineseAnalyzer|http://lucene.apache.org/core/5_3_1/analyzers-smartcn/org/apache/lucene/analysis/cn/smart/SmartChineseAnalyzer.html]



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to