Re: Searching of Chinese characters and English

2012-09-05 Thread waynelam
Analyzer class and Tokenizers. should be: This may not help with your searching problem. - Original Message - | From: "waynelam" | To: solr-user@lucene.apache.org | Sent: Wednesday, September 5, 2012 8:07:36 PM | Subject: Re: Searching of Chinese characters and Engli

Re: Searching of Chinese characters and English

2012-09-05 Thread waynelam
different Analyzer. Am I correct? Anyone have an idea or hints? Thanks so much Wayne On 4/9/2012 13:03, waynelam wrote: Hi all, I tried to modified the schema.xml and solrconfig.xml come with Drupal "search_api_solr" modules. I tried to modified it so that it is suitable for an CJK e

Searching of Chinese characters and English

2012-09-03 Thread waynelam
Hi all, I tried to modified the schema.xml and solrconfig.xml come with Drupal "search_api_solr" modules. I tried to modified it so that it is suitable for an CJK environment. I can see Chinese words cut up each 2 words in "Field Analysis". If i use the following query my_ip_address:8080/sol

SmartChineseAnalyzer

2011-12-09 Thread waynelam
Hi all, I checked the documentation of SmartChineseAnalyzer, It looks like it is for Simplified Chinese Only. Does anyone tried to include Traditional Chinese characters also. As the analyzer is based on a dictionary from ICTCLAS1.0. My first thought is maybe i can get it work by simply conver

Re: Searching in Traditional / Simplified Chinese Record

2011-06-20 Thread waynelam
ne option would be to implement something like the synonyms filter to generate the TC for SC when you index the document, which would index both the TC and the SC in the same location. That way your users would be able to search with either TC or SC. Another option would be to use the same synony

Searching in Traditional / Simplified Chinese Record

2011-06-20 Thread waynelam
Hi, I 've recently make change to my schema.xml to support import of Chinese Record. What i want to do is to search both Traditional Chinese(TC) (e.g. ?? )and Simplified Chinese (SC) (e.g. ??) Record when in the same query. I know I can do that by encoding all SC Record to TC. I want to chang