Re: CJK Analyzer indexing japanese word document

Che Dong Tue, 16 Mar 2004 07:30:58 -0800

Some Korean friends tell me they use it successfully for Korean, so I think it should
also work for Japanese. Most often the problem is the locale settings.
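
To illustrate the locale point, here is a minimal sketch of decoding text with an
explicitly named charset instead of relying on the JVM-wide file.encoding default.
The class and method names (ExplicitCharsetDemo, openReader, readAll) are made up
for this example and are not part of Lucene or the CJKAnalyzer. It also assumes the
text extracted from the document really is Shift_JIS bytes, which may not hold for
text pulled out of a Word file.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.io.UncheckedIOException;
import java.nio.charset.Charset;

// Hypothetical demo class; none of these names come from Lucene.
class ExplicitCharsetDemo {

    // Open a Reader that decodes with the named charset,
    // regardless of what file.encoding is set to.
    static Reader openReader(InputStream in, String charsetName) {
        return new InputStreamReader(in, Charset.forName(charsetName));
    }

    // Drain a Reader into a String (helper for the demo only).
    static String readAll(Reader reader) {
        try {
            StringBuilder sb = new StringBuilder();
            char[] buf = new char[1024];
            int n;
            while ((n = reader.read(buf)) != -1) {
                sb.append(buf, 0, n);
            }
            return sb.toString();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // Three Japanese characters, written as escapes so the source
        // file's own encoding does not matter.
        String original = "\u65e5\u672c\u8a9e";
        byte[] sjisBytes = original.getBytes(Charset.forName("Shift_JIS"));

        // Decoding with the charset the bytes were written in recovers the text.
        String decoded = readAll(openReader(new ByteArrayInputStream(sjisBytes), "Shift_JIS"));
        System.out.println(decoded.equals(original)); // prints true

        // Decoding the same bytes with a mismatched charset garbles the text.
        String wrong = readAll(openReader(new ByteArrayInputStream(sjisBytes), "ISO-8859-1"));
        System.out.println(wrong.equals(original)); // prints false
    }
}
```

If the characters are already wrong before they reach the analyzer, changing
CJKTokenizer or CJKAnalyzer will not help, so it is worth checking the decoded
text first.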


Please check weblucene project for xml indexing samples:
http://sourceforge.net/projects/weblucene/ 

Che Dong
----- Original Message ----- 
From: "Chandan Tamrakar" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Tuesday, March 16, 2004 4:31 PM
Subject: CJK Analyzer indexing japanese word document


> 
> I am using the CJKAnalyzer from the Apache sandbox. I have set the Java
> file.encoding setting to SJIS,
> and I am able to index and search Japanese HTML pages. I can see the
> index dumps as I expected. However, when I index a Word document containing
> Japanese characters, it is not indexed as expected. Do I need to change
> anything in the CJKTokenizer and CJKAnalyzer classes?
> I have been able to index a Word document with StandardAnalyzer.
> 
> Thanks in advance,
> chandan
> 
> 
> 
