Some Korean friends tell me they use it successfully for Korean, so I think it also works for Japanese. Mostly the problem is the locale settings.
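To illustrate the locale point, here is a rough sketch (plain JDK only, no Lucene calls) of decoding bytes with an explicitly named charset instead of relying on the platform default that file.encoding controls. The class name ExplicitCharsetDemo and the helper readAll are made up for this example, and I am assuming the runtime ships the Shift_JIS charset, which standard JDK builds normally do. It does not cover extracting text from the Word file itself; it only shows why the same bytes can come out right or wrong depending on the charset used to read them.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class, not part of Lucene or the sandbox analyzers.
public class ExplicitCharsetDemo {

    // Decode a byte stream with an explicitly named charset rather than
    // the platform default (which is what file.encoding influences).
    public static String readAll(InputStream in, Charset charset) throws IOException {
        StringBuilder sb = new StringBuilder();
        try (Reader reader = new InputStreamReader(in, charset)) {
            char[] buf = new char[1024];
            int n;
            while ((n = reader.read(buf)) != -1) {
                sb.append(buf, 0, n);
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) throws IOException {
        // Assumption: Shift_JIS is available in this JDK.
        Charset sjis = Charset.forName("Shift_JIS");

        // Three kanji written as Unicode escapes so the source file's own
        // encoding does not matter.
        String original = "\u65E5\u672C\u8A9E";
        byte[] bytes = original.getBytes(sjis);

        // Same bytes, two different decodings.
        String right = readAll(new ByteArrayInputStream(bytes), sjis);
        String wrong = readAll(new ByteArrayInputStream(bytes), StandardCharsets.ISO_8859_1);

        System.out.println("platform default charset: " + Charset.defaultCharset());
        System.out.println("decoded as Shift_JIS matches original: " + right.equals(original));
        System.out.println("decoded as ISO-8859-1 matches original: " + wrong.equals(original));
    }
}
```

If the text handed to the analyzer was decoded with the wrong charset, the tokenizer never sees real CJK characters, so the analyzer classes themselves may not need any change.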
Please check the weblucene project for XML indexing samples:
http://sourceforge.net/projects/weblucene/

Che Dong

----- Original Message -----
From: "Chandan Tamrakar" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Tuesday, March 16, 2004 4:31 PM
Subject: CJK Analyzer indexing japanese word document

> I am using the CJKAnalyzer from the Apache sandbox. I have set the Java
> file.encoding setting to SJIS, and I am able to index and search a
> Japanese HTML page. The index dumps look as I expected. However, when I
> index a Word document containing Japanese characters, it is not indexed
> as expected. Do I need to change anything in the CJKTokenizer and
> CJKAnalyzer classes?
> I have been able to index a Word document with StandardAnalyzer.
>
> Thanks in advance,
> chandan