Re: CJK Analyzer indexing japanese word document

2004-03-22 Thread Chandan Tamrakar
characters any suggestion ? thnks - Original Message - From: "Scott Smith" <[EMAIL PROTECTED]> To: "Lucene Users List" <[EMAIL PROTECTED]> Sent: Wednesday, March 17, 2004 4:27 AM Subject: RE: CJK Analyzer indexing japanese word document > I have used this analy

Re: CJK Analyzer indexing japanese word document

2004-03-16 Thread Che Dong
please check the java i/o's ByteStream ==> CharactorStream Che Dong - Original Message - From: "Chandan Tamrakar" <[EMAIL PROTECTED]> To: "Lucene Users List" <[EMAIL PROTECTED]> Sent: Wednesday, March 17, 2004 12:37 PM Subject: Re: CJK Analyzer in

Re: CJK Analyzer indexing japanese word document

2004-03-16 Thread Chandan Tamrakar
o: "Lucene Users List" <[EMAIL PROTECTED]> Sent: Wednesday, March 17, 2004 4:27 AM Subject: RE: CJK Analyzer indexing japanese word document > I have used this analyzer with Japanese and it works fine. In fact, I'm > currently doing English, several western European lang

Re: CJK Analyzer indexing japanese word document

2004-03-16 Thread Che Dong
: "Scott Smith" <[EMAIL PROTECTED]> To: "Lucene Users List" <[EMAIL PROTECTED]> Sent: Wednesday, March 17, 2004 6:42 AM Subject: RE: CJK Analyzer indexing japanese word document I have used this analyzer with Japanese and it works fine. In fact, I'm currently do

Re: CJK Analyzer indexing japanese word document

2004-03-16 Thread Erik Hatcher
On Mar 16, 2004, at 8:39 PM, [EMAIL PROTECTED] wrote: My experience tells me that CJKAnalyzer needs to be improved somehow For example, single word "X*" search works perfectly, however, multiple words wildcard "XX*" never works. Well, in this case it is QueryParser, not the analyzer, as the

Re: RE: CJK Analyzer indexing japanese word document

2004-03-16 Thread xx28
, 2004 5:42 pm Subject: RE: CJK Analyzer indexing japanese word document > I have used this analyzer with Japanese and it works fine. In > fact, I'm > currently doing English, several western European languages, > traditionaland simplified Chinese and Japanese. I throw them

RE: CJK Analyzer indexing japanese word document

2004-03-16 Thread Scott Smith
5, etc. to unicode. Once I fixed that, life was good. -Original Message- From: Che Dong [mailto:[EMAIL PROTECTED] Sent: Tuesday, March 16, 2004 8:31 AM To: Lucene Users List Subject: Re: CJK Analyzer indexing japanese word document some Korean friends tell me they use it successfully for

Re: CJK Analyzer indexing japanese word document

2004-03-16 Thread Che Dong
: "Chandan Tamrakar" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Sent: Tuesday, March 16, 2004 4:31 PM Subject: CJK Analyzer indexing japanese word document > > I am using a CJKAnalyzer from apache sandbox , I have set the java > file.encoding setting to SJIS > and

CJK Analyzer indexing japanese word document

2004-03-16 Thread Chandan Tamrakar
I am using a CJKAnalyzer from apache sandbox , I have set the java file.encoding setting to SJIS and i am able to index and search the japanese html page . I can see the index dumps as i expected , However when i index a word document containing japanese characters it is not indexing as expected