characters
any suggestion ?
thnks
- Original Message -
From: "Scott Smith" <[EMAIL PROTECTED]>
To: "Lucene Users List" <[EMAIL PROTECTED]>
Sent: Wednesday, March 17, 2004 4:27 AM
Subject: RE: CJK Analyzer indexing japanese word document
> I have used this analy
please check the java i/o's ByteStream ==> CharactorStream
Che Dong
- Original Message -
From: "Chandan Tamrakar" <[EMAIL PROTECTED]>
To: "Lucene Users List" <[EMAIL PROTECTED]>
Sent: Wednesday, March 17, 2004 12:37 PM
Subject: Re: CJK Analyzer in
o: "Lucene Users List" <[EMAIL PROTECTED]>
Sent: Wednesday, March 17, 2004 4:27 AM
Subject: RE: CJK Analyzer indexing japanese word document
> I have used this analyzer with Japanese and it works fine. In fact, I'm
> currently doing English, several western European lang
: "Scott Smith" <[EMAIL PROTECTED]>
To: "Lucene Users List" <[EMAIL PROTECTED]>
Sent: Wednesday, March 17, 2004 6:42 AM
Subject: RE: CJK Analyzer indexing japanese word document
I have used this analyzer with Japanese and it works fine. In fact, I'm
currently do
On Mar 16, 2004, at 8:39 PM, [EMAIL PROTECTED] wrote:
My experience tells me that CJKAnalyzer needs to be improved
somehow
For example, single word "X*" search works perfectly, however,
multiple words wildcard "XX*" never works.
Well, in this case it is QueryParser, not the analyzer, as the
, 2004 5:42 pm
Subject: RE: CJK Analyzer indexing japanese word document
> I have used this analyzer with Japanese and it works fine. In
> fact, I'm
> currently doing English, several western European languages,
> traditionaland simplified Chinese and Japanese. I throw them
5, etc. to unicode. Once I fixed that, life was good.
-Original Message-
From: Che Dong [mailto:[EMAIL PROTECTED]
Sent: Tuesday, March 16, 2004 8:31 AM
To: Lucene Users List
Subject: Re: CJK Analyzer indexing japanese word document
some Korean friends tell me they use it successfully for
: "Chandan Tamrakar" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Tuesday, March 16, 2004 4:31 PM
Subject: CJK Analyzer indexing japanese word document
>
> I am using a CJKAnalyzer from apache sandbox , I have set the java
> file.encoding setting to SJIS
> and
I am using a CJKAnalyzer from apache sandbox , I have set the java
file.encoding setting to SJIS
and i am able to index and search the japanese html page . I can see the
index dumps as i expected , However when i index a word document containing
japanese characters it is not indexing as expected