I don't have a document with Chinese characters to verify this, but it
looks right, so I'll add your change to SearchFiles.java.

Thanks,
Otis

--- Eric Chow <[EMAIL PROTECTED]> wrote:

> Search is not really correct with UTF-8!
> 
> 
> The following is the search result I got using SearchFiles from the
> Lucene demo.
> 
> d:\Downloads\Softwares\Apache\Lucene\lucene-1.4.3\src>java
> org.apache.lucene.demo.SearchFiles c:\temp\myindex
> Usage: java SearchFiles <idnex>
> Query: 經
> Searching for: g                                <<<<<<<<<<<<    strange ??
> 3 total matching documents
> 0. ../docs/ChineseDemo.html            <<<<<<<<<<<<    this file contains the 經
>    -
> 1. ../docs/luceneplan.html
>    - Jakarta Lucene - Plan for enhancements to Lucene
> 2. ../docs/api/index-all.html
>    - Index (Lucene 1.4.3 API)
> Query: 
> 
> 
> 
> From the above result, only ChineseDemo.html contains the character
> that I want to search for!
> 
> 
> 
> 
> The modified code in SearchFiles.java:
> 
> 
> BufferedReader in = new BufferedReader(new
> InputStreamReader(System.in, "UTF-8"));
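
For anyone following along, here is a minimal sketch of why the explicit
charset in the line above matters. It assumes the unmodified demo reads
System.in with the platform default encoding, which I have not checked
against the 1.4.3 source. The class name Utf8QueryReader and the method
readQuery are made up for illustration and are not part of Lucene. The
sketch feeds the same UTF-8 bytes through two decoders instead of using
a real console.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, not part of the Lucene demo.
class Utf8QueryReader {

    // Reads one line from the stream, decoding its bytes with the named charset.
    static String readQuery(InputStream in, String charsetName) throws IOException {
        BufferedReader reader = new BufferedReader(new InputStreamReader(in, charsetName));
        return reader.readLine();
    }

    public static void main(String[] args) throws IOException {
        // U+7D93 is the character from the query; it takes three bytes in UTF-8.
        byte[] utf8Bytes = "\u7D93\n".getBytes(StandardCharsets.UTF_8);

        // Decoded as UTF-8, the three bytes become the single intended character.
        String decoded = readQuery(new ByteArrayInputStream(utf8Bytes), "UTF-8");

        // Decoded as a single-byte charset, each byte becomes its own character,
        // so the query no longer matches what was typed.
        String misdecoded = readQuery(new ByteArrayInputStream(utf8Bytes), "ISO-8859-1");

        System.out.println(decoded.length());    // prints 1
        System.out.println(misdecoded.length()); // prints 3
    }
}
```

Whether System.in really delivers UTF-8 bytes depends on the terminal and
its code page, so hard-coding "UTF-8" may still misread input on consoles
that use another encoding.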
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
> 
> 


