I don't have a document with Chinese characters to verify this, but it looks right, so I'll add your change to SearchFiles.java.
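
For anyone who wants to check the decoding step without a Chinese document, here is a minimal sketch. It is not the demo code: the class Utf8QueryReader and the method readQuery are hypothetical names used only for illustration. It feeds the UTF-8 bytes for U+7D93 (the character in the query) through the same InputStreamReader(..., "UTF-8") wrapper and prints the decoded code point. It assumes the console really delivers UTF-8 bytes on System.in; if the terminal uses a different code page, the query may still come out garbled even with this change.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;

// Hypothetical helper class, not part of the Lucene demo.
class Utf8QueryReader {

    // Reads one line from the stream, decoding the bytes as UTF-8
    // instead of relying on the platform default charset.
    public static String readQuery(InputStream in) throws IOException {
        BufferedReader reader =
                new BufferedReader(new InputStreamReader(in, "UTF-8"));
        return reader.readLine();
    }

    public static void main(String[] args) throws IOException {
        // Simulated console input: the UTF-8 encoding of U+7D93, then a newline.
        byte[] typed = { (byte) 0xE7, (byte) 0xB6, (byte) 0x93, (byte) '\n' };
        String query = readQuery(new ByteArrayInputStream(typed));

        // A correctly decoded query is a single char with code point 7d93.
        System.out.println(query.length());
        System.out.println(Integer.toHexString(query.charAt(0)));
    }
}
```

In SearchFiles itself the stream would be System.in rather than a byte array, as in the one-line change quoted below.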

Thanks,
Otis

--- Eric Chow <[EMAIL PROTECTED]> wrote:
> Search not really correct with UTF-8 !!!
>
> The following is the search result that I used the SearchFiles in the
> lucene demo.
>
> d:\Downloads\Softwares\Apache\Lucene\lucene-1.4.3\src>java org.apache.lucene.demo.SearchFiles c:\temp\myindex
> Usage: java SearchFiles <idnex>
> Query: 經
> Searching for: g    <<<<<<<<<<<< strange ??
> 3 total matching documents
> 0. ../docs/ChineseDemo.html    <<<<<<<<<<<< this files contains the 經
>    -
> 1. ../docs/luceneplan.html
>    - Jakarta Lucene - Plan for enhancements to Lucene
> 2. ../docs/api/index-all.html
>    - Index (Lucene 1.4.3 API)
> Query:
>
> From the above result only the ChineseDemo.html includes the character
> that I want to search !
>
> The modified code in SearchFiles.java:
>
> BufferedReader in = new BufferedReader(new InputStreamReader(System.in, "UTF-8"));
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]