On Wed, Feb 3, 2010 at 11:15 PM, Daniel Cheng <j16sdiz+freenet at gmail.com> 
wrote:
> On Tue, Feb 2, 2010 at 10:41 PM, Matthew Toseland
> <toad at amphibian.dyndns.org> wrote:
>> Is there any chance you could implement or at least test Chinese searching? 
>> As far as I can see all we need to do is:
>> - Decode surrogates
>> - Check whether the character or surrogate is in the relevant range
>> - Treat a character (which might be two UTF-16 chars) as a word
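
For illustration, the three steps above could be sketched roughly as below.
This is not the xmlspider code. The class and method names are hypothetical,
and "the relevant range" is assumed here to mean the Unicode Han script;
a real tokenizer may want other ranges (kana, Hangul) as well.

```java
import java.util.ArrayList;
import java.util.List;

class CjkTokenizerSketch {

    // Assumption: "relevant range" means Han ideographs, including the
    // supplementary-plane extensions that need a surrogate pair in UTF-16.
    static boolean isCjk(int codePoint) {
        return Character.UnicodeScript.of(codePoint) == Character.UnicodeScript.HAN;
    }

    // Splits text into words: each Han character becomes its own word,
    // other letters and digits are grouped into lowercase runs.
    static List<String> tokenize(String text) {
        List<String> words = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        int i = 0;
        while (i < text.length()) {
            // codePointAt decodes a surrogate pair into a single code point
            int cp = text.codePointAt(i);
            i += Character.charCount(cp);
            if (isCjk(cp)) {
                flush(current, words);
                // One character (possibly two UTF-16 chars) is one word
                words.add(new String(Character.toChars(cp)));
            } else if (Character.isLetterOrDigit(cp)) {
                current.appendCodePoint(Character.toLowerCase(cp));
            } else {
                flush(current, words);
            }
        }
        flush(current, words);
        return words;
    }

    // Emits the pending non-CJK word, if any, and resets the buffer.
    private static void flush(StringBuilder current, List<String> words) {
        if (current.length() > 0) {
            words.add(current.toString());
            current.setLength(0);
        }
    }

    public static void main(String[] args) {
        System.out.println(tokenize("Freenet 中文搜索"));
    }
}
```

Treating every Han character as a word keeps indexing simple, but a phrase
search then has to match consecutive single-character words.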
>
> I have some (out-of-date) Chinese-enabled xmlspider code around,
> which I will try to bring up to date and commit this week or next week.

I have committed some code to xmlspider that should let it build a Chinese
index. Please test it and let me know if anything breaks.

Thanks.

>
> I will try to port that code to Library as well,
> but no promises -- Library has phrase search and boolean operators that may
> need a new tokenizer.
>
>>
>> Thanks!
>>
>> I do have a possible tester if you don't have time, he put me onto the 
>> urgency of this for widespread use in China...
>>
>> https://bugs.freenetproject.org/view.php?id=3837
>>
>
