Re: Japanese text search problem

2001-08-10 Benjamin Franz
On Fri, 10 Aug 2001, Martin Duerst wrote:
> At 12:17 01/08/08 -0700, Benjamin Franz wrote:
>
> >In UTF8 the 'frame' problem doesn't exist because character start
> >bytes _ALWAYS_ have bit eight set to 0 while continuation bytes _ALWAYS_
> >have bit eight set to 1. 'quotemeta' works fine if you u
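
For reference, a minimal sketch in Python (not code from the thread) of the byte classes that make UTF-8 self-synchronizing. In the UTF-8 encoding, single-byte (ASCII) characters have the high bit clear, every byte of a multibyte sequence has the high bit set, lead bytes match 11xxxxxx, and continuation bytes match 10xxxxxx. The function names `classify_utf8_byte` and `char_starts` and the sample string are illustrative assumptions.

```python
def classify_utf8_byte(b):
    """Classify one byte of well-formed UTF-8 by its top bits."""
    if b & 0x80 == 0x00:
        return "ascii"         # 0xxxxxxx: a complete one-byte character
    if b & 0xC0 == 0x80:
        return "continuation"  # 10xxxxxx: never begins a character
    return "lead"              # 11xxxxxx: begins a multibyte character


def char_starts(data):
    """Return the byte offsets at which a character begins."""
    return [i for i, b in enumerate(data) if b & 0xC0 != 0x80]


# Hypothetical sample: three Japanese characters, a space, and four ASCII letters.
text = "日本語 text"
data = text.encode("utf-8")
kinds = [classify_utf8_byte(b) for b in data]
starts = char_starts(data)
```

Because a continuation byte can never be mistaken for the start of a character, a byte-level search for a valid UTF-8 pattern cannot match in the middle of a character, which is the property the 'frame' discussion turns on.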

Re: Unicode Normalization Forms

2001-08-10 SADAHIRO Tomoyuki
On Thu, 09 Aug 2001 22:30:16 +0200 Bjoern Hoehrmann <[EMAIL PROTECTED]> wrote:
> * SADAHIRO Tomoyuki wrote:
> >How about the following interface?
> >
> >| $normalized_string = normalize($raw_string)
> >|
> >| You can use this function only if the normalization form
> >| you require is specified