Re: More character matching bits

Dan Sugalski Fri, 15 Jun 2001 17:22:54 -0700
At 12:29 AM 6/16/2001 +0100, Simon Cozens wrote:
>On Fri, Jun 15, 2001 at 07:12:45PM -0400, Dan Sugalski wrote:
> > The question, then, is should ya be considered a literal number in either
> > of those contexts?
>
>The phrase "in those contexts" suggests that it should in some and shouldn't
>in others.

Well, they suggest that to me but, to be fair, I don't speak Japanese. 
(Well, not yet at least)

>This means that the regexp engine would need to understand
>Japanese, not a pretty prospect.

Pretty, no. Correct, yes. (OK, maybe)

If we can't effectively do it correctly, I can live with that. I just want 
the suboptimal behaviour to be on purpose (and hopefully overridable by 
someone clever enough) rather than accidental.

>I'll solicit advise from Tokyo.pm on how
>*they*'d expect the matching to work; I have a gut feeling that they
>*wouldn't* want \d to match kanji digits, although possibly wouldn't mind
>a separate character class to do so.

I'm not necessarily thinking that \d will match kanji digits, but that 
doesn't mean that there won't be a character class that will. Or a 
character class that's clever enough to know when something is or isn't a 
digit based on context.

                                        Dan

--------------------------------------"it's like this"-------------------
Dan Sugalski                          even samurai
[EMAIL PROTECTED]                         have teddy bears and even
                                      teddy bears get drunk
Re: More character matching bits

Reply via email to