On Tue, 2002-02-05 at 15:34, Jeff 'japhy' Pinyan wrote:
> On Feb 5, Shawn said:
> 
> >  In a regex, the '\b' can stand for:
> >    1) a boundry between a word and non-word char
> >    2) bakcspace
> >
> >  What is the precedence for figuring out which is being called?
> 
> \b is ONLY "backspace" when found inside a character class.  Everywhere
> else it represents a word boundary.  You might want to use \010 or \x08
> for "backspace", since it's better than [\b].
> 
> -- 
> Jeff "japhy" Pinyan      [EMAIL PROTECTED]      http://www.pobox.com/~japhy/
> RPI Acacia brother #734   http://www.perlmonks.org/   http://www.cpan.org/
> ** Look for "Regular Expressions in Perl" published by Manning, in 2002 **
> <stu> what does y/// stand for?  <tenderpuss> why, yansliterate of course..

How does this change with the coming of Unicode and character vs. byte
semantics?  Obviously we would be safe saying \b in a character class
since this is a logical representation of the character, but is \010
still equal to backspace in Unicode/Utf 8/Utf 16/etc.?  I worry that
assumptions I have always made about the ASCII character set will no
longer hold true in the brave new world of i16n.

-- 
Today is Boomtime the 37th day of Chaos in the YOLD 3168
Umlaut Zebra über alles!

Missle Address: 33:48:3.521N  84:23:34.786W


--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to