On Tue, 2002-02-05 at 15:34, Jeff 'japhy' Pinyan wrote: > On Feb 5, Shawn said: > > > In a regex, the '\b' can stand for: > > 1) a boundry between a word and non-word char > > 2) bakcspace > > > > What is the precedence for figuring out which is being called? > > \b is ONLY "backspace" when found inside a character class. Everywhere > else it represents a word boundary. You might want to use \010 or \x08 > for "backspace", since it's better than [\b]. > > -- > Jeff "japhy" Pinyan [EMAIL PROTECTED] http://www.pobox.com/~japhy/ > RPI Acacia brother #734 http://www.perlmonks.org/ http://www.cpan.org/ > ** Look for "Regular Expressions in Perl" published by Manning, in 2002 ** > <stu> what does y/// stand for? <tenderpuss> why, yansliterate of course..
How does this change with the coming of Unicode and character vs. byte semantics? Obviously we would be safe saying \b in a character class since this is a logical representation of the character, but is \010 still equal to backspace in Unicode/Utf 8/Utf 16/etc.? I worry that assumptions I have always made about the ASCII character set will no longer hold true in the brave new world of i16n. -- Today is Boomtime the 37th day of Chaos in the YOLD 3168 Umlaut Zebra über alles! Missle Address: 33:48:3.521N 84:23:34.786W -- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]