Re: Whitespace in \c[...], \x[...], etc.

2009-04-28 Thread Patrick R. Michaud
On Tue, Apr 28, 2009 at 03:08:05PM -0400, Mark J. Reed wrote: > On Tue, Apr 28, 2009 at 2:27 PM, Patrick R. Michaud > wrote: > > * The Unicode character name database [2] has parens in the > >  name property field for many characters > > > >    000A;;Cc;0;B;N;LINE FEED (LF) > > That's no

Re: Whitespace in \c[...], \x[...], etc.

2009-04-28 Thread Mark J. Reed
On Tue, Apr 28, 2009 at 2:27 PM, Patrick R. Michaud wrote: >> According to the 5.0.0 standard, section 4.8: >> >> "Unicode character names contain only uppercase Latin letters A >> through Z, digits, space, and hyphen-minus." >> >> So it seems the notes in parentheses are not considered part of th

Re: Whitespace in \c[...], \x[...], etc.

2009-04-28 Thread Patrick R. Michaud
On Tue, Apr 28, 2009 at 07:22:18AM -0700, Larry Wall wrote: > On Mon, Apr 27, 2009 at 11:04:03AM +0200, Helmut Wollmersdorfer wrote: > > Std.pm allows e.g. > > > > "\x[ 41 , 42 , 43 ]" > > > > For convenience - especially with long charnames - it should be possible > > to write > > > >

Re: Whitespace in \c[...], \x[...], etc.

2009-04-28 Thread Patrick R. Michaud
On Tue, Apr 28, 2009 at 01:28:40PM -0400, Mark J. Reed wrote: > On Tue, Apr 28, 2009 at 10:22 AM, Larry Wall wrote: > > Does anyone know offhand whether the Unicode Consortium has an explicit > > policy against use of punctuation in a charname?  So far they only > > seem to use hyphen and parens,

Re: Whitespace in \c[...], \x[...], etc.

2009-04-28 Thread Mark J. Reed
On Tue, Apr 28, 2009 at 10:22 AM, Larry Wall wrote: > Does anyone know offhand whether the Unicode Consortium has an explicit > policy against use of punctuation in a charname?  So far they only > seem to use hyphen and parens, but I wonder to what extent we can > depend on that... > According to

Re: Whitespace in \c[...], \x[...], etc.

2009-04-28 Thread Larry Wall
On Mon, Apr 27, 2009 at 11:04:03AM +0200, Helmut Wollmersdorfer wrote: > It's not explicitly specified, if insignificant whitespace is allowed in > \c[...], \x[...], etc. > > Std.pm allows e.g. > > "\x[ 41 , 42 , 43 ]" > > For convenience - especially with long charnames - it should be

Whitespace in \c[...], \x[...], etc.

2009-04-27 Thread Helmut Wollmersdorfer
It's not explicitly specified, if insignificant whitespace is allowed in \c[...], \x[...], etc. Std.pm allows e.g. "\x[ 41 , 42 , 43 ]" For convenience - especially with long charnames - it should be possible to write "\c[ SPACE, # blafasel LATIN SMALL LETT