Re: Unicode Categories

2010-11-12 Thread karl williamson
Tom Christiansen wrote: Patrick wrote: : * Almost. E.g. <isL> would be nice to have as well. : : Those exist also: : : $ ./perl6 : say 'abCD34' ~~ / <isL> / : a : say 'abCD34' ~~ / <isN> / : 3 : They may exist, but I'm not certain it's a good idea to encourage the Is_XXX approach on

Re: Unicode Categories

2010-11-11 Thread Tom Christiansen
The 'Is' prefix can be used on any property in 5.12 for which there is no naming conflict. The only naming conflicts are certain of the block properties, such as Arabic. IsArabic means the Arabic script. InArabic means the base Arabic block. Personally, I find Is and In unintuitive, and

Unicode Categories

2010-11-10 Thread Chase Albert
Sorry if this is the wrong forum. I was wondering if there was a way to specify Unicode categories (http://www.fileformat.info/info/unicode/category/index.htm) in a regular expression (and hence a grammar), or if there would be any consideration for adding support for that (requiring some kind of

Re: Unicode Categories

2010-11-10 Thread Patrick R. Michaud
consideration for adding support for that (requiring some kind of special syntax). Unicode categories are done using assertion syntax with "is" followed by the category name. Thus <isLu> (uppercase letter), <isNd> (decimal digit), <isZs> (space separator), etc. This even works in Rakudo today

Re: Unicode Categories

2010-11-10 Thread Chase Albert
That's exactly what I was looking for*. Awesome, thank you. ~Cheers * Almost. E.g. <isL> would be nice to have as well. On Wed, Nov 10, 2010 at 13:15, Patrick R. Michaud pmich...@pobox.com wrote: Unicode properties are always available with a prefix

Re: Unicode Categories

2010-11-10 Thread Patrick R. Michaud
On Wed, Nov 10, 2010 at 01:21:57PM -0500, Chase Albert wrote: That's exactly what I was looking for*. Awesome, thank you. * Almost. E.g. <isL> would be nice to have as well. Those exist also: $ ./perl6 say 'abCD34' ~~ / <isL> / a say 'abCD34' ~~ / <isN> / 3 Pm
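
For readers unfamiliar with how one-letter classes such as L and N relate to two-letter general categories such as Lu and Nd, the following is a minimal sketch in Python rather than Perl 6. It is not the Rakudo implementation. The helper name `first_in_category` is hypothetical, and it assumes that a one-letter class covers every two-letter category that begins with that letter.

```python
import unicodedata

def first_in_category(text, prefix):
    """Return the first character whose Unicode general category
    starts with `prefix` (e.g. 'L', 'Lu', 'N', 'Nd'), or None."""
    for ch in text:
        # unicodedata.category returns a two-letter code such as 'Ll' or 'Nd'
        if unicodedata.category(ch).startswith(prefix):
            return ch
    return None

print(first_in_category('abCD34', 'L'))   # a
print(first_in_category('abCD34', 'N'))   # 3
print(first_in_category('abCD34', 'Lu'))  # C
```

The first two calls correspond to the letter and number matches shown in the message above.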

Re: Unicode Categories

2010-11-10 Thread Chase Albert
Even awesomer, thank you again. On Wed, Nov 10, 2010 at 13:28, Patrick R. Michaud pmich...@pobox.com wrote: On Wed, Nov 10, 2010 at 01:21:57PM -0500, Chase Albert wrote: That's exactly what I was looking for*. Awesome, thank you. * Almost. E.g. <isL> would be nice to have as well. Those

Re: Unicode Categories

2010-11-10 Thread Tom Christiansen
consideration for adding support for that (requiring some kind of special syntax). Unicode categories are done using assertion syntax with "is" followed by the category name. Thus <isLu> (uppercase letter), <isNd> (decimal digit), <isZs> (space separator), etc. This even works in Rakudo today

Re: Unicode Categories

2010-11-10 Thread Tom Christiansen
Patrick wrote: : * Almost. E.g. <isL> would be nice to have as well. : : Those exist also: : : $ ./perl6 : say 'abCD34' ~~ / <isL> / : a : say 'abCD34' ~~ / <isN> / : 3 : They may exist, but I'm not certain it's a good idea to encourage the Is_XXX approach on *anything* except Script=XXX