On 12.10.2011 0:04, Jerry wrote:
I have 2 thoughts.
1) Minor doc typo:
Long form for hex notation should be \U00YYYYYY.
Yeah, \U it is.
2) Unicode set syntax
If you're going to provide unicode set support, why not use ICU syntax
rather than invent another one?
Looks like I was tricked by their technical standard then.
I can't immediately recall where this syntax was ever used but:
http://unicode.org/reports/tr18/#Subtraction_and_Intersection
The prime reason cited here is that e.g. '--' is (almost) unambigious
with range notation '-' and also allows to skip [] where applicable
[\p{letter}--a-z] vs [[\p{letter}]-[a-z]].
Come to think of it '--' is cleaner in this case.
Jerry
--
Dmitry Olshansky