Re: Unicode regular expressions -- buggy?

2005-08-11 Thread Fredrik Lundh
Christopher Subich wrote:
> I don't think the python regular expression module correctly handles
> combining marks; it gives inconsistent results between equivalent forms
> of some regular expressions:
> Is this a limitation-by-design, or a bug?

limitation by design. if you want correct results

Unicode regular expressions -- buggy?

2005-08-10 Thread Christopher Subich
I don't think the python regular expression module correctly handles combining marks; it gives inconsistent results between equivalent forms of some regular expressions:

>>> sys.version
'2.4.1 (#65, Mar 30 2005, 09:13:57) [MSC v.1310 32 bit (Intel)]'
>>> re.match('\w',unicodedata.normalize('NFD
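
The kind of inconsistency described above can be reproduced with a small sketch. This is not the code from the truncated message: it assumes Python 3 (where str patterns are Unicode-aware by default) rather than the 2.4.1 interpreter shown, and the helper name whole_word and the test character U+00E9 are made up for illustration. It shows that \w accepts a precomposed letter but not a base letter followed by a combining mark, and that normalizing to NFC before matching is one possible way to get consistent results for characters that have a precomposed form.

```python
import re
import unicodedata

# The same text in two canonically equivalent forms:
# NFC is the single code point U+00E9, NFD is 'e' followed by U+0301.
composed = unicodedata.normalize('NFC', '\u00e9')
decomposed = unicodedata.normalize('NFD', '\u00e9')

def whole_word(s):
    # Hypothetical helper: True only if every code point in s matches \w.
    return re.fullmatch(r'\w+', s) is not None

print(len(composed), len(decomposed))
print(whole_word(composed))     # precomposed letter is a word character
print(whole_word(decomposed))   # the combining mark U+0301 is not matched by \w

# One possible workaround (an assumption, not advice taken from the thread):
# normalize to NFC before matching.
print(whole_word(unicodedata.normalize('NFC', decomposed)))
```

The regular expression engine works on individual code points, not on user-perceived characters, so the two forms behave differently even though they are canonically equivalent. NFC only helps where a precomposed code point exists; sequences with no composed form still contain combining marks after normalization.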