On Thu, 10 Feb 2005 13:00:42 +0300 "Denis S. Otkidach" <[EMAIL PROTECTED]> wrote:
> On all platfroms \w matches all unicode letters when used with flag > re.UNICODE, but this doesn't work on SuSE 9.2: > > Python 2.3.4 (#1, Dec 17 2004, 19:56:48) > [GCC 3.3.4 (pre 3.3.5 20040809)] on linux2 > Type "help", "copyright", "credits" or "license" for more information. > >>> import re > >>> re.compile(ur'\w+', re.U).match(u'\xe4') > >>> > > BTW, is correctly recognize this character as lowercase letter: > >>> import unicodedata > >>> unicodedata.category(u'\xe4') > 'Ll' > > I've looked through all SuSE patches applied, but found nothing > related. What is the reason for broken behavior? Incorrect configure > options? Just a bit more information. test_re.py fails in SuSE 9.2 with the following errors: <snip> Running re_tests test suite === Failed incorrectly ('(?u)\\b.\\b', u'\xc4', 0, 'found', u'\xc4') === Failed incorrectly ('(?u)\\w', u'\xc4', 0, 'found', u'\xc4') -- Denis S. Otkidach http://www.python.ru/ [ru] -- http://mail.python.org/mailman/listinfo/python-list