On 03/09/2015 03:04 PM, Wolfgang Maier wrote:
On 03/09/2015 02:33 PM, Albert-Jan Roskam wrote:
--------------------------------------------
On Mon, 3/9/15, Tim Chase <python.l...@tim.thechases.com> wrote:

"[^\d\W_]+" means something like "one or more (+) of 'not (a digit, a
non-word character, an underscore)'".


interesting (using Python3.4 and
U+2188     ROMAN NUMERAL ONE HUNDRED THOUSAND     ↈ):

 >>> re.search('[^\d\W_]+', '\u2188', re.I | re.U)
<_sre.SRE_Match object; span=(0, 1), match='ↈ'>

ↈ and at least some other characters in the Nl (Number, Letter) category
seem to be matched by \w (and so are not matched by \W).

Would that be considered a bug?


Sorry for the potential confusion: I meant a bug in the pattern search above (not in the definition of \w or \W).
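
For anyone who wants to reproduce the observation, here is a minimal sketch (assuming Python 3.4 or later, where re.fullmatch is available). It checks how U+2188 is classified by unicodedata and by the \w and \d classes. The variable names are only illustrative and not part of the original session.

```python
import re
import unicodedata

ch = '\u2188'  # the roman numeral character from the session above

# Unicode general category of the character ('Nl' = Number, Letter).
category = unicodedata.category(ch)

# Does the character count as a word character, a digit, or a match
# for the "letters only" pattern discussed in this thread?
is_word = re.fullmatch(r'\w', ch) is not None
is_digit = re.fullmatch(r'\d', ch) is not None
matches_pattern = re.fullmatch(r'[^\d\W_]+', ch) is not None

# str.isalpha() only accepts the letter categories, so it can serve
# as a stricter check than the regex alone.
is_alpha = ch.isalpha()

print(category, is_word, is_digit, matches_pattern, is_alpha)
```

The character is a word character but not a \d digit, so the negated class lets it through. If only real letters are wanted, one option (an assumption about the intent, not something the pattern's author stated) is to additionally filter the matches with str.isalpha().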

--
https://mail.python.org/mailman/listinfo/python-list
