On Nov 14, 11:27 am, "Martin v. Löwis" <[EMAIL PROTECTED]> wrote: > > I'm trying to build a regex in python to identify punctuation > > characters in all the languages. Some regex implementations support an > > extended syntax \p{P} that does just that. As far as I know, python re > > doesn't. Any idea of a possible alternative? > > You should use character classes. You can generate them automatically > from the unicodedata module: check whether unicodedata.category(c) > starts with "P". > > Regards, > Martin
Thanks Martin. I'll do this. -- http://mail.python.org/mailman/listinfo/python-list