Here's something that surprised me about Python regular expressions.

>>> krex = re.compile(r"^([a-z])+$")
>>> s = "abcdef"
>>> ms = krex.match(s)
>>> ms.groups()
('f',)

The parentheses indicate a capturing group within the
regular expression, and the "+" indicates that the
group can appear one or more times.  The regular
expression matches that way.  But instead of returning
a captured group for each character, it returns only the
last one.

The documentation in fact says that, at

http://docs.python.org/library/re.html

"If a group is contained in a part of the pattern that matched multiple times, the last match is returned."

That's kind of lame, though. I'd expect that there would be some way
to retrieve all matches.

                                        John Nagle
--
http://mail.python.org/mailman/listinfo/python-list

Reply via email to