Tempo: > I am having a little trouble extracting text from a string. The > string that I am dealing with is pasted below, and I want to > extract the prices that are contained in the string below.
This may help: >>> import re >>> reg = r"(?<= \$ ) (?: \d* \.? \d* )" >>> prices = re.compile(reg, flags=re.VERBOSE) >>> prices.findall('</span>, $66.99 <span class="sale"> $.99') ['66.99', '.99'] You can read about Python regular expressions: http://www.amk.ca/python/howto/regex/ http://docs.python.org/lib/module-re.html ------------------------ Perl 6 regular expressions are verbose by default, future Python may do the same. >From Apocalypse 5, by Larry Wall: http://dev.perl.org/perl6/doc/design/apo/A05.html >In real life, tokens are more recognizable if they are separated by >whitespace.< >Now, you may rightly point out that + is something we already have, and we >already introduced /x to allow whitespace, so why is this bullet point here? >Well, there's a lot of inertia in culture, and the problem with /x is that >it's not the default, so people don't think to turn it on when it would >probably do a lot of good. The culture is biased in the wrong direction. >Whitespace around tokens should be the norm, not the exception. It should be >acceptable to use whitespace to separate tokens that could be confused.< Bye, bearophile -- http://mail.python.org/mailman/listinfo/python-list