Thanks for the reply. I found out the problem was occurring later on in the
script; the regexp itself works well.

-----Original Message-----
From: Lawrence D'Oliveiro [mailto:[EMAIL PROTECTED] 
Sent: Tuesday, September 23, 2008 6:51 PM
To: python-list@python.org
Subject: Re: Regex Help

In message <[EMAIL PROTECTED]>, Support Desk wrote:

> Anybody know of a good regex to parse html links from html code? The one I
> am currently using seems to be cutting off the last letter of some links,
> and returning links like
> 
> http://somesite.co
> 
> or http://somesite.ph
> 
> the code I am using is
> 
> 
> regex = r'<a href=["|\']([^"|\']+)["|\']>'

Can you post some example HTML sequences that this regexp is not handling
correctly?
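
For anyone reading the thread later, here is a minimal sketch of how the
quoted pattern could be checked against sample input. The sample HTML below
is hypothetical (it is not from the original script), and the "simplified"
pattern is only a suggested variant: inside square brackets "|" is a literal
pipe rather than alternation, so it can be dropped from the character classes.

```python
import re

# The pattern from the quoted message, unchanged.
original = re.compile(r'<a href=["|\']([^"|\']+)["|\']>')

# Suggested variant (an assumption, not the poster's code): the same pattern
# without "|" in the brackets, since "|" is a literal pipe in a character class.
simplified = re.compile(r'<a href=["\']([^"\']+)["\']>')

# Hypothetical sample input, made up for illustration.
sample = (
    '<a href="http://somesite.com">one</a> '
    "<a href='http://somesite.php'>two</a> "
    '<a href="http://somesite.com/a|b">three</a>'
)

# findall returns the captured group (the URL) for every match.
original_links = original.findall(sample)
simplified_links = simplified.findall(sample)

print(original_links)
print(simplified_links)
```

On this sample the original pattern returns the first two links in full, which
is consistent with the truncation coming from elsewhere in the script. It does
skip the third link, because the literal "|" in the URL is excluded by the
character class. Both patterns only match tags where href is the sole
attribute; for general HTML a real parser such as the standard library's
html.parser is usually the safer choice.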


--
http://mail.python.org/mailman/listinfo/python-list