* Akash <[EMAIL PROTECTED]> [061129 20:54]: > On 11/30/06, Shitiz Bansal <[EMAIL PROTECTED]> wrote: > > I am using beautiful soup for extracting links from a web page. > > Most pages use relative links in their pages which is causing a problem. Is > > there any library to extract complete links or do i have to parse this > > myself? > > > > Beautiful Soup can also extract text which is present on the page. If > there are no complete links no library can do that for you. But since > you are reaching a certain web page to extract you already have that > URL information with you. All you have to do then is to prefix it to > each extracted URL. Take a look at urlparse.urljoin from the standard library.
Andreas _______________________________________________ Tutor maillist - Tutor@python.org http://mail.python.org/mailman/listinfo/tutor