>>> from BeautifulSoup import BeautifulSoup >>> soup = BeautifulSoup("""<area coords="427,724,432,732" href=" http://BioCyc.org/ECOLI/NEW-IMAGE? ... type=GENE-IN-CHROM-BROWSER&object=EG12309" onmouseover="return ... overlib('<b>Gene:</b> yjtD<BR><b>Product:</ ... b> predicted rRNA methyltransferase, subunit of predicted rRNA ... methyltransferase<BR><b>Intergenic distances (bp):</ ... b> yjjY< +400 yjtD +214 >thrL');"><b>Gene:</b> yjtD<br / ... ><b>Product:</b> predicted rRNA methyltransferase, subunit of ... predicted rRNA methyltransferase<br /><b>Intergenic distances (bp):</ ... b> yjjY< +400 yjtD +214 >thrL');" onmouseout="return nd();"> ... </area>""") >>> soup.area["href"] u'http://BioCyc.org/ECOLI/NEW-IMAGE ?\ntype=GENE-IN-CHROM-BROWSER&object=EG12309'
On Wed, Sep 2, 2009 at 1:25 AM, elsa <kerensael...@hotmail.com> wrote: > Hi all, > > if I have some HTML that looks like this: > > <area coords="427,724,432,732" href="http://BioCyc.org/ECOLI/NEW-IMAGE? > type=GENE-IN-CHROM-BROWSER&object=EG12309" onmouseover="return > overlib('<b>Gene:</b> yjtD<BR><b>Product:</ > b> predicted rRNA methyltransferase, subunit of predicted rRNA > methyltransferase<BR><b>Intergenic distances (bp):</ > b> yjjY< +400 yjtD +214 >thrL');"><b>Gene:</b> yjtD<br / > ><b>Product:</b> predicted rRNA methyltransferase, subunit of > predicted rRNA methyltransferase<br /><b>Intergenic distances (bp):</ > b> yjjY< +400 yjtD +214 >thrL');" onmouseout="return nd();"> > </area> > > is there an easy way to use BeautifulSoup to extract just the value of > the href attribute? > > Thanks, > > elsa > -- > http://mail.python.org/mailman/listinfo/python-list >
-- http://mail.python.org/mailman/listinfo/python-list