I think HTMLParser might be what you're looking for... it's in
standard python.
On Sun, Feb 01, 2004 at 01:19:20PM +0100, [EMAIL PROTECTED] wrote:
> > You can use regexp in Python, too. It's the 're' module. See Python
> > doc about for details and look at gethaedlines, because this functions
> >
> You can use regexp in Python, too. It's the 're' module. See Python
> doc about for details and look at gethaedlines, because this functions
> also removes some tags.
In fact, in PERL, with the HTML::Parser module, you can fetch a whole page
to an object, the call something like :
print mypage->a
[EMAIL PROTECTED] wrote:
> I try to modify now the headlines plugin, by adding an automatic link
> resolver : if a link is found in the content of one title, then the link
> is fetched and the content is displayed. As this content is mainly
> text-based, i want to convert directly the HTML file to