Re: HTMLParser and non-ascii html pages

2011-09-20 Thread Peter Otten
Yaşar Arabacı wrote: > I am using a simple sublclass of HTMLParser like this: > > class LinkCollector(HTMLParser): > > def reset(self): > self.links = [] > HTMLParser.reset(self) > > def handle_starttag(self,tag,attr): > if tag in ("a","link"): > key

HTMLParser and non-ascii html pages

2011-09-20 Thread Yaşar Arabacı
Hi, I am using a simple sublclass of HTMLParser like this: class LinkCollector(HTMLParser): def reset(self): self.links = [] HTMLParser.reset(self) def handle_starttag(self,tag,attr): if tag in ("a","link"): key = "href" elif tag in ("img","sc