From what I have seen in the source code for HTMLParser, it can't
handle charsets other than ASCII.
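
If the encoding really is the problem, one workaround might be to decode the page to text yourself before feeding it to the parser. Here is a minimal sketch, not a tested fix for your page. It assumes the page is GB2312-encoded (I have not checked the actual charset), and LinkCollector and parse_links are hypothetical names made up for illustration. The import fallback is only there so the same code loads on both Python 2 and Python 3.

```python
try:
    from html.parser import HTMLParser  # Python 3 module name
except ImportError:
    from HTMLParser import HTMLParser   # Python 2 module name, as in the traceback


class LinkCollector(HTMLParser):
    """Hypothetical parser that records the href of every <a> tag."""

    def __init__(self):
        HTMLParser.__init__(self)
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href":
                    self.links.append(value)


def parse_links(raw_bytes, encoding="gb2312"):
    """Decode raw page bytes to text first, then parse the text.

    The default encoding is an assumption; use whatever charset the
    page actually declares. Undecodable bytes are replaced, not raised.
    """
    text = raw_bytes.decode(encoding, "replace")
    parser = LinkCollector()
    parser.feed(text)
    parser.close()
    return parser.links


# Made-up sample bytes standing in for the downloaded page.
sample = u'<html><body><a href="C0301c.htm">\u7edf\u8ba1</a></body></html>'.encode("gb2312")
print(parse_links(sample))
```

The point of decoding first is that the parser then only ever sees text, so it never has to guess what the raw bytes mean.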

2009/3/26 秦锋 <feng.w....@gmail.com>:
>
> When I use HTMLParser to parse the page at the link below:
>
> http://www.stats.gov.cn/tjsj/ndsj/2007/html/C0301c.htm
>
> the SDK keeps reporting:
>
> Traceback (most recent call last):
>   File "C:\Program Files\Google\google_appengine\google\appengine\ext\webapp\__init__.py", line 498, in __call__
>     handler.get(*groups)
>   File "D:\Data\Documents\Workspace\CNStatsSvr\src\spider.py", line 134, in get
>     parser.close()
>   File "C:\Python25\lib\HTMLParser.py", line 112, in close
>     self.goahead(1)
>   File "C:\Python25\lib\HTMLParser.py", line 164, in goahead
>     self.error("EOF in middle of construct")
>   File "C:\Python25\lib\HTMLParser.py", line 115, in error
>     raise HTMLParseError(message, self.getpos())
> HTMLParseError: EOF in middle of construct, at line 8, column 1
>
> I know this is a Python question, but could anyone help?
> Thanks!

--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"Google App Engine" group.
To post to this group, send email to google-appengine@googlegroups.com
To unsubscribe from this group, send email to 
google-appengine+unsubscr...@googlegroups.com
For more options, visit this group at 
http://groups.google.com/group/google-appengine?hl=en
-~----------~----~----~----~------~----~------~--~---