On Fri, 29 Aug 2008, bruce wrote:
> Hi john.
> Thanks for your reply. I tried your suggestion of using RobustFactory, and
> still get a badly maligned html back!!! The html is listed below. I would
That's expected -- this affects the parsing of the HTML. It does not
modify the HTML.
> have thou[...]
[mailto:[EMAIL PROTECTED] On Behalf
Of John J Lee
Sent: Friday, August 29, 2008 12:34 PM
To: [EMAIL PROTECTED]
Cc: python-list@python.org
Subject: Re: [wwwsearch-general] (no subject)
On Fri, 29 Aug 2008, bruce wrote:
[...]
> does the page (test.html) need to be completely valid html?
No, but there are certainly (poorly-defined) limitations.
I haven't tried to understand your script or the HTML, but did you try
this:
br = mechanize.Browser(mechanize.RobustFactory())
...
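
To illustrate the point that a more forgiving parser changes what gets parsed out of a page, not the page itself, here is a minimal sketch. It deliberately uses only the standard library's html.parser rather than mechanize, so it shows the general idea and not what RobustFactory does internally. The names LinkCollector and extract_links, and the sample markup, are made up for the example.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect href values from <a> tags, tolerating sloppy markup."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; value may be None.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value is not None:
                    self.links.append(value)


def extract_links(html):
    """Parse html leniently and return the hrefs found (hypothetical helper)."""
    collector = LinkCollector()
    collector.feed(html)
    collector.close()
    return collector.links


# Hypothetical malformed page: unclosed <p>, unquoted attribute,
# a stray </b>, and no closing tags at the end.
raw = "<html><body><p>broken <a href=next.html>next</b><a href='two.html'>two"

links = extract_links(raw)
print(links)  # the links the lenient parser managed to recover
print(raw)    # the markup itself is exactly what came in, still malformed
```

The parser recovers the links despite the broken markup, but the string you get back from the server is never repaired. If you need cleaned-up HTML, that is a separate step from choosing a parser.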