RE: [wwwsearch-general] (no subject)

2008-08-31 Thread John J Lee
On Fri, 29 Aug 2008, bruce wrote: Hi john. Thanks for your reply. I tried your suggestion of using RobustFactory, and still get a badly maligned html back!!! The html is listed below. I would That's expected -- this affects the parsing of the HTML. It does not modify the HTML. have thou

RE: [wwwsearch-general] (no subject)

2008-08-29 Thread bruce
ECTED] [mailto:[EMAIL PROTECTED] Behalf Of John J Lee Sent: Friday, August 29, 2008 12:34 PM To: [EMAIL PROTECTED] Cc: python-list@python.org Subject: Re: [wwwsearch-general] (no subject) On Fri, 29 Aug 2008, bruce wrote: [...] > does the page (test.html) need to be completely valid html? N

Re: [wwwsearch-general] (no subject)

2008-08-29 Thread John J Lee
On Fri, 29 Aug 2008, bruce wrote: [...] does the page (test.html) need to be completely valid html? No, but there are certainly (poorly-defined) limitations. I haven't tried to understand your script or the HTML, but did you try this: br = mechanize.Browser(mechanize.RobustFactory()) ...