On Fri, 24 Oct 2008 13:15:49 -0700 (PDT), Mike Driscoll
<[EMAIL PROTECTED]> wrote:
>On Oct 24, 2:53 pm, Rex <[EMAIL PROTECTED]> wrote:
>> By the way, if you're doing non-trivial web scraping, the mechanize
>> module might make your work much easier. You can install it with
>> easy_install.http://ww
Lie Ryan <[EMAIL PROTECTED]> wrote:
>
>Cookies?
Yes, please. I'll take two. Chocolate chip. With milk.
--
Tim Roberts, [EMAIL PROTECTED]
Providenza & Boekelheide, Inc.
--
http://mail.python.org/mailman/listinfo/python-list
On Fri, 24 Oct 2008 20:38:37 +0200, Gilles Ganault wrote:
> Hello
>
> After scratching my head as to why I failed finding data from a web
> using the "re" module, I discovered that a web page as downloaded by
> urllib doesn't match what is displayed when viewing the source page in
> FireFox.
>
On Oct 24, 2:53 pm, Rex <[EMAIL PROTECTED]> wrote:
> Right. If you want to get the same results with your Python script
> that you did with Firefox, you can modify the browser headers in your
> code.
>
> Here's an example with
> urllib2:http://vsbabu.org/mt/archives/2003/05/27/urllib2_setting_http
Right. If you want to get the same results with your Python script
that you did with Firefox, you can modify the browser headers in your
code.
Here's an example with urllib2:
http://vsbabu.org/mt/archives/2003/05/27/urllib2_setting_http_headers.html
By the way, if you're doing non-trivial web scr
Gilles Ganault wrote:
> After scratching my head as to why I failed finding data from a web
> using the "re" module, I discovered that a web page as downloaded by
> urllib doesn't match what is displayed when viewing the source page in
> FireFox.
>
> For instance, when searching Amazon for "Wargam
Hello
After scratching my head as to why I failed finding data from a web
using the "re" module, I discovered that a web page as downloaded by
urllib doesn't match what is displayed when viewing the source page in
FireFox.
For instance, when searching Amazon for "Wargames":
URLLIB:
http://www.am