[EMAIL PROTECTED]

> The first level of the web-site (article summaries) can be accessed
without 
> a subscription, but the articles themselves (depth 2) require a username
and 
> password to access. 

> I have tried using the recommended format 
> "http://username:[EMAIL PROTECTED]/gst/pop_top.html", but I have
not 
> had any luck downloading all the links (the actual articles)at depth two. 
> Instead, I get the following message (25 times):
> "You must register to view this page". 

I believe the NY Times uses redirection and cookies; JPluck has been 
able to handle it in the past, but the python plucker cannot yet.
(It's on my todo list, but ... hasn't happened yet.  If someone else
is eager, then I suggest looking at ClientCookie from sourceforge, 
which will be the basis of CookieLib in the standard lib for python 2.4.)

-jJ
_______________________________________________
plucker-list mailing list
[EMAIL PROTECTED]
http://lists.rubberchicken.org/mailman/listinfo/plucker-list

Reply via email to