Hi,

I have a problem using WWWOFFLE with some websites that redirect you to an
authentication page when you request an article without first logging in.
An example of this can found at the site of the New York Times.
For instance, if you try to view the article with the URL
http://www.nytimes.com/2001/12/20/science/20QUAN.html, you get a response
that redirects you to
http://www.nytimes.com/auth/login?URI=http://www.nytimes.com/2001/12/20/science/20QUAN.html,
which automatically authenticates you if have the appropriate cookies set.
The last page then redirects you back to
http://www.nytimes.com/2001/12/20/science/20QUAN.html.
The trouble begins if WWWOFFLE instead of refetching the first URL, gives
the browser the cached version which is a redirection. In this case you get
caught in end endless loop of redirections.

Of course there are various solutions to this problem.
The WWWOFFLE FAQ (Q 3.2) recommends that you put such a site in the
DontCache section of the configuration file. The problem with this is that I
_want_ to cache the articles, so that I can also view them offline.

I've also experimented with request-changed-once, request-expired and
request-no-cache options, but the only setting that seems to work for me is:

<http://*.nytimes.com/*.html> request-changed = 0

The only remaining problem with this solution is that while browsing online,
I risk refetching the contents of static articles needlessly.
The neatest solution would be if I could get WWWOFFLE to cache the contents
of articles, but not the redirections.
Does any one know how to achieve this?

-- 
Paul A. Rombouts <[EMAIL PROTECTED]>
Vincent van Goghlaan 27
NL-5246 GA  Rosmalen
Netherlands

Reply via email to