Chris> ---On Thu, 05 Apr 2001 08:57:05 -0500,  Jeff Maxson said

>> http://199.97.97.184/nytimes-partners/avantgo/main.html from my plucker
>> page.  I just now clicked on that, and IE brought up april 2nd.  However, 
>> if I refresh the page in IE, it then brings up April 5th (the right 
>> one).  Interestingly, once I have refreshed the file from IE, now the 
>> Plucker spyder will grab april 5th instead of April 2nd, and will continue 
>> to get April 5th until I refresh it from IE again (or wait a week or so, I 
>> think).  This happens with the wall street journal as well.  Why would 
>> Plucker be linked to what I refresh in IE? I would rather not have to 
>> refresh all my pages in IE just so I can download them into my Palm...

Well, only a guess. Maybe there are something with the page that cause
that the proxy do not refresh this site (maybe the time stamp not
change). Remember in most ISP you not have a direct Internet
connection you go to a transparent proxy.

Maybe the "Refresh" command in IE add some headers to the request that
say to the proxy "get the original, not the cached".

Does someone here know more about the HTTP request? Maybe we could add
a flag at the parser for this sort of sites.

>>>Plucker has no way (except the cache files) to store web pages, it simply
>>>gathers the pages from the server.

OK on windows if the creation of the DB are fail the previus one are
still here. So to be sure delete the cache files and the old DB in the
DB directory.

cu,
 Dirk

-- 
Permanent URLs to the latest Version (1.1SR2) of the Plucker Windows installer
 - For the Webpage: http://www.dirk-heiser.de/plucker
 - Direct Download: http://www.dirk-heiser.de/plucker/plucker.exe [2.79MB]

Reply via email to