On Sun, Sep 16, 2001 at 05:43:14PM +0100, Andrew M. Bishop wrote:
> Tony Godshall <[EMAIL PROTECTED]> wrote:
> > What wwwoffle 
> > does for me is to let me pull web content onto my notebook 
> > computer, disconnect from the LAN, and read it on the bus or 
> > on the plane or in the park.
> 
> > I'm also using it on my desktop machine to selectively
> > prefetch websites I visit often (great feature) and cut out
> > the most offensive adverts
> 
> > What I'd love to do (if it's possible) is to merge the
> > settings and sites to monitor and sites to fetch and sites 
> > to block to the laptop when I leave and merge the requests
> > I made when on the road back to the desktop machine when I
> > come back.  Ideally it would merge caches too.  My ideal
> > would be to have wwwoffle monitoring the sites I typically
> > visit so I can read the latest slashdot and news and comic
> > strips unconnected.  
> 
> If you are only ever using one of the two machines at any time then
> the easiest way is to copy the whole WWWOFFLE cache from one to the
> other.  I use rsync to copy the files over with the minimum trouble.
> You need to copy all of the files that normally live under
> /var/spool/wwwoffle/ and the wwwoffle.conf file.
>
> This is the way that I do it when I transfer my cache to my laptop.
> But I only use the laptop when away from home and my usual dialup
> computer.  I do use the laptop for dialup so in effect I have only one
> computer that I use although physically it is one of two.
> 
> Doing it this way there is no merging to do.

Thanks.  This could work for me.  Do you know if the
wwwoffle on one machine will clean up files deleted on 
the other?  Files expired from cache?  Or will this 
continual copy back and forth procedure eventually result 
in accumulating a lot of cruft on one that the other was
unaware of and vise versa... larger and larger cache that 
never gets cleaned up?  Is there any doc on wwwoffle's caching
behavior and file formats?

Also, what happens to pages the desktop machine is
monitoring?  Or do you turn off the desktop's wwwoffle when
you go mobile?  If the prior you still have some kind of a
merging thing happening, right?

> > I know where the block sites url regexps are (after all, 
> > you have to edit the .conf file to do it).  I'm not sure if 
> > automatically merging .conf files is good practice, but I
> > could probably do it with a little perl.  
> 
> If you really do want to merge the config files then you are on your
> own.  It shouldn't be impossible, but if you don't edit them both at
> once there should be no merging to do.
> 
> The other option it so put all the machine specific options into
> included files (use '[' & ']' in the config file, read introduction at
> the top of wwwoffle.conf).  This way you can have the main
> configuration different, but the list of adverts common.

Thanks!  That really helps.

> > What I don't know is where wwwoffle stores its list of sites
> > to fetch and monitor, nor what format they are in, nor
> > whether I can just copy cache files into a common directory
> > and expect wwwoffle to find them.
> 
> These are just stored in subdirectories of the /var/spool/wwwoffle
> directory.  The outgoing directory for requests and the ftp and http
> directories for the cached files.  There is absolutely no problem with
> adding and deleting files in these directories.  One of the design
> requirements was that there is not a central database of cached URLs
> that needs to be updated.

Not there... hmmm... oh, /var/cache/wwwoffle has outgoing
and http and ftp !

Outgoing seems to contain files with gibberish names like 
P10dT+28ZE4LZ2A5Xso3lg !?  A lot of the files are 0 bytes 
long too.  But at least the ones that contain something do 
contain plaintext (a URL in the form http://localhost:8080/... ).

http has directories named like hostnames.  This is good.
But the files in them again have those files with weird
gibberish names.  Conents seem to be html/plaintext. 

I think I'm missing some kind of database thingy that would
map the URLs to the files and would indicate the cache
status of each site... where's that?

Thanks for the help

--
T

Reply via email to