Tony Godshall <[EMAIL PROTECTED]> wrote:
> On Sun, Sep 16, 2001 at 05:43:14PM +0100, Andrew M. Bishop wrote:
> > Tony Godshall <[EMAIL PROTECTED]> wrote:
> > > What wwwoffle
> > > does for me is to let me pull web content onto my notebook
> > > computer, disconnect from the LAN, and read it on the bus or
> > > on the plane or in the park.
> >
> > If you are only ever using one of the two machines at any time then
> > the easiest way is to copy the whole WWWOFFLE cache from one to the
> > other. I use rsync to copy the files over with the minimum trouble.
> > You need to copy all of the files that normally live under
> > /var/spool/wwwoffle/ and the wwwoffle.conf file.
> >
> > Doing it this way there is no merging to do.
>
> Thanks. This could work for me. Do you know if the
> wwwoffle on one machine will clean up files deleted on
> the other? Files expired from cache? Or will this
> continual copy back and forth procedure eventually result
> in accumulating a lot of cruft on one that the other was
> unaware of and vice versa... a larger and larger cache that
> never gets cleaned up?
If you use rsync with its --delete option then you end up with a
duplicate of the cache from one machine on the other one. Any files
deleted from the source machine will be deleted from the other.
The one extra thing that you should do is delete the contents of the
monitor directory on the machine that you are copying to; otherwise,
if the timestamps change, you may find that monitored pages don't get
requested.
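
As a rough illustration of the copy described above, here is a minimal
Python sketch. It only builds the rsync argument list (it does not run
it) and provides a helper that empties a directory such as the monitor
directory. The host name "notebook" and both function names are made
up for the example, and the spool path is the one mentioned in this
thread, so adjust it to your installation. Whether the monitor
directory should be emptied before or after the copy is not stated
above, so the sketch leaves that ordering to you.

```python
import shutil
from pathlib import Path

# Path taken from this thread; other installs may use a different location.
SPOOL_DIR = "/var/spool/wwwoffle/"


def build_rsync_command(src_dir, dest_host, dest_dir):
    """Return an rsync argument list mirroring src_dir onto dest_host:dest_dir."""
    # A trailing slash on the source means "copy the contents of this directory".
    src = src_dir.rstrip("/") + "/"
    dest = f"{dest_host}:{dest_dir.rstrip('/')}/"
    # -a preserves permissions and timestamps; --delete removes files on the
    # destination that no longer exist on the source.
    return ["rsync", "-a", "--delete", src, dest]


def clear_directory(path):
    """Delete everything inside path but keep the directory itself.

    Returns the number of top-level entries removed.
    """
    removed = 0
    for entry in Path(path).iterdir():
        if entry.is_dir() and not entry.is_symlink():
            shutil.rmtree(entry)
        else:
            entry.unlink()
        removed += 1
    return removed


if __name__ == "__main__":
    # "notebook" is a hypothetical host name used only for illustration.
    print(" ".join(build_rsync_command(SPOOL_DIR, "notebook", SPOOL_DIR)))
```

The wwwoffle.conf file is not covered by this command and would need
to be copied separately if it lives outside the spool directory.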
> Is there any doc on wwwoffle's caching
> behavior and file formats?
The source code!
The file format should be easy to work out. Just look at the files:
those starting with a 'U' contain the URL for the corresponding files
starting with a 'D'. Those starting with an 'O' are requests to be
made and not pages that are cached.
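
To make the naming scheme above concrete, here is a small Python
sketch that builds a URL-to-file index for one host directory. It
assumes that the 'U' file and the 'D' file for a page share the rest
of their name and that a 'U' file holds nothing but the URL; neither
detail is spelled out above, so check against your own cache. The
function name is made up for the example.

```python
from pathlib import Path


def index_host_directory(host_dir):
    """Map each cached URL to the path of its 'D' file in one host directory."""
    host_dir = Path(host_dir)
    index = {}
    for url_file in sorted(host_dir.glob("U*")):
        if not url_file.is_file():
            continue
        # Assumption: the 'U' and 'D' files of one page share the rest of the name.
        data_file = host_dir / ("D" + url_file.name[1:])
        if not data_file.is_file():
            continue  # a 'U' file with no matching 'D' file is skipped
        # Assumption: the 'U' file contains only the URL text.
        url = url_file.read_text(errors="replace").strip()
        index[url] = data_file
    return index
```

Calling it on a directory such as http/www.example.com under the spool
directory would return a dictionary keyed by URL.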
> Also, what happens to pages the desktop machine is
> monitoring? Or do you turn off the desktop's wwwoffle when
> you go mobile? If the former, you still have some kind of a
> merging thing happening, right?
I use the latter option. Only one of the two machines is ever on the
internet; when I am away from one machine with the other, the one left
behind remains disconnected.
> > > What I don't know is where wwwoffle stores its list of sites
> > > to fetch and monitor, nor what format they are in, nor
> > > whether I can just copy cache files into a common directory
> > > and expect wwwoffle to find them.
> >
> > These are just stored in subdirectories of the /var/spool/wwwoffle
> > directory: the outgoing directory for requests and the ftp and http
> > directories for the cached files.
>
> Not there... hmmm... oh, /var/cache/wwwoffle has outgoing
> and http and ftp !
>
> Outgoing seems to contain files with gibberish names like
> P10dT+28ZE4LZ2A5Xso3lg !? A lot of the files are 0 bytes
> long too. But at least the ones that contain something do
> contain plaintext (a URL in the form http://localhost:8080/... ).
You shouldn't have any that are zero bytes long; that is strange.
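
If you want to see which files are affected, a minimal Python sketch
like the following lists the zero-byte files in a directory such as
the outgoing one. The function name is made up for the example, and
the sketch only reports the files; it does not delete anything.

```python
from pathlib import Path


def find_empty_files(directory):
    """Return the sorted names of regular files in directory that are zero bytes long."""
    return sorted(
        p.name
        for p in Path(directory).iterdir()
        if p.is_file() and p.stat().st_size == 0
    )
```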
> http has directories named like hostnames. This is good.
> But the files in them again have those weird
> gibberish names. Contents seem to be html/plaintext.
The contents are what the server sends back.
> I think I'm missing some kind of database thingy that would
> map the URLs to the files and would indicate the cache
> status of each site... where's that?
There isn't one unless you count the files whose names start with
'U'. There is no file for the cache status of each site since
everything is done on a URL-by-URL basis.
--
Andrew.
----------------------------------------------------------------------
Andrew M. Bishop [EMAIL PROTECTED]
http://www.gedanken.demon.co.uk/
WWWOFFLE users page:
http://www.gedanken.demon.co.uk/wwwoffle/version-2.6/user.html