Consider the following situation (running WWWOFFLE 2.9a under Linux):
Suppose I have a list of URLs, say, URL1 URL2 URL3..., some of which
are already present in the cache. Now I want to make sure that, once
I am online again, those and only those URLi will be fetched that are
not already in the cache.
- One way to do this would be to start the browser and request each
one of the URLi, which would do the right thing (show those pages
that are already in the cache, mark for fetching those that are
missing) but is extremely tedious if there are many URLs.
- The simplest approach I have come up with is
cat <list-of-URLs> | xargs -I '{}' wwwoffle -o '{}' > /dev/null
which seems to work.
But what I would really like is some additional option such that
cat <list-of-URLs> | xargs wwwoffle -<new-option>
will do the right thing, i.e., `wwwoffle -<new-option> URL1 URL2...`
would behave like `wwwoffle URL1 URL2...` for URLi not already cached
but skip those that are already present (and output some message like
'URLi is already in the cache, will not be fetched again' on stderr).
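
To make the wished-for behaviour concrete, here is a rough sketch of it as a
shell wrapper. It is an illustration only, not an existing WWWOFFLE feature:
the names `fetch_missing`, `is_cached`, `request_url`, `CACHED_LIST` and
`WWWOFFLE` are made up for this example. In particular, `is_cached` just
looks the URL up in a plain-text file of URLs, standing in for a real cache
lookup that would have to come from WWWOFFLE itself.

```shell
#!/bin/sh
# Sketch only: emulates the proposed option with a wrapper function.

# is_cached URL
# Hypothetical placeholder: succeeds if URL appears as a whole line in the
# file named by $CACHED_LIST. A real test would have to query the cache.
is_cached() {
    grep -qxF -- "$1" "${CACHED_LIST:-/dev/null}"
}

# request_url URL
# Requests the URL as `wwwoffle URL` would; $WWWOFFLE can override the
# command, e.g. for a dry run.
request_url() {
    ${WWWOFFLE:-wwwoffle} "$1"
}

# fetch_missing URL...
# Requests only the URLs that is_cached does not report as present and
# prints a notice on stderr for each one that is skipped.
fetch_missing() {
    for url in "$@"; do
        if is_cached "$url"; then
            echo "$url is already in the cache, will not be fetched again" >&2
        else
            request_url "$url"
        fi
    done
}
```

With the functions loaded in the current shell, `fetch_missing URL1 URL2...`
would then take the place of `wwwoffle -<new-option> URL1 URL2...`, and
setting `WWWOFFLE=echo` shows which URLs would be requested without
contacting the proxy.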
Would this be a valuable feature to add? And is there any chance this
will actually be added in the future? Comments?
Best regards, and many thanks to Andrew for WWWOFFLE which has been in
constant use here for many years -
Albert.