Felix Karpfen <[EMAIL PROTECTED]> wrote:
> Attached (edited unprofessionally) are the latest entries on the list
> displayed by WWWOFFLE Cache Index -> http.
> 
> I have marked on it, the URLs that are explicitly listed in my DontGet
> and DontIndex lists in wwwoffle.conf.
> 
> If I click on these listed items, WWWOFFLE correctly advises that these
> items were not <fetched/indexed>.
> 
> Would it also be possible to persuade WWWOFFLE not to display them on
> the Cache Index?

The URLs that are from these servers will not appear in the WWWOFFLE
indexes.  What you are seeing is a list of the http servers that are
cached.

It is not possible for WWWOFFLE to know if any URLs in the directory
will be indexed if you follow the link.

For instance if you had the following in the DontIndex section of the
configuration file:

<!http://www.foo/bar/*> list-any = yes
<http://www.foo/*> list-any = no

Then none of the files on the server www.foo will be listed in the
indexes except the ones in the /bar/ subdirectory.  When WWWOFFLE
generates the http index, how is it to work out whether that server's
index will contain any files?  It can't just test http://www.foo/,
since that URL is not indexed but others on the same server are.
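
To illustrate the point, here is a minimal Python sketch.  It is not
WWWOFFLE's code: the rule table, the function names, the example URLs
and the first-match-wins rule order are all assumptions made for the
illustration, and it models only the outcome described above, not the
configuration file syntax.

```python
from fnmatch import fnmatchcase

# Hypothetical rule table modelling the behaviour described above:
# everything on www.foo is hidden except the /bar/ subdirectory.
# Assumption: rules are checked in order and the first match wins.
RULES = [
    ("http://www.foo/bar/*", True),   # listed in the indexes
    ("http://www.foo/*", False),      # not listed in the indexes
]


def is_listed(url, rules=RULES, default=True):
    """Return whether a single URL would appear in an index."""
    for pattern, listed in rules:
        if fnmatchcase(url, pattern):
            return listed
    return default  # assumption: URLs matching no rule are listed


def host_has_listed_urls(cached_urls, rules=RULES):
    """Decide whether a host's index would be non-empty.

    This needs every cached URL for the host; testing only the
    host's root URL gives the wrong answer for the rules above.
    """
    return any(is_listed(url, rules) for url in cached_urls)
```

Under these assumptions the root URL http://www.foo/ is not listed,
yet a cached http://www.foo/bar/page.html is, so the answer for the
server depends on which URLs happen to be in the cache.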


The best option is to purge the files that are in the DontGet section
of the configuration file as was suggested by Marc Boucher
<[EMAIL PROTECTED]> in the other reply.

-- 
Andrew.
----------------------------------------------------------------------
Andrew M. Bishop                             [EMAIL PROTECTED]
                                      http://www.gedanken.demon.co.uk/

WWWOFFLE users page:
        http://www.gedanken.demon.co.uk/wwwoffle/version-2.6/user.html
