Hi all,
Reading about indexing problems, I came to wonder whether wwwoffle might 
have a function that would index its cache 'on the fly'. I think wwwoffle 
might create a task running htdig and then send any new pages being 
cached to that process, which in turn would take care of on the fly indexing.

Just a few thoughts
Remy
>>>>>>>>>>>>>>>>>> Message d'origine <<<<<<<<<<<<<<<<<<

Le 09/04/2001, à 20:31:14 h, Andrew "M." Bishop <[EMAIL PROTECTED]> 
vous a écrit sur le sujet suivant Re: [WWWOFFLE-Users] Full cache list 
missing...:


> Colin Marquardt <[EMAIL PROTECTED]> wrote:
> > "Andrew M. Bishop" <[EMAIL PROTECTED]> writes:
> > > Colin Marquardt <[EMAIL PROTECTED]> wrote:
> > > > With the new version 2.6 (from Debian unstable, if that matters), I
> > > > do not see a way to get the full cache index anymore (called "Latest"
> > > > in older versions, IIRC). Is there a way to get it back?
> > >
> > > This index option has been removed from the latest version.

> > > The reasons for doing this are that it took too long to create (it
> > > needed to search through all hosts to find the newest files) and there
> >
> > IIRC all it took stopping the creation process is to press the Stop
> > button in ones browser, no? Have there been actual complaints from
> > users who were accidentally accessing this index and couldn't stop it?

> No, this would just stop the browser displaying the page, the indexing
> would continue in the background.

> The code for this index was also rather different from the other code
> for the other indexes.  It needed recursion across directories which
> the others didn't.

> The other problem is that to maintain a consistent interface the
> buttons on the top of the index pages mean that you can select
> "Latest" while sorted alphabetically.  If you actually wanted the
> "Latest" index sorted by date then you needed to select that and
> search again.  It didn't cache the result so that it could be
> displayed in a different order.


> > Okay, I haven't yet checked out the htDig interface which might be
> > able to do this too, but I guess it takes even longer to create an
> > index with a * wildcard.

> Using htDig (or other search engine) is definitely the way to go.  If
> you get htdig to index all the cached files every night then you can
> search for a word anywhere in the cache (not just in the URL) in a few
> seconds.  It beats trying to remember what the name of the URL was and
> looking through a list of 200 files from the last 30 days.


> Have you tried using the "Date changed" sort order in the http index?
> It will show you the hosts whose pages you have visited in date
> order.  You can then try likely hosts and they will also be sorted in
> date order.


> I really don't want to add this index back in again (in case you
> hadn't guessed).


> --
> Andrew.
> ----------------------------------------------------------------------
> Andrew M. Bishop                             [EMAIL PROTECTED]
>                                       http://www.gedanken.demon.co.uk/

> WWWOFFLE users page:
>         http://www.gedanken.demon.co.uk/wwwoffle/version-2.6/user.html

Reply via email to