>Seems to me that we need access by url to see if the document has
>already been indexed.
Right. But let's say we keep a (temporary) url -> docID list/database while
indexing...
With that, a search request makes one lookup per document returned.
As things stand now, a search request makes two lookups per document returned...
Hmm. If given a choice between slowing down the indexing (if at all) to
speed up the search, or vice-versa, I'll choose the faster searching every
time. Besides, the URL -> docID list is only needed when indexing, so
you can delete it if pressed for space.
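
To make the idea concrete, here is a rough sketch in Python. It is not ht://Dig code; the class and method names (Index, add, finish_indexing, search) are made up for illustration, and in-memory dicts stand in for the on-disk databases. It assumes document records are keyed by docID, with the URL -> docID map consulted only while indexing.

```python
class Index:
    """Toy index: document records keyed by docID (hypothetical layout)."""

    def __init__(self):
        self.docs = {}        # docID -> document record, used at search time
        self.words = {}       # word -> set of docIDs containing it
        self.url_to_id = {}   # URL -> docID, only needed while indexing
        self.next_id = 0

    def add(self, url, text):
        """Index a document unless its URL has already been seen."""
        if self.url_to_id is None:
            raise RuntimeError("URL -> docID map was deleted; cannot index")
        if url in self.url_to_id:
            return self.url_to_id[url]   # already indexed, reuse its docID
        doc_id = self.next_id
        self.next_id += 1
        self.url_to_id[url] = doc_id
        self.docs[doc_id] = {"url": url, "text": text}
        for word in text.lower().split():
            self.words.setdefault(word, set()).add(doc_id)
        return doc_id

    def finish_indexing(self):
        """Drop the temporary map, e.g. when pressed for space."""
        self.url_to_id = None

    def search(self, word):
        """One docs lookup per document returned."""
        doc_ids = sorted(self.words.get(word.lower(), ()))
        return [self.docs[doc_id] for doc_id in doc_ids]
```

Searching never touches url_to_id, so dropping it after the indexing run costs nothing at search time. The price is that it would have to be rebuilt (or kept around) before the next indexing run.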
-Geoff