Re: Help with mass delete from large index

Michael D. Curtin Wed, 15 Feb 2006 07:38:01 -0800

Chandramohan wrote:

perform such a cull again, you might make several
distinct indexes (one perday, per week, per whatever) during that reindexingso the next time will bemuch easier.
How would you search and consolidate the results
across multiple indexes?  Hits from each index will
have independent scoring.

Frankly, I ignore the scores in my application. The data itself isn't Englishprose, so the TF/IDF calcuations are stretched at best, as a measure ofrelevance. I presort the documents to be in "relevance" order (a popularitymetric), then specify index ordering for the results.

If that wouldn't work for your application, it seems to me that large-enoughsub-sections *would* produce equivalent scores. That is, if the sub-indexeswere big enough, one could directly compare scores, so a simple merge wouldwork. If the total document corpus is small, then the need for sub-indexesisn't there anyhow.


--MDC

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Help with mass delete from large index

Reply via email to