Re: BTree

Doug Cutting Thu, 12 Jan 2006 12:17:27 -0800

B-Tree's are best for random, incremental updates. They requirelog_b(N) disk accesses for inserts, deletes and accesses, where b is thenumber of entries per page, and N is the total number of entries in thetree. But that's too slow for text indexing. Rather Lucene uses acombination of file sorting and merging to update indexes, which is muchfaster than a B-tree would be. For access, Lucene is equivalent to aB-Tree with all but the leaves cached in memory, so that accessesrequire only a single disk access.


Slides 5 and 6 of the following presentation discuss this a bit:


http://www.research.ibm.com/haifa/Workshops/ir2005/papers/DougCutting-Haifa05.pdf

Doug

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: BTree

Reply via email to