Re: Spliting the Lucene

Andrzej Bialecki Fri, 08 Dec 2006 01:02:18 -0800

howard chen wrote:

Hi,


A friend from Hadoop told me someone on the list has code for splitting
the Lucene index, can anyone point me to the right place?

You are probably referring to the emails we exchanged with Dennis Kubes - in that case there was no index splitting involved; rather, the body of documents to be indexed was split into parts, and each part was then indexed separately to form many smaller indexes.
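
To illustrate the "split the input, not the index" idea, here is a minimal sketch in Python. It only shows the partitioning step; the function name split_corpus and the round-robin assignment are assumptions made for illustration, not the code from that exchange, and each resulting part would still have to be fed to its own indexer.

```python
from typing import Iterable, List, TypeVar

T = TypeVar("T")


def split_corpus(docs: Iterable[T], num_parts: int) -> List[List[T]]:
    """Distribute documents round-robin into num_parts groups.

    Hypothetical helper: each returned group is meant to be indexed
    on its own, producing one small index per group.
    """
    if num_parts < 1:
        raise ValueError("num_parts must be at least 1")
    parts: List[List[T]] = [[] for _ in range(num_parts)]
    for position, doc in enumerate(docs):
        # Document at position i goes to group i modulo num_parts.
        parts[position % num_parts].append(doc)
    return parts
```

Round-robin keeps the parts within one document of each other in size; hashing on a document key would work as well if the same document must always land in the same part.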

A true index splitter doesn't exist (yet), but it shouldn't be too difficult to implement, just tedious - around 3 days of work ... Some people on this list also contemplated a semi-splitter (also non-existent yet), which splits the index only on segment boundaries - this should be much easier to implement, as it's just a question of copying selected segments into new places and re-creating "segments" files - although this method is much less flexible than a true splitter.
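
A rough sketch of the planning half of such a semi-splitter, in Python, to show why it is less flexible. Everything here is an assumption for illustration: the function name plan_segment_split, the segment names, the per-segment document counts (which would have to be read from the real index), and the greedy balancing rule. It does not copy any files or write a "segments" file; it only decides which whole segment would go to which output index.

```python
from typing import Dict, List


def plan_segment_split(segment_sizes: Dict[str, int],
                       num_outputs: int) -> List[List[str]]:
    """Assign whole segments to num_outputs target indexes.

    segment_sizes maps a (hypothetical) segment name to its document
    count. Segments are never divided, so balance is only approximate.
    """
    if num_outputs < 1:
        raise ValueError("num_outputs must be at least 1")
    groups: List[List[str]] = [[] for _ in range(num_outputs)]
    totals = [0] * num_outputs
    # Largest segments first; ties broken by name for a stable result.
    ordered = sorted(segment_sizes.items(), key=lambda kv: (-kv[1], kv[0]))
    for name, size in ordered:
        # Place the segment in the currently smallest output index.
        target = totals.index(min(totals))
        groups[target].append(name)
        totals[target] += size
    return groups
```

With one segment of 900 documents and two of 50, a two-way plan can do no better than 900 versus 100, because the large segment cannot be divided; a true splitter would not have that limitation.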


--
Best regards,
Andrzej Bialecki     <><
___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com



