howard chen wrote:
> Hi,
>
> A friend from Hadoop told me someone on the list has code for splitting
> the Lucene index, can anyone point me to the right place?

You are probably referring to the emails we exchanged with Dennis Kubes. In that case there was no index splitting involved; rather, the body of documents to be indexed was split into parts, and each part was then indexed separately to form many smaller indexes.
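
To illustrate the idea of splitting the input rather than the index, here is a minimal sketch in Python. It is not code from that email exchange and it uses no Lucene API: the function name `partition_documents` and the use of a CRC32 hash of a document id are assumptions made for the example. Each resulting part would then be fed to its own indexing run.

```python
import zlib
from typing import Iterable, List


def partition_documents(doc_ids: Iterable[str], num_parts: int) -> List[List[str]]:
    """Assign each document id to one of num_parts buckets.

    Hypothetical helper: a stable hash (CRC32) is used so that the same
    id always lands in the same part, regardless of input order.
    """
    if num_parts < 1:
        raise ValueError("num_parts must be at least 1")
    parts: List[List[str]] = [[] for _ in range(num_parts)]
    for doc_id in doc_ids:
        bucket = zlib.crc32(doc_id.encode("utf-8")) % num_parts
        parts[bucket].append(doc_id)
    return parts


if __name__ == "__main__":
    docs = ["doc-%d" % i for i in range(10)]
    for number, part in enumerate(partition_documents(docs, 3)):
        # Each part would be indexed separately into its own small index.
        print(number, part)
```

Hashing on the id keeps the assignment repeatable, so a re-run sends an updated document to the same small index; a simple round-robin split would work just as well for a one-off build.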

A true index splitter doesn't exist (yet), but it shouldn't be too difficult to implement, just tedious: around 3 days of work. Some people on this list have also contemplated a semi-splitter (which doesn't exist yet either) that splits the index only on segment boundaries. This should be much easier to implement, as it's just a question of copying the selected segments into new locations and re-creating the "segments" files, although this method is much less flexible than a true splitter.
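
The planning half of such a semi-splitter can be sketched as follows. This is only an illustration under stated assumptions: the segment names and sizes are made up, `group_segments` is a hypothetical helper, and the sketch deliberately does not copy any files or rewrite the "segments" file, which is the part that depends on Lucene's on-disk format. It only decides which whole segments would go to which output index, trying to keep the outputs roughly equal in size.

```python
import heapq
from typing import Dict, List


def group_segments(segment_sizes: Dict[str, int], num_groups: int) -> List[List[str]]:
    """Greedily assign whole segments to num_groups output indexes.

    Largest segments are placed first, each into the group that is
    currently smallest. Segments are never divided, which is why the
    result can be unbalanced when a few segments dominate.
    """
    if num_groups < 1:
        raise ValueError("num_groups must be at least 1")
    groups: List[List[str]] = [[] for _ in range(num_groups)]
    # Heap of (total size so far, group index); smallest group pops first.
    heap = [(0, i) for i in range(num_groups)]
    heapq.heapify(heap)
    ordered = sorted(segment_sizes.items(), key=lambda kv: (-kv[1], kv[0]))
    for name, size in ordered:
        total, index = heapq.heappop(heap)
        groups[index].append(name)
        heapq.heappush(heap, (total + size, index))
    return groups


if __name__ == "__main__":
    # Made-up segment names and sizes, for illustration only.
    sizes = {"_a": 50, "_b": 30, "_c": 20, "_d": 10}
    print(group_segments(sizes, 2))
```

Because a segment is the smallest unit that can be moved, an index that has been optimized down to a single segment cannot be divided this way at all, which is the loss of flexibility mentioned above.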

--
Best regards,
Andrzej Bialecki     <><
___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com


