Vaijanath N. Rao wrote:
Hi all,

How about adding hadoop support for distributed indexing. If required I can start working on this. If Hadoop is the fesiable option.

Also what other technique one can think for doing distributed Indexing. Currently I am planning on extending the SolrJ to keep a map of where the document has gone and trying to get a distibuted Indexing.

DistributedFileSystem performance for random seeks is several times worse than that of LocalFileSystem. This directly impacts Lucene response time.

One solution would be to implement the searching as an application that executes in a distributed fashion (not sure if map-reduce is the best model here), but first copies the indexes to LocalFileSystem.

--
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to