>> I think that the distributed online Index part should be done outside of
>> Nutch (or if done here do it with extreme caution:) so it does not get
>> tied to Nutch.
> 
> I am not sure I understand you here. If I have 10 machines I am using
> for serving indexes(I am assuming I have a Solr instance running on
> each one), IndexerSolr should be able to partition my index to 10
> machines.

There are more dimensions to distribution (or scaling) and the case you
describe is a very basic one.  Of course we could support such special
setups inside nutch too and just remember that once it starts to look
like a "thing" that can manage large online indexes perhaps it would
serve most goodness if it was not tied to nutch.

-- 
 Sami Siren

-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to