Hi

A newbie to the world of lucene, nutch , mahout, spent all weekend on Mahout, 
and now looking at Nutch. So I have a question, its seems (after reading the 
archives) that alot of people are using Nutch to index the web, whether for 
vertical searches, or just the web as a whole. Now rather than everyone 
starting again from scratch, and since very little (if any) "IP" would exist in 
the index, since nothing clever has been done to them except being processed by 
Nutch, would it not be possible to "share" all these indexes with each other, 
i.e if someone has built an index of all blogs, or all car related websites, or 
just indexed 100 million webpages at random. Maybe there is some tech reason I 
am missing.

Paul



      

Reply via email to