Crawling relation database

lewis john mcgibbney Tue, 05 Jul 2011 15:44:38 -0700

Hi,

I'm curious to hear if anyone has information for configuring Nutch to crawl
a RDB such as MySQL. In my hypothetical example there are N number of
databases residing in various distributed geographical locations, to make a
worst case scenario, say that they are NOT all the same type, and I wish to
use Nutch trunk 2.0 to push the results to some other structured data store
which I can then connect to to serve search results.


Does anyone have any information such as an overview of database crawling
and serving using Nutch? I have been unsuccesful obtaining info on the Web
as query results are ambiguous and usually refer to crawldb or linkdb.

If I can get this it would be a real nice entry for inclusion in our wiki.

Thanks for any suggestions or info.

-- 
*Lewis*

Crawling relation database

Reply via email to