We have discussed it but not implemented it. A prerequisite to
implementing interfaces that use HBase for the current Nutch databases was to
make the Nutch architecture itself more flexible. This is what I have
been calling Nutch 2 and what I am currently working on.
Dennis
Marcus Herou wrote:
Hi.
Has anyone tried to implement HBase as storage for:
* CrawlDB
* LinkDB
* Fetched and parsed URL data
It would certainly be cool, I think, to be able to search all three of these
DBs. Currently it is a little hard to use the crawled data without
actually indexing it.
Kindly
//Marcus