Hi yeshwanth, On Tue, Jul 15, 2014 at 6:32 AM, <[email protected]> wrote:
> > i am using hbase 0.94.10 on top of hadoop 2.2. > > now i need to crawl the websites and store the results in hbase. > i saw that nutch doesn't have integration with gora 0.4 and higher versions > of hbase. > Which Nutch version are you talking about? We can't guess this unless you tell us. Please check out Nutch 2.X branch and use this, you will then be able to push Data into HBase 0.94.14 IIRC. Please tell us how you get on and we can work to make sure your stack is working. https://svn.apache.org/repos/asf/nutch/branches/2.x/ > > i went through nutch java api documentation for the possibility of crawling > through custom code. > where i found the nutch is totally dependent on gora. > i don't see any other possible ways here. > Possible ways for what? Nutch 2.X is dependent upon Gora... meaning that it is independent of backends. In theory this is dependent that versions of the backends are backwards compatible. Additionally, I would always advise you to look into the Gora parent pom.xml for versioning. https://github.com/apache/gora/blob/master/pom.xml#L616 > > can someone suggest me a way to store the crawled data using Nutch into > hbase > > > Please see above. hth Lewis

