hi john, i started with nutch 2.2.1 version.
as everyone suggested i tried 2.x branch which is working on hbase-94.10 on hadoop-1.2 i need to check whether it will work on hadoop 2.2. i will let you know shortly -yeshwanth On Tue, Jul 15, 2014 at 6:41 PM, Lewis John Mcgibbney < [email protected]> wrote: > Hi yeshwanth, > > On Tue, Jul 15, 2014 at 6:32 AM, <[email protected]> > wrote: > > > > > i am using hbase 0.94.10 on top of hadoop 2.2. > > > > now i need to crawl the websites and store the results in hbase. > > i saw that nutch doesn't have integration with gora 0.4 and higher > versions > > of hbase. > > > > Which Nutch version are you talking about? We can't guess this unless you > tell us. > Please check out Nutch 2.X branch and use this, you will then be able to > push Data into HBase 0.94.14 IIRC. Please tell us how you get on and we can > work to make sure your stack is working. > https://svn.apache.org/repos/asf/nutch/branches/2.x/ > > > > > > i went through nutch java api documentation for the possibility of > crawling > > through custom code. > > where i found the nutch is totally dependent on gora. > > i don't see any other possible ways here. > > > > Possible ways for what? > Nutch 2.X is dependent upon Gora... meaning that it is independent of > backends. In theory this is dependent that versions of the backends are > backwards compatible. Additionally, I would always advise you to look into > the Gora parent pom.xml for versioning. > https://github.com/apache/gora/blob/master/pom.xml#L616 > > > > > > can someone suggest me a way to store the crawled data using Nutch into > > hbase > > > > > > Please see above. > hth > Lewis >

