Hi Tom, Download and configure both HBase and Solr and make them up. You do not need to build Gora at your case (also neither Hbase nor Solr). It is a dependency included at Nutch.
Nutch will crawl webpages and use Gora as a backend system to communicate with Hbase and Solr. Kind Regards, Furkan KAMACI 20 Şub 2016 10:45 tarihinde "Tom Running" <runningt...@gmail.com> yazdı: > I meant SOLR 4.10.3 instead SOLR 2.X > > On Sat, Feb 20, 2016 at 3:44 AM, Tom Running <runningt...@gmail.com> > wrote: > >> Great. Thank you. >> >> I am just wondering. How is building GORA will help with anything in my >> situation? probably not, right? it doesn't seem I need to use any of the >> built. >> >> It seems GORA already included in the SOLR 2.X and HBASE .98.9 release. >> Is this a correct assumption? >> >> Thank you. >> Tom >> >> On Sat, Feb 20, 2016 at 1:35 AM, Lewis John Mcgibbney < >> lewis.mcgibb...@gmail.com> wrote: >> >>> Hi Tom, >>> All you need to do is ensure that gora-hbase dependency is uncommented >>> within $NUTCH_HOME/ivy/ivy.xml >>> https://github.com/apache/nutch/blob/2.x/ivy/ivy.xml#L116 >>> >>> You then need to ensure that that the storage.data.store.class is >>> correct in $NUTCH_HOME/conf/nutch-default.xml. This needs to be set to >>> 'org.apache.gora.hbase.store.HBaseStore' >>> >>> https://github.com/apache/nutch/blob/2.x/conf/nutch-default.xml#L1333-L1371 >>> >>> Finally, you need to configure $NUTCH_HOME/conf/gora.properties >>> https://github.com/apache/nutch/blob/2.x/conf/gora.properties >>> Make sure that the correct gora-hbase configuration is included. >>> >>> That is all you need to do. >>> Lewis >>> >>> On Fri, Feb 19, 2016 at 10:29 PM, Tom Running <runningt...@gmail.com> >>> wrote: >>> >>>> Furkan, >>>> >>>> What you had mention is exactly what I am trying to accomplish. >>>> > Using Nutch to crawl websites and storing them at Hbase and indexing >>>> at Solr via Gora? >>>> >>>> >>>> I need a bit more help to ensure what I am about to do is correct.. >>>> >>>> #1. >>>> after successfully build GORA. I have the following two .jar files in >>>> /gora/gora-solr/lib/ directory. Lot of .jar files in the /lib directory >>>> but only two .jar files relative to solr. >>>> solr-solrj-4.10.3.jar >>>> solr-core-4.10.3.jar >>>> >>>> >>>> #2. >>>> In the solr source distribution directory I have also see the same >>>> exact .jar files. This is a source code download. I have not build this >>>> solr yet. >>>> >>>> /home/solr/dist >>>> solr-solrj-4.10.3.jar >>>> solr-core-4.10.3.jar >>>> solr-4.10.3.war >>>> >>>> >>>> My question is. Should I copy the two solr files in #1 to >>>> /home/solr/dist/ then build solr? >>>> >>>> >>>> #3. >>>> Should I also do the same thing for hbase. Copy the >>>> /gora/gora-hbase/lib/hbase-* into /hbase/lib/ then build hbase? >>>> >>>> >>>> >>>> Thank you. >>>> Tom >>>> >>>> On Wed, Feb 17, 2016 at 5:31 PM, Furkan KAMACI <furkankam...@gmail.com> >>>> wrote: >>>> >>>>> Hi Tom, >>>>> >>>>> What do you aim? Using Nutch to crawl websites and storing them at >>>>> Hbase and indexing at Solr via Gora? Do you have any other use cases? >>>>> >>>>> "Simply", you may think that Gora will act as Hibernate of NoSQL >>>>> ecosystem at your use case. So, it will not run as a service, it will be a >>>>> dependency. >>>>> >>>>> Kind Regards, >>>>> Furkan KAMACI >>>>> 17 Şub 2016 22:13 tarihinde "Lewis John Mcgibbney" < >>>>> lewis.mcgibb...@gmail.com> yazdı: >>>>> >>>>> Hi Tom, >>>>>> You can just follow the following tutorial >>>>>> http://wiki.apache.org/nutch/Nutch2Tutorial >>>>>> Replacing the gora-hbase configuration from within your Nutch >>>>>> conf/nutch-default.xml and conf/gora.properties and with the relevant >>>>>> dependency from within ivy/ivy.xml with the gora-solr equivalent. >>>>>> Any more issues then please let us know. Gora does not run as a >>>>>> service no, it is a dependency and is managed through your client >>>>>> dependency manager (which in Nutch 2.X is Ivy). >>>>>> Thanks >>>>>> >>>>>> On Wed, Feb 17, 2016 at 12:04 PM, Tom Running <runningt...@gmail.com> >>>>>> wrote: >>>>>> >>>>>>> Furkan and Lewis, >>>>>>> >>>>>>> Thank you for your response to my SOS. I tried varies suggestion on >>>>>>> editing pom.xlm file and including down grade the java JDK version to >>>>>>> 1.7 >>>>>>> and removed the .m2 folder and run mvn clean install again and it >>>>>>> build successfully. >>>>>>> >>>>>>> Now Gora is successfully build. I am trying to understand how to >>>>>>> get Gora run or start in order get the following three packages to work >>>>>>> together Nutch, Solr and Hbase with GORA >>>>>>> Does Gora start as a service? >>>>>>> Or >>>>>>> To get other three packages to work with GORA I will need to copy >>>>>>> the *.jar to the three packages (Nutch, Solr and Hbase) lib folder? >>>>>>> >>>>>>> >>>>>>> *I am a bit confuse on how to get these packages to work with GORA. >>>>>>> I had read GORA's quickstart guide but am still not too clear on what to >>>>>>> do.* >>>>>>> >>>>>>> >>>>>>> *Can you provide some direction.* >>>>>>> >>>>>>> *Thank you.* >>>>>>> >>>>>>> *Tom* >>>>>>> >>>>>>> On Wed, Feb 17, 2016 at 1:56 PM, Furkan KAMACI < >>>>>>> furkankam...@gmail.com> wrote: >>>>>>> >>>>>>>> Hi Tom, >>>>>>>> >>>>>>>> It seems that your maven is at offline mode. There may be a problem >>>>>>>> with your settings.xml or environment variable for maven home. How do >>>>>>>> you >>>>>>>> build your project? Could you build it with -X option and send the >>>>>>>> output? >>>>>>>> >>>>>>>> Kind Regards, >>>>>>>> Furkan KAMACI >>>>>>>> 17 Şub 2016 20:51 tarihinde "Tom Running" <runningt...@gmail.com> >>>>>>>> yazdı: >>>>>>>> >>>>>>>> What to do with the error below. >>>>>>>> >>>>>>>> >>>>>>>> [INFO] Building Apache Gora :: Accumulo 0.6.1 >>>>>>>> [INFO] >>>>>>>> ------------------------------------------------------------------------ >>>>>>>> [WARNING] The POM for org.apache.accumulo:accumulo-core:jar:1.5.1 >>>>>>>> is missing, no dependency information available >>>>>>>> [WARNING] The POM for >>>>>>>> org.apache.accumulo:accumulo-minicluster:jar:1.5.1 is missing, no >>>>>>>> dependency information available >>>>>>>> [WARNING] The POM for org.jboss.netty:netty:jar:3.2.2.Final is >>>>>>>> missing, no dependency information available >>>>>>>> [INFO] >>>>>>>> ------------------------------------------------------------------------ >>>>>>>> [INFO] Reactor Summary: >>>>>>>> [INFO] >>>>>>>> [INFO] Apache Gora ........................................ SUCCESS >>>>>>>> [ 1.468 s] >>>>>>>> [INFO] Apache Gora :: Compiler ............................ SUCCESS >>>>>>>> [ 0.121 s] >>>>>>>> [INFO] Apache Gora :: Compiler-CLI ........................ SUCCESS >>>>>>>> [ 0.032 s] >>>>>>>> [INFO] Apache Gora :: Shims Hadoop ........................ SUCCESS >>>>>>>> [ 0.543 s] >>>>>>>> [INFO] Apache Gora :: Shims Hadoop 1.x .................... SUCCESS >>>>>>>> [ 0.190 s] >>>>>>>> [INFO] Apache Gora :: Shims Hadoop 2.x .................... SUCCESS >>>>>>>> [ 0.295 s] >>>>>>>> [INFO] Apache Gora :: Shims Distribution .................. SUCCESS >>>>>>>> [ 0.026 s] >>>>>>>> [INFO] Apache Gora :: Core ................................ SUCCESS >>>>>>>> [ 0.806 s] >>>>>>>> [INFO] Apache Gora :: Accumulo ............................ FAILURE >>>>>>>> [ 0.120 s] >>>>>>>> [INFO] Apache Gora :: Cassandra ........................... SKIPPED >>>>>>>> [INFO] Apache Gora :: GoraCI .............................. SKIPPED >>>>>>>> [INFO] Apache Gora :: HBase ............................... SKIPPED >>>>>>>> [INFO] Apache Gora :: MongoDB ............................. SKIPPED >>>>>>>> [INFO] Apache Gora :: Solr ................................ SKIPPED >>>>>>>> [INFO] Apache Gora :: Tutorial ............................ SKIPPED >>>>>>>> [INFO] Apache Gora :: Sources-Dist ........................ SKIPPED >>>>>>>> [INFO] >>>>>>>> ------------------------------------------------------------------------ >>>>>>>> [INFO] BUILD FAILURE >>>>>>>> [INFO] >>>>>>>> ------------------------------------------------------------------------ >>>>>>>> [INFO] Total time: 6.359 s >>>>>>>> [INFO] Finished at: 2016-02-17T02:00:39-05:00 >>>>>>>> [INFO] Final Memory: 25M/61M >>>>>>>> [INFO] >>>>>>>> ------------------------------------------------------------------------ >>>>>>>> [ERROR] Failed to execute goal on project gora-accumulo: Could not >>>>>>>> resolve dependencies for project >>>>>>>> org.apache.gora:gora-accumulo:bundle:0.6.1: The following artifacts >>>>>>>> could >>>>>>>> not be resolved: org.apache.gora:gora-core:jar:0.6.1, >>>>>>>> org.apache.gora:gora-core:jar:tests:0.6.1, >>>>>>>> org.apache.accumulo:accumulo-core:jar:1.5.1, >>>>>>>> org.apache.accumulo:accumulo-minicluster:jar:1.5.1, >>>>>>>> jline:jline:jar:0.9.1, >>>>>>>> org.jboss.netty:netty:jar:3.2.2.Final, >>>>>>>> org.codehaus.jackson:jackson-jaxrs:jar:1.8.3, >>>>>>>> org.codehaus.jackson:jackson-xc:jar:1.8.3: Cannot access central ( >>>>>>>> https://repo.maven.apache.org/maven2) in offline mode and the >>>>>>>> artifact org.apache.gora:gora-core:jar:0.6.1 has not been downloaded >>>>>>>> from >>>>>>>> it before. -> [Help 1] >>>>>>>> [ERROR] >>>>>>>> >>>>>>>> >>>>>>> >>>>>> >>>>>> >>>>>> -- >>>>>> *Lewis* >>>>>> >>>>> >>>> >>> >>> >>> -- >>> *Lewis* >>> >> >> >