Hi!
It worked out for me! Installed and build sucessfully
nutch-2.3.1
Hbase-0.98.8-hadoop2
elasticsearch-2.1.0

But the problem comes out when I inject urls for crawling

InjectorJob: starting at 2016-02-03 10:15:57
InjectorJob: Injecting urlDir: seed/ursl.txt
Exception in thread "main" java.lang.NoClassDefFoundError: 
org/apache/hadoop/hbase/HBaseConfiguration
        at 
org.apache.gora.hbase.store.HBaseStore.initialize(HBaseStore.java:114)
        at 
org.apache.gora.store.DataStoreFactory.initializeDataStore(DataStoreFactory.java:102)
        at 
org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:161)
        at 
org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:135)
        at 
org.apache.nutch.storage.StorageUtils.createWebStore(StorageUtils.java:78)
        at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:218)
        at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:252)
        at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:275)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
        at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:284)
Caused by: java.lang.ClassNotFoundException: 
org.apache.hadoop.hbase.HBaseConfiguration
        at java.net.URLClassLoader$1.run(URLClassLoader.java:366)
        at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
        at java.security.AccessController.doPrivileged(Native Method)
        at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
        at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
        ... 10 more
Is this still a compatibility problem? Many thanks!!!
/Dan

followed this instruction for elasticsearch configuration
https://qbox.io/blog/scraping-the-web-with-nutch-for-elasticsearch
________________________________________
Från: Lewis John Mcgibbney [[email protected]]
Skickat: den 3 februari 2016 00:10
Till: [email protected]
Ämne: Re: configuration nutch with hbase and elasticserach

Hi Dan,

I would advise you to use a JDK 1.7.
Also, if you are just starting out then please follow this tutorial
http://wiki.apache.org/nutch/Nutch2Tutorial
If you have any issues then please let us know.
Thanks

On Fri, Jan 29, 2016 at 10:20 PM, <[email protected]> wrote:

> Hi Lewis,
> Many thanks for your answer. It is not easy for a beginner to download so
> many staffs.
> I am struggling with java and hadoop
> Now I have downloaded jdk1.8.0_71 from Oracle, it seems ok.

Reply via email to