Hi, all,

I am trying to build the 2.0 rc3, but can't make it work.  I strictly
follow the wiki page(http://wiki.apache.org/nutch/Nutch2Tutorial).

Before that, I also ensure that  the hbase works well, as:

hbase(main):004:0> create 'test1', 'cf'
0 row(s) in 1.3080 seconds

The following is what I did and the error I got, Hope you can give me
suggestion where I were doing wrong.

1. checkout the code
svn co http://svn.apache.org/repos/asf/nutch/tags/release-2.0rc3

2. modify the nutch-default.xml, gora.properties and ivy/ivy.xml as
the wiki said.

3. build the code: ant

4. test the code as:
tianwei@132:~/nutch-src/release-2.0rc3/runtime/local$ ./bin/nutch
inject urls/urls.txt
InjectorJob: starting
InjectorJob: urlDir: urls/urls.txt
InjectorJob: org.apache.gora.util.GoraException:
org.apache.hadoop.hbase.ZooKeeperConnectionException: HBase is able to
connect to ZooKeeper but the connection closes immediately. This could
be a sign that the server has too many connections (30 is the
default). Consider inspecting your ZK server logs for that error and
then make sure you are reusing HBaseConfiguration as often as you can.
See HTable's javadoc for more information.
        at 
org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:167)
        at 
org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:135)
        at 
org.apache.nutch.storage.StorageUtils.createWebStore(StorageUtils.java:69)
        at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:243)
        at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:268)
        at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:288)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
        at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:298)
Caused by: org.apache.hadoop.hbase.ZooKeeperConnectionException: HBase
is able to connect to ZooKeeper but the connection closes immediately.
This could be a sign that the server has too many connections (30 is
the default). Consider inspecting your ZK server logs for that error
and then make sure you are reusing HBaseConfiguration as often as you
can. See HTable's javadoc for more information.
        at 
org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.<init>(ZooKeeperWatcher.java:155)
...........


I also tried to switch to use mysql, but also met the IO connection
exception. I guess there must be something wrong with my setting.
Could you give some suggestions to diagnose and solve this problem?

PS, my hbase version is hbase-0.90.5, and in nutch's lib/ directory,
there is "hbase-0.90.4", I don't know if it matters or not.  The hbase
is installed by user "hadoop" and I ran nutch with another user
"tianwei", don't know if I need to add something into CLASSPATH or
not?

Thanks very much.


Tianwei

Reply via email to