I spoke too soon. Below are the errors I got when running mergesegs. This looks more like a Hadoop issue to me, but I will need to dig into it. It may also be something that I am doing on my end. This was a merge of three different crawls of 50K each. I don't know whether we want to delay the release or go ahead.

Dennis Kubes

java.lang.RuntimeException: java.lang.RuntimeException: java.lang.ClassNotFoundException: org.apache.nutch.metadata.MetaWrapper
        at org.apache.hadoop.conf.Configuration.getClass(Configuration.java:344)
        at org.apache.hadoop.mapred.JobConf.getOutputValueClass(JobConf.java:451)
        at org.apache.hadoop.mapred.JobConf.getMapOutputValueClass(JobConf.java:414)
        at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.<init>(MapTask.java:270)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:115)
        at org.apache.hadoop.mapred.TaskTracker$Child.main(TaskTracker.java:1445)
Caused by: java.lang.RuntimeException: java.lang.ClassNotFoundException: org.apache.nutch.metadata.MetaWrapper
        at org.apache.hadoop.conf.Configuration.getClass(Configuration.java:328)
        at org.apache.hadoop.conf.Configuration.getClass(Configuration.java:339)
        ... 5 more
Caused by: java.lang.ClassNotFoundException: org.apache.nutch.metadata.MetaWrapper
        at java.net.URLClassLoader$1.run(URLClassLoader.java:200)
        at java.security.AccessController.doPrivileged(Native Method)
        at java.net.URLClassLoader.findClass(URLClassLoader.java:188)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
        at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:268)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:251)
        at java.lang.ClassLoader.loadClassInternal(ClassLoader.java:319)
        at java.lang.Class.forName0(Native Method)
        at java.lang.Class.forName(Class.java:242)
        at org.apache.hadoop.conf.Configuration.getClassByName(Configuration.java:315)
        at org.apache.hadoop.conf.Configuration.getClass(Configuration.java:326)
        ... 6 more
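
The innermost frames show Configuration.getClassByName calling Class.forName, so the class loader in the task JVM could not find org.apache.nutch.metadata.MetaWrapper. The trace does not say why. A minimal sketch for checking whether a class is visible to a given class loader follows; ClassCheck and isLoadable are hypothetical names used only for illustration and are not part of Nutch or Hadoop.

```java
public class ClassCheck {

    /**
     * Returns true if the named class can be located by the given loader.
     * Hypothetical helper for illustration; not a Nutch or Hadoop API.
     */
    public static boolean isLoadable(String className, ClassLoader loader) {
        try {
            // initialize=false: only locate the class, do not run static initializers
            Class.forName(className, false, loader);
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Defaults to the class named in the stack trace; pass another name as an argument.
        String name = args.length > 0 ? args[0] : "org.apache.nutch.metadata.MetaWrapper";
        ClassLoader loader = Thread.currentThread().getContextClassLoader();
        System.out.println(name + " loadable: " + isLoadable(name, loader));
    }
}
```

Running this with the same classpath as the failing task would show whether the class is reachable there. It only reports visibility from the JVM it runs in, so a result from a local shell says nothing certain about the classpath of a remote task.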



Dennis Kubes wrote:
[X] +1 Release the packages as Apache Nutch 0.9
[ ] -1 Do not release the packages because...

I have been running some bigger crawls with the release this morning. Everything looks good.

Dennis Kubes

Chris Mattmann wrote:
Hi Folks,

I have posted a candidate for the Apache Nutch 0.9 release at

 http://people.apache.org/~mattmann/nutch_0.9/

See the included CHANGES-0.9.txt file for details on release
contents and latest changes. The release was made from the 0.9-dev trunk.

Please vote on releasing these packages as Apache Nutch 0.9.
The vote is open for the next 72 hours. Only votes from Nutch
committers are binding, but everyone is welcome to check the release
candidate and voice their approval or disapproval. The vote passes if
at least three binding +1 votes are cast.

[ ] +1 Release the packages as Apache Nutch 0.9
[ ] -1 Do not release the packages because...

Thanks!

Cheers,
  Chris

