The "ErrorMessagesInNutch2" page has been changed by LewisJohnMcgibbney:
http://wiki.apache.org/nutch/ErrorMessagesInNutch2?action=diff&rev1=8&rev2=9

  This page acts as a repository for error messages you might encounter while using Nutch 2.0. It will most likely be dynamic and fairly general in nature, because Nutch 2.0 can be combined with a variety of additional software projects, and each combination introduces potential errors that need to be considered both in Nutch and in the other projects.

  <<TableOfContents(3)>>
+
+ == gora-cassandra >0.2 InvalidRequestException(why:Keyspace webpage does not exist) ==
+
+ This error seems to occur when attempting to inject URLs into Cassandra after the server has been started and stopped repeatedly. Doing so may corrupt the 'webpage' keyspace and/or the data in the Cassandra data directory.
+ So far, the only known solution is to delete the Cassandra data directory and start again.
+ Note that this error is not commonly encountered.
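The workaround above (deleting the Cassandra data directory) can be sketched in Java as follows. This is a minimal, hedged illustration and not part of Nutch or Gora: the class name `CassandraDataReset` and the method `clearDirectory` are hypothetical, and the directory to clear must be supplied by the user, since its location depends on the local `cassandra.yaml` settings (for example `data_file_directories`). It assumes that Cassandra has been stopped first and that losing all stored data is acceptable.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Comparator;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical helper, not part of Nutch, Gora or Cassandra.
final class CassandraDataReset {

    // Deletes everything below root (root itself is kept) and returns
    // the number of files and directories that were removed.
    static int clearDirectory(Path root) throws IOException {
        if (!Files.isDirectory(root)) {
            return 0; // nothing to do if the directory does not exist
        }
        List<Path> entries;
        try (Stream<Path> walk = Files.walk(root)) {
            // Reverse order lists children before their parent directories,
            // so each directory is already empty when it is deleted.
            entries = walk.filter(p -> !p.equals(root))
                          .sorted(Comparator.reverseOrder())
                          .collect(Collectors.toList());
        }
        for (Path entry : entries) {
            Files.delete(entry);
        }
        return entries.size();
    }

    public static void main(String[] args) throws IOException {
        // ASSUMPTION: the caller passes the Cassandra data directory
        // explicitly; no default path is guessed here.
        if (args.length != 1) {
            System.err.println("usage: CassandraDataReset <cassandra-data-dir>");
            System.exit(1);
        }
        Path dataDir = Paths.get(args[0]);
        int removed = clearDirectory(dataDir);
        System.out.println("Removed " + removed + " entries under " + dataDir);
    }
}
```

After the directory has been cleared and Cassandra restarted, the inject step has to be run again, because all previously stored data is gone.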
+
+ {{{
+ 2013-02-10 16:32:23,796 WARN mapred.LocalJobRunner - job_local_0001
+ me.prettyprint.hector.api.exceptions.HInvalidRequestException: InvalidRequestException(why:Keyspace webpage does not exist)
+     at me.prettyprint.cassandra.connection.client.HThriftClient.getCassandra(HThriftClient.java:80)
+     at me.prettyprint.cassandra.connection.HConnectionManager.operateWithFailover(HConnectionManager.java:251)
+     at me.prettyprint.cassandra.model.ExecutingKeyspace.doExecuteOperation(ExecutingKeyspace.java:97)
+     at me.prettyprint.cassandra.model.MutatorImpl.execute(MutatorImpl.java:243)
+     at me.prettyprint.cassandra.model.MutatorImpl.insert(MutatorImpl.java:69)
+     at org.apache.gora.cassandra.store.HectorUtils.insertColumn(HectorUtils.java:47)
+     at org.apache.gora.cassandra.store.CassandraClient.addColumn(CassandraClient.java:169)
+     at org.apache.gora.cassandra.store.CassandraStore.addOrUpdateField(CassandraStore.java:341)
+     at org.apache.gora.cassandra.store.CassandraStore.flush(CassandraStore.java:228)
+     at org.apache.gora.cassandra.store.CassandraStore.close(CassandraStore.java:95)
+     at org.apache.gora.mapreduce.GoraRecordWriter.close(GoraRecordWriter.java:55)
+     at org.apache.hadoop.mapred.MapTask$NewDirectOutputCollector.close(MapTask.java:651)
+     at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:766)
+     at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370)
+     at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212)
+ Caused by: InvalidRequestException(why:Keyspace webpage does not exist)
+     at org.apache.cassandra.thrift.Cassandra$set_keyspace_result.read(Cassandra.java:4874)
+     at org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:78)
+     at org.apache.cassandra.thrift.Cassandra$Client.recv_set_keyspace(Cassandra.java:489)
+     at org.apache.cassandra.thrift.Cassandra$Client.set_keyspace(Cassandra.java:476)
+     at me.prettyprint.cassandra.connection.client.HThriftClient.getCassandra(HThriftClient.java:78)
+     ... 14 more
+ 2013-02-10 16:32:24,149 ERROR crawl.InjectorJob - InjectorJob: java.lang.RuntimeException: job failed: name=inject urls, jobid=job_local_0001
+     at org.apache.nutch.util.NutchJob.waitForCompletion(NutchJob.java:54)
+     at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:233)
+     at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:251)
+     at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:273)
+     at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
+     at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:282)
+ }}}

  == Nutch 2.1 + HBase 0.90.4 cluster settings - WARN zookeeper.ClientCnxn - Session 0x0 for server node1.xxxxxx.com/xxx.xxx.xxx.xxx:2181, unexpected error, closing socket connection and attempting reconnect java.io.IOException: Connection reset by peer ==