Hi guys,
I'm new to the mailing list therefore I apologize if this posting is not
meant to be in here.
I have been working with Nutch now for quite awhile. I am currently building
a tool that can read and retrieve Nutch CrawlDb information. What am I here
if I'm talking about Nutch? As I understand Nutch uses the Hadoop file
system and many of its API. The CrawlDb is actually an hadoop MapFile
object. I understand that the hadoop filesystem doesn't support update as
it's only write-once.
My problem is when I run my program I get the following errors:
07/01/23 18:33:56 INFO conf.Configuration: parsing
jar:file:/C:/nutch-0.8.2-dev/lib/hadoop-0.4.0-patched.jar!/hadoop-default.xm
l
07/01/23 18:33:56 INFO conf.Configuration: parsing
jar:file:/C:/nutch-0.8.2-dev/nutch-0.8.2-dev.jar!/nutch-default.xml
07/01/23 18:33:56 INFO conf.Configuration: parsing
jar:file:/C:/nutch-0.8.2-dev/nutch-0.8.2-dev.jar!/nutch-site.xml
Exception in thread "main" java.lang.ArithmeticException: / by zero
at
org.apache.hadoop.mapred.lib.HashPartitioner.getPartition(HashPartitioner.ja
va:33)
at
org.apache.hadoop.mapred.MapFileOutputFormat.getEntry(MapFileOutputFormat.ja
va:88)
at org.apache.nutch.crawl.CrawlDbReader.get(CrawlDbReader.java:321)
at
com.idna.indexer.resource.SystemMonitor.main(SystemMonitor.java:188)
I'll be looking forward to your helping reply.
Best regards,
Armel Nene
-------------------------------------------------
Armel T. Nene
iDNA Solutions
Tel: +44 (207) 257 6124
Mobile: +44 (788) 695 0483
<http://blog.idna-solutions.com/> http://blog.idna-solutions.com