Hi,
I'm running Nutch 0.9 on a Fedora Core 7 i386 machine (actually a
VMware virtual machine) for testing.
When I try to fetch a single URL ("http://www.ynet.co.il"), it takes ages
and then throws the following:
[EMAIL PROTECTED] nutch-0.9]$ bin/nutch inject crawl/crawldb SMALL
Injector: starting
Injector: crawlDb: crawl/crawldb
Injector: urlDir: SMALL
Injector: Converting injected urls to crawl db entries.
Injector: java.lang.IllegalStateException
at java.nio.charset.CharsetEncoder.encode(libgcj.so.8rh)
at org.apache.hadoop.io.Text.encode(Text.java:375)
at org.apache.hadoop.io.Text.encode(Text.java:356)
at org.apache.hadoop.io.Text.writeString(Text.java:396)
at org.apache.hadoop.mapred.JobClient$RawSplit.write(JobClient.java:428)
at org.apache.hadoop.mapred.JobClient.writeSplitsFile(JobClient.java:457)
at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:358)
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:543)
at org.apache.nutch.crawl.Injector.inject(Injector.java:162)
at org.apache.nutch.crawl.Injector.run(Injector.java:192)
at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
at org.apache.nutch.crawl.Injector.main(Injector.java:182)
Java version:
[EMAIL PROTECTED] nutch-0.9]$ java -version
java version "1.5.0"
gij (GNU libgcj) version 4.1.2 20070502 (Red Hat 4.1.2-12)
The Ant build was successful.
Can anyone help?
--
Eyal Edri