Hi there,

I ran into an issue running "nutch/bin index...". I checked out the latest Nutch
from SVN, so I am running nutch-08.

I searched the archived emails and found one saying that "index-basic" must be
in the index configuration XML. I checked my config, and it is already included.

1.
In one case, the indexing log shows:
"
Indexing [http://calendar.ufl.edu/] with analyzer
[EMAIL PROTECTED] (null)
"

However, indexing still finished and searching worked afterwards, which seems a
bit odd.

2.
In another crawl, I indexed multiple segments and ran into a fatal error:

"
Indexer: starting
Indexer: linkdb: crawl/linkdb
Indexer: adding segment: crawl/segments/20060807202736
Indexer: adding segment: crawl/segments/20060807202824
Optimizing index.
Exception in thread "main" java.io.IOException: Job failed!
    at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:357)
    at org.apache.nutch.indexer.Indexer.index(Indexer.java:296)
    at org.apache.nutch.indexer.Indexer.main(Indexer.java:313)
"

What could be causing this error?

3.
Is the downloadable nutch-08 release package more stable than the version
checked out from SVN?

Thanks for your time,

Feng Ji