Hey,

When updating the db with a certain segment, I get the following error:
050330 235110 Processing pagesByURL: Sorted 28711.666716283053 instructions/second
Exception in thread "main" java.io.IOException: key out of order: gopher://Gopher.wkap..l:70/11gopher_root%3A%5B_journal._jrnl.acbi%5D after gopher://Gopher.wkap.nl/11gopher_root%3a%5b_journal._jrnl.biph%5d
at org.apache.nutch.io.MapFile$Writer.checkKey(MapFile.java:128)
at org.apache.nutch.io.MapFile$Writer.append(MapFile.java:114)
at org.apache.nutch.db.WebDBWriter$PagesByURLProcessor.mergeEdits(WebDBWriter.java:635)
at org.apache.nutch.db.WebDBWriter$CloseProcessor.closeDown(WebDBWriter.java:557)
at org.apache.nutch.db.WebDBWriter.close(WebDBWriter.java:1544)
at org.apache.nutch.tools.UpdateDatabaseTool.close(UpdateDatabaseTool.java:318)
at org.apache.nutch.tools.UpdateDatabaseTool.main(UpdateDatabaseTool.java:368)

Has anyone seen this before? Is it a problem with by webdb or the segment? I tried this same segment on a different webdb and I got the same error (key out of order), but with different URLs referenced.


Luke


------------------------------------------------------- This SF.net email is sponsored by Demarc: A global provider of Threat Management Solutions. Download our HomeAdmin security software for free today! http://www.demarc.com/Info/Sentarus/hamr30 _______________________________________________ Nutch-developers mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to