Mathijs Homminga wrote: > Hi Andrzej, > > Thanks for the tool! > > I found one 'map_xxxxxx' directory which matches the date my segment > was created. > It contains a 'part-0.out' file with a timestamp that matches the time > of the last entries in my log file (just before the process stopped). > > I followed the preparation steps and ran the tool. However, I got the > following error:
The SequenceFile has the following structure (approximately): first, three letter magic 'SEQ', and then either the fully qualified class names for key/value, or abbreviated classnames obtained from the mapping in WritableName. Please check what is this class name in part-0.out file - apparently Hadoop can't find the right mapping (you can send me the first bytes of this file off the list - use 'dd if=part-0.out of=data.out bs=512 count=1'). > By looking at the Hadoop sources I noticed that the FetcherOutput > class mentioned in this error message is determined by the > SequenceFile class and obtained from the sequence file itself. The class itself is not obtained from the file - it's loaded from the classpath. The thing that is missing here is the right name of the class as determined by the symbolic name inside the SequenceFile. -- Best regards, Andrzej Bialecki <>< ___. ___ ___ ___ _ _ __________________________________ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com ------------------------------------------------------------------------- Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys-and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
