RE: Fetch content inside nutch parse

2012-11-30 Thread Markus Jelsma
See how the indexchecker fetches URL's: http://svn.apache.org/viewvc/nutch/trunk/src/java/org/apache/nutch/indexer/IndexingFiltersChecker.java?view=markup -Original message- From:Jorge Luis Betancourt Gonzalez jlbetanco...@uci.cu Sent: Fri 30-Nov-2012 16:46 To: user@nutch.apache.org

Re: Wrong ParseData in segment

2012-11-30 Thread Sebastian Nagel
Hi Markus, sounds somewhat similar to NUTCH-1252 but that was rather trivial and easy to reproduce. Sebastian 2012/11/30 Markus Jelsma markus.jel...@openindex.io: Hi, We've got an issue where one in a few thousand records partially contains another record's ParseMeta data. To be specific,