[jira] Commented: (NUTCH-417) After upgrade to hadoop-0.9.1, parsing and indexing doesn't work.
[ http://issues.apache.org/jira/browse/NUTCH-417?page=comments#action_12459073 ] Sean Dean commented on NUTCH-417: - Speculative execution is now off by default with Hadoop 0.9.2 as per issue HADOOP-827. Since there was only two other fixes with that distribution, neither of which should effect Nutch in a bad way can that be updated in trunk? After upgrade to hadoop-0.9.1, parsing and indexing doesn't work. - Key: NUTCH-417 URL: http://issues.apache.org/jira/browse/NUTCH-417 Project: Nutch Issue Type: Bug Affects Versions: 0.9.0 Reporter: Dogacan Güney Attachments: index.patch If you parse while fetching then it is fine, but if you run parse as a different job, it creates an essentially empty parse_data directory(which has index files, but doesn't have data files). I am not sure why this is happening. Also, indexing fails at Indexer.OutputFormat.getRecordWriter. The parameter fs seems to be an instance of PhasedFileSystem which throws exceptions on delete and {start,complete}LocalOutput. -- This message is automatically generated by JIRA. - If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa - For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] Commented: (NUTCH-417) After upgrade to hadoop-0.9.1, parsing and indexing doesn't work.
[ http://issues.apache.org/jira/browse/NUTCH-417?page=comments#action_12459079 ] Andrzej Bialecki commented on NUTCH-417: - Yes, I'm going to do exactly this in a few days - I thought about putting this property in hadoop-site.xml in Nutch, but this way it's even better. After upgrade to hadoop-0.9.1, parsing and indexing doesn't work. - Key: NUTCH-417 URL: http://issues.apache.org/jira/browse/NUTCH-417 Project: Nutch Issue Type: Bug Affects Versions: 0.9.0 Reporter: Dogacan Güney Attachments: index.patch If you parse while fetching then it is fine, but if you run parse as a different job, it creates an essentially empty parse_data directory(which has index files, but doesn't have data files). I am not sure why this is happening. Also, indexing fails at Indexer.OutputFormat.getRecordWriter. The parameter fs seems to be an instance of PhasedFileSystem which throws exceptions on delete and {start,complete}LocalOutput. -- This message is automatically generated by JIRA. - If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa - For more information on JIRA, see: http://www.atlassian.com/software/jira