Author: jnioche
Date: Thu Jan 5 11:05:43 2012
New Revision: 1227553

URL: http://svn.apache.org/viewvc?rev=1227553&view=rev
Log:
NUTCH-1146 Prevent generation of _SUCCESS files in output
Modified:
    nutch/trunk/CHANGES.txt
    nutch/trunk/conf/nutch-default.xml

Modified: nutch/trunk/CHANGES.txt
URL: http://svn.apache.org/viewvc/nutch/trunk/CHANGES.txt?rev=1227553&r1=1227552&r2=1227553&view=diff
==============================================================================
--- nutch/trunk/CHANGES.txt (original)
+++ nutch/trunk/CHANGES.txt Thu Jan 5 11:05:43 2012
@@ -1,5 +1,7 @@
 Nutch Change Log
 
+* NUTCH-1146 Prevent generation of _SUCCESS files in output (jnioche)
+
 * NUTCH-1232 Remove site field from index-basic (markus)
 
 * NUTCH-1239 Webgraph should remove deleted pages from segment input (markus)

Modified: nutch/trunk/conf/nutch-default.xml
URL: http://svn.apache.org/viewvc/nutch/trunk/conf/nutch-default.xml?rev=1227553&r1=1227552&r2=1227553&view=diff
==============================================================================
--- nutch/trunk/conf/nutch-default.xml (original)
+++ nutch/trunk/conf/nutch-default.xml Thu Jan 5 11:05:43 2012
@@ -1165,6 +1165,15 @@
   </description>
 </property>
 
+<property>
+  <name>mapreduce.fileoutputcommitter.marksuccessfuljobs</name>
+  <value>false</value>
+  <description>Hadoop >= 0.21 generates SUCCESS files in the output which can crash
+  the readers. This should not be an issue once Nutch is ported to the new MapReduce API
+  but for now this parameter should prevent such cases.
+  </description>
+</property>
+
 <!-- solr index properties -->
 <property>
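
Note: the new default can be overridden per installation. The fragment below is a minimal sketch, not part of this commit. It assumes the usual Nutch convention that conf/nutch-site.xml takes precedence over conf/nutch-default.xml, and it only makes sense for a setup whose readers tolerate the extra marker file. The property name is the one added above; the value "true" and the choice to re-enable the marker are illustrative assumptions.

```xml
<?xml version="1.0"?>
<!-- conf/nutch-site.xml: local overrides of nutch-default.xml (assumed convention) -->
<configuration>
  <property>
    <name>mapreduce.fileoutputcommitter.marksuccessfuljobs</name>
    <!-- Illustrative only: turns the success marker files back on -->
    <value>true</value>
  </property>
</configuration>
```

Leaving the property unset in nutch-site.xml keeps the value "false" introduced by this revision.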