Author: jnioche
Date: Thu Jan  5 11:05:43 2012
New Revision: 1227553

URL: http://svn.apache.org/viewvc?rev=1227553&view=rev
Log:
NUTCH-1146 Prevent generation of _SUCCESS files in output

Modified:
    nutch/trunk/CHANGES.txt
    nutch/trunk/conf/nutch-default.xml

Modified: nutch/trunk/CHANGES.txt
URL: 
http://svn.apache.org/viewvc/nutch/trunk/CHANGES.txt?rev=1227553&r1=1227552&r2=1227553&view=diff
==============================================================================
--- nutch/trunk/CHANGES.txt (original)
+++ nutch/trunk/CHANGES.txt Thu Jan  5 11:05:43 2012
@@ -1,5 +1,7 @@
 Nutch Change Log
 
+* NUTCH-1146 Prevent generation of _SUCCESS files in output (jnioche)
+
 * NUTCH-1232 Remove site field from index-basic (markus)
 
 * NUTCH-1239 Webgraph should remove deleted pages from segment input (markus)

Modified: nutch/trunk/conf/nutch-default.xml
URL: 
http://svn.apache.org/viewvc/nutch/trunk/conf/nutch-default.xml?rev=1227553&r1=1227552&r2=1227553&view=diff
==============================================================================
--- nutch/trunk/conf/nutch-default.xml (original)
+++ nutch/trunk/conf/nutch-default.xml Thu Jan  5 11:05:43 2012
@@ -1165,6 +1165,15 @@
   </description>
 </property>
 
+<property>
+  <name>mapreduce.fileoutputcommitter.marksuccessfuljobs</name>
+  <value>false</value>
+  <description>Hadoop >= 0.21 generates SUCCESS files in the output which can 
crash 
+  the readers. This should not be an issue once Nutch is ported to the new 
MapReduce API
+  but for now this parameter should prevent such cases.
+  </description>
+</property>
+
 <!-- solr index properties -->
 
 <property>


Reply via email to