Hi Markus,
Thanks a lot! Very helpful!
José-Marcio
On 06/29/2016 10:57 PM, Markus Jelsma wrote:
Hello Jose-Marcio - Hadoop parameters can also be specified in
nutch-(default|site).xml. They behave identically to the command-line -D
parameter switches.
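
As a rough sketch (not a tested configuration), two of the parameters from
your list could be set in conf/nutch-site.xml like this. The values shown are
only illustrative assumptions; pick whatever suits your cluster:

```xml
<?xml version="1.0"?>
<!-- conf/nutch-site.xml: illustrative fragment, values are assumptions -->
<configuration>
  <!-- Same effect as passing -D mapreduce.map.speculative=false -->
  <property>
    <name>mapreduce.map.speculative</name>
    <value>false</value>
  </property>
  <!-- Same effect as passing -D mapreduce.map.output.compress=true -->
  <property>
    <name>mapreduce.map.output.compress</name>
    <value>true</value>
  </property>
</configuration>
```

Keep in mind that, as far as I know, a value passed with -D on the command
line takes precedence over the one in the XML files, so a setting in
nutch-site.xml will probably only take effect if the matching -D option is
removed from bin/crawl.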
Markus
-----Original message-----
From:Jose-Marcio Martins da Cruz <jose-marcio.mart...@mines-paristech.fr>
Sent: Tuesday 28th June 2016 17:05
To: user@nutch.apache.org
Subject: Some Java parameters defined inside bin/crawl 1.12
Hello,
In bin/crawl there are some Java parameters defined which don't
appear in the nutch-default.xml file.
These parameters are:
mapreduce.task.skip.start.attempts
mapreduce.map.skip.maxrecords
mapreduce.job.reduces
mapred.child.java.opts
mapreduce.reduce.speculative
mapreduce.map.speculative
mapreduce.map.output.compress
Can these parameters be defined inside nutch-default.xml and nutch-site.xml?
Also, the version of this file distributed with release 1.12 still contains
a call to the class
__nutch_bin org.apache.nutch.scoring.webgraph.Loops
which was removed from Nutch 1.12.
Best regards
--
Sent from my typewriter.
---------------------------------------------------------------
Spam : Classement statistique de messages électroniques -
Une approche pragmatique
At Amazon.fr : http://amzn.to/LEscRu or http://bit.ly/SpamJM
---------------------------------------------------------------
Jose Marcio MARTINS DA CRUZ http://www.j-chkmail.org
Ecole des Mines de Paris http://bit.ly/SpamJM
60, bd Saint Michel 75272 - PARIS CEDEX 06