Hi Kiran,

The command line arguments to the fetch command shown on wiki page [2]
doesn't seem to be in sync with what is implemented in [0] and [1].

For 1.x [0]
Usage: Fetcher <segment> [-threads n]

For 2.x [1]
Usage: FetcherJob (<batchId> | -all) [-crawlId <id>] [-threads N] [-resume]
[-numTasks N]

On wiki page [2]:
Usage: bin/nutch fetch <segment> [-threads n] [-noParsing]

I strongly feel that these params must be mentioned in the wiki page. Also,
people have been pondering over @user for the differences wrt 1.x and 2.x.
As the options are different for both these versions, providing usage for
both these versions would make things easy for users. What say ?

There were lot of updates for other wiki pages too which might also need
similar change.

[0]
http://svn.apache.org/viewvc/nutch/trunk/src/java/org/apache/nutch/fetcher/Fetcher.java?view=markup
[1]
http://svn.apache.org/viewvc/nutch/branches/2.x/src/java/org/apache/nutch/fetcher/FetcherJob.java?view=markup
[2] http://wiki.apache.org/nutch/bin/nutch%20fetch

Thanks,
Tejas

Reply via email to