[ 
https://issues.apache.org/jira/browse/NUTCH-2218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael Joyce resolved NUTCH-2218.
----------------------------------
    Resolution: Fixed

[~lewismc], This got merged. I added an example to the option you raised as 
well. If that doesn't address your concerns let me know and I'll update in 
another ticket.

{code}
| -> ./bin/nutch crawlcomplete
usage: CrawlCompletionStats [-h] -inputDirs <inputDirs> -mode <mode>
       [-numReducers <numReducers>] -outputDir <outputDir>
 -h,--help                    Show this message
 -inputDirs <inputDirs>       Comma separated list of crawl directories
                              (e.g., "./crawl1,./crawl2")
 -mode <mode>                 Set statistics gathering mode (by 'host' or
                              by 'domain')
 -numReducers <numReducers>   Optional number of reduce jobs to use.
                              Defaults to 1
 -outputDir <outputDir>       Output directory where results should be
                              dumped
{code}

> Switch CrawlCompletion arg parsing to Commons CLI
> -------------------------------------------------
>
>                 Key: NUTCH-2218
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2218
>             Project: Nutch
>          Issue Type: Improvement
>          Components: util
>    Affects Versions: 1.11
>            Reporter: Michael Joyce
>            Assignee: Michael Joyce
>            Priority: Minor
>             Fix For: 1.12
>
>
> The current CrawlCompletion utility should be updated to use commons CLI 
> instead of doing manual arg parsing and checking.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to