[ https://issues.apache.org/jira/browse/NUTCH-2178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Markus Jelsma updated NUTCH-2178:
---------------------------------
    Summary: DeduplicationJob to optionally group on host or domain  (was: DeduplicationJob to optionall group on host or domain)

> DeduplicationJob to optionally group on host or domain
> ------------------------------------------------------
>
>                 Key: NUTCH-2178
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2178
>             Project: Nutch
>          Issue Type: Improvement
>    Affects Versions: 1.10
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>             Fix For: 1.12
>
>         Attachments: NUTCH-2178.patch
>
>
> Add optional grouping to DeduplicationJob.
> Usage: DeduplicationJob <crawldb> [-group <none|host|domain>]



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)