[ https://issues.apache.org/jira/browse/NUTCH-1636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13761888#comment-13761888 ]
lufeng commented on NUTCH-1636: ------------------------------- yes, this patch can solve the issue reported by lain. +1 > Indexer to normalize and filter repr URL > ---------------------------------------- > > Key: NUTCH-1636 > URL: https://issues.apache.org/jira/browse/NUTCH-1636 > Project: Nutch > Issue Type: Bug > Components: indexer > Affects Versions: 1.6, 1.7 > Reporter: Sebastian Nagel > Priority: Minor > Fix For: 1.8 > > Attachments: NUTCH-1636-1.patch > > > Indexer if used with option -normalize and/or -filter (cf. NUTCH-1300) should > also normalize and filter representation URLs. Otherwise URLs which are > target of a redirect, and have repr URL set (see URLUtil.chooseRepr) may show > up in index with an undesirable URL. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira