[ https://issues.apache.org/jira/browse/NUTCH-1719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13884548#comment-13884548 ]
Lewis John McGibbney commented on NUTCH-1719: --------------------------------------------- +1, anyone else have comments? > DomainStatistics fails in 2.x because URL is not unreversed > ----------------------------------------------------------- > > Key: NUTCH-1719 > URL: https://issues.apache.org/jira/browse/NUTCH-1719 > Project: Nutch > Issue Type: Bug > Reporter: Gerhard Gossen > Fix For: 2.3 > > Attachments: domainstats.patch > > > With Nutch 2.x, {{org.apache.nutch.util.domain.DomainStatistics}} always > returns the counts only for {{FETCHED}}/{{NOT_FETCHED}}. The reason is that > the mapper tries to create a java.net.URL directly from the row key without > unreversing it first and silently ignores the thrown exception. > The attached patch calls TableUtil.unreverseUtil first. In my test (against > current 2.x-trunk) it produces correct results. -- This message was sent by Atlassian JIRA (v6.1.5#6160)