[ https://issues.apache.org/jira/browse/NUTCH-1672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13869543#comment-13869543 ]
Hudson commented on NUTCH-1672: ------------------------------- SUCCESS: Integrated in Nutch-nutchgora #884 (See [https://builds.apache.org/job/Nutch-nutchgora/884/]) NUTCH-1672 Inlinks are added twice in DbUpdateReducer (lewismc: http://svn.apache.org/viewvc/nutch/branches/2.x/?view=rev&rev=1557707) * /nutch/branches/2.x/CHANGES.txt * /nutch/branches/2.x/src/java/org/apache/nutch/crawl/DbUpdateReducer.java > Inlinks are added twice in DbUpdateReducer > ------------------------------------------ > > Key: NUTCH-1672 > URL: https://issues.apache.org/jira/browse/NUTCH-1672 > Project: Nutch > Issue Type: Bug > Affects Versions: 2.2.1 > Reporter: Tien Nguyen Manh > Priority: Minor > Fix For: 2.3 > > Attachments: NUTCH-1672.patch > > > The first for loop is redundant > for (ScoreDatum inlink : inlinkedScoreData) { > page.putToInlinks(new Utf8(inlink.getUrl()), new > Utf8(inlink.getAnchor())); > } > ... > for (ScoreDatum inlink : inlinkedScoreData) { > int inlinkDist = inlink.getDistance(); > if (inlinkDist < smallestDist) { > smallestDist=inlinkDist; > } > page.putToInlinks(new Utf8(inlink.getUrl()), new > Utf8(inlink.getAnchor())); > } -- This message was sent by Atlassian JIRA (v6.1.5#6160)