[ https://issues.apache.org/jira/browse/NUTCH-1672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Lewis John McGibbney resolved NUTCH-1672. ----------------------------------------- Resolution: Fixed Committed @revision 1557707 in 2.x HEAD. Thank you [~tiennm] for your contributions. > Inlinks are added twice in DbUpdateReducer > ------------------------------------------ > > Key: NUTCH-1672 > URL: https://issues.apache.org/jira/browse/NUTCH-1672 > Project: Nutch > Issue Type: Bug > Affects Versions: 2.2.1 > Reporter: Tien Nguyen Manh > Priority: Minor > Fix For: 2.3 > > Attachments: NUTCH-1672.patch > > > The first for loop is redundant > for (ScoreDatum inlink : inlinkedScoreData) { > page.putToInlinks(new Utf8(inlink.getUrl()), new > Utf8(inlink.getAnchor())); > } > ... > for (ScoreDatum inlink : inlinkedScoreData) { > int inlinkDist = inlink.getDistance(); > if (inlinkDist < smallestDist) { > smallestDist=inlinkDist; > } > page.putToInlinks(new Utf8(inlink.getUrl()), new > Utf8(inlink.getAnchor())); > } -- This message was sent by Atlassian JIRA (v6.1.5#6160)