[ https://issues.apache.org/jira/browse/NUTCH-1090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Markus Jelsma resolved NUTCH-1090. ---------------------------------- Resolution: Fixed Committed for 1.4 in rev. 1202143. Thanks! > LinkDb (invertlinks) should inform the user when it ignores internal links > -------------------------------------------------------------------------- > > Key: NUTCH-1090 > URL: https://issues.apache.org/jira/browse/NUTCH-1090 > Project: Nutch > Issue Type: Improvement > Components: linkdb > Affects Versions: 1.3 > Reporter: Marek Bachmann > Assignee: Markus Jelsma > Priority: Trivial > Labels: configuration, information, log > Fix For: 1.5 > > Attachments: LinkDb.patch > > > I used nutch to crawl sites on a single domain. After the crawl was complete > I tried to build a LinkDb. The LinkDb was empty. > It comes up that this happens because the invertlinks command ignores > internal links to the same domain by default. > Unfortunately the LinkDb class doesn't tell anything about that. So it was > hard to find out why the LinkDb was empty. > I suggest to add an information for the user when the invertlinks command is > ignoring internal links. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira