[ https://issues.apache.org/jira/browse/NUTCH-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13570019#comment-13570019 ]
lufeng commented on NUTCH-1521: ------------------------------- Hi Lewis, I found the CrawlDbFilter class is only used in Nutch 1.x, not found in Nutch 2.x, so i think this patch is only applied to nutch 1.7. Thanks for your review. > CrawlDbFilter pass null url to urlNormailzers > --------------------------------------------- > > Key: NUTCH-1521 > URL: https://issues.apache.org/jira/browse/NUTCH-1521 > Project: Nutch > Issue Type: Bug > Affects Versions: 1.7 > Reporter: lufeng > Assignee: lufeng > Priority: Trivial > Fix For: 1.7, 2.2 > > Attachments: CrawlDbFilter_v1.patch, NUTCH-1521-trunk.patch, > TestCrawlDbFilter.java > > > urlNormalizers will get null url if we set CRAWLDB_PURGE_404, and it will > throw NullPointerException. and the WARN Log will output something like this > "Skipping null NullPointerException". -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira