[ http://issues.apache.org/jira/browse/NUTCH-249?page=all ]
Stefan Groschupf updated NUTCH-249: ----------------------------------- Attachment: blackWhiteListV3.patch A new patch that fix an bug where to less urls passed the filter. > black- white list url filtering > ------------------------------- > > Key: NUTCH-249 > URL: http://issues.apache.org/jira/browse/NUTCH-249 > Project: Nutch > Type: Improvement > Components: fetcher > Versions: 0.8-dev > Reporter: Stefan Groschupf > Priority: Trivial > Fix For: 0.8-dev > Attachments: blackWhiteListV2.patch, blackWhiteListV3.patch > > Existing url filter mechanisms need to process each url against each filter > pattern. For very large filter sets this may be does not scale very well. -- This message is automatically generated by JIRA. - If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa - For more information on JIRA, see: http://www.atlassian.com/software/jira