Blacklist hits have a higher priority.
Am 26.04.2006 um 14:22 schrieb TDLN:
yes, thank you. What happens if the url matches both lists. There
is not
guarantee that it won't match both lists is there?
Rgrds, Thomas
On 4/26/06, Stefan Groschupf (JIRA) <[EMAIL PROTECTED]> wrote:
[
http://issues.apache.org/jira/browse/NUTCH-249?
page=comments#action_12376477]
Stefan Groschupf commented on NUTCH-249:
----------------------------------------
I mean the Class and method naming isn't very well.
Blacklist or blocklist? Whitelist or positivivelist?
Does this answer the question?
black- white list url filtering
-------------------------------
Key: NUTCH-249
URL: http://issues.apache.org/jira/browse/NUTCH-249
Project: Nutch
Type: Improvement
Components: fetcher
Versions: 0.8-dev
Reporter: Stefan Groschupf
Priority: Trivial
Fix For: 0.8-dev
Attachments: blackWhiteListV2.patch, blackWhiteListV3.patch
Existing url filter mechanisms need to process each url against each
filter pattern. For very large filter sets this may be does not
scale very
well.
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the
administrators:
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
http://www.atlassian.com/software/jira
---------------------------------------------------------------
company: http://www.media-style.com
forum: http://www.text-mining.org
blog: http://www.find23.net