Blacklist hits have a higher priority.

Am 26.04.2006 um 14:22 schrieb TDLN:

yes, thank you. What happens if the url matches both lists. There is not
guarantee that it won't match both lists is there?

Rgrds, Thomas



On 4/26/06, Stefan Groschupf (JIRA) <[EMAIL PROTECTED]> wrote:

    [
http://issues.apache.org/jira/browse/NUTCH-249? page=comments#action_12376477]

Stefan Groschupf commented on NUTCH-249:
----------------------------------------

I mean the Class and method naming isn't very well.
Blacklist or blocklist? Whitelist or positivivelist?
Does this answer the question?

black- white list url filtering
-------------------------------

         Key: NUTCH-249
         URL: http://issues.apache.org/jira/browse/NUTCH-249
     Project: Nutch
        Type: Improvement

  Components: fetcher
    Versions: 0.8-dev
    Reporter: Stefan Groschupf
    Priority: Trivial
     Fix For: 0.8-dev
 Attachments: blackWhiteListV2.patch, blackWhiteListV3.patch

Existing url filter mechanisms need to process each url against each
filter pattern. For very large filter sets this may be does not scale very
well.

--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira



---------------------------------------------------------------
company:        http://www.media-style.com
forum:        http://www.text-mining.org
blog:            http://www.find23.net


Reply via email to