Allow setting of slop values for non-quote phrase queries on query-basic plugin
-------------------------------------------------------------------------------

                 Key: NUTCH-877
                 URL: https://issues.apache.org/jira/browse/NUTCH-877
             Project: Nutch
          Issue Type: Improvement
          Components: searcher
    Affects Versions: 1.2
         Environment: All
            Reporter: Dennis Kubes
            Assignee: Dennis Kubes
             Fix For: 1.2


Patch adds a configuration variable for setting slop values on phrase queries.  
The default slop value, which currently can't be changed through configuration, 
is Integer.MAX_VALUE.  It produces something like this, which doesn't seem 
right to me.  If you are searching for a phrase you usually want it within a 
certain distance:

2.9141337E-4 = weight(content:"my phrase"~2147483647 in 1029), product of:

    * 0.07163286 = queryWeight(content:"my phrase"~2147483647), product of:
          o 9.657982 = idf(content: my=13470 phrase=534)
          o 0.0074169594 = queryNorm

This patch adds the query.phrase.slop configuration value to the 
nutch-default.xml file.  It has a default setting of 5.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to