Re: [jira] Commented: (LUCENE-965) Implement a state-of-the-art retrieval function in Lucene

Grant Ingersoll Thu, 11 Dec 2008 05:37:02 -0800

I don't think the original authors have followed up on this patch atall since first posting.


On Nov 27, 2008, at 6:44 AM, Ian Holsman (JIRA) wrote:

[ https://issues.apache.org/jira/browse/LUCENE-965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12651332#action_12651332 ]
Ian Holsman commented on LUCENE-965:
------------------------------------
It's a bit late over here, but when I try to apply the patch itdoesn't seem to have the AXSimilarity class in it.is there a file missing here, or should i not be looking at applyingpatches late at night?
Implement a state-of-the-art retrieval function in Lucene
---------------------------------------------------------

               Key: LUCENE-965
               URL: https://issues.apache.org/jira/browse/LUCENE-965
           Project: Lucene - Java
        Issue Type: Improvement
        Components: Search
  Affects Versions: 2.2
          Reporter: Hui Fang
           Fix For: 3.0

       Attachments: axiomaticFunction.patch
We implemented the axiomatic retrieval function, which is a state-of-the-art retrieval function, toreplace the default similarity function in Lucene. We compared theperformance of these two functions and reported the results at http://sifaka.cs.uiuc.edu/hfang/lucene/Lucene_exp.pdf.The report shows that the performance of the axiomatic retrievalfunction is much better than the default function. The axiomaticretrieval function is able to find more relevant documents andusers can see more relevant documents in the top-ranked documents.Incorporating such a state-of-the-art retrieval function couldimprove the search performance of all the applications which werebuilt upon Lucene.Most changes related to the implementation are made inAXSimilarity, TermScorer and TermQuery.java. However, many testcases are hand coded to test whether the implementation of thedefault function is correct. Thus, I also made the modification tomany test files to make the new retrieval function pass thosecases. In fact, we found that some old test cases are notreasonable. For example, in the testQueries02 of TestBoolean2.java,the query is "+w3 xx", and we have two documents "w1 xx w2 yy w3"and "w1 w3 xx w2 yy w3".The second document should be more relevant than the first one,because it has moreoccurrences of the query term "w3". But the original test casewould require us to rankthe first document higher than the second one, which is notreasonable.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


--------------------------
Grant Ingersoll

Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ











---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: [jira] Commented: (LUCENE-965) Implement a state-of-the-art retrieval function in Lucene

Reply via email to