As it is probably more likely that you're doing something incorrect than that Lucene is reporting incorrect results :), it might help if you reported the exact query that is being submitted to the IndexSearcher, and then showing us the document that was incorrectly returned. My guess is that either looking at the query itself will immediately reveal the problem to you, or that the query in combination with the document and knowledge of which analyzers you are using will reveal the problem-
Donna Jan <fajer...@informatik.hu-berlin.de> wrote on 11/17/2010 11:47:49 AM: > [image removed] > > uncorrect results > > Jan > > to: > > java-user > > 11/17/2010 11:51 AM > > Please respond to java-user > > Hi, > i have an assignment in my Text Analytics class. I am supposed to create > an index and search it. The corpus is a PubMed-like XML file. it is > possible to query terms (programcall a few terms) and phrases > (programcall "a phrase"). > When a phrase is queried the program should answer how often the phrase > occured. > The problem is, on certain queries the IndexSearcher returns some > documents that do not have that particular query in its fields. > I'd be delighted if someone could tell me what i am doing wrong. > See the source code at my github repo > https://github.com/jangingnicht/TextAnalytics2/tree/master/src/textanalytics2/ > > Thanks in advance > jan > > PS: I use Lucene 3.0.2 and the OpenJDK Runtime Environment (IcedTea6 > 1.8.2) on an 64 bit Linux machine. > [attachment "signature.asc" deleted by Donna L Gresh/Watson/IBM]