Re: PayloadNearQuery and AveragePayloadFunction

2012-02-03 Thread shyama
Hi Peter, I have checked the payloads associated with terms, and they are fine in the index. I believe I was not clear enough. When I say I am interested in class A, the scorePayload function returns 3 only for class A terms. Again, when I say I am interested in class B, my scorePayload function returns

Re: PayloadNearQuery and AveragePayloadFunction

2012-02-03 Thread Peter Keegan
AveragePayloadFunction is just what it sounds like: return numPayloadsSeen > 0 ? (payloadScore / numPayloadsSeen) : 1; What values are you seeing returned from PayloadHelper.decodeFloat? Peter On Fri, Feb 3, 2012 at 4:13 AM, shyama shyamasree_s...@yahoo.com wrote: Hi Peter I have checked
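The averaging rule quoted above can be tried in isolation. The following is a minimal standalone sketch, not Lucene's class: the name AveragePayloadSketch, the accumulate helper, and the sample payload values are assumptions made for illustration. Only the docScore expression is taken from the message.

```java
// Standalone sketch of the averaging rule quoted in the message above.
// AveragePayloadSketch and accumulate() are hypothetical names, not Lucene API.
final class AveragePayloadSketch {

    /** Adds one more decoded payload value to the running sum. */
    static float accumulate(float runningSum, float payloadValue) {
        return runningSum + payloadValue;
    }

    /** Average of the payloads seen, or 1 when no payload was seen. */
    static float docScore(int numPayloadsSeen, float payloadScore) {
        return numPayloadsSeen > 0 ? (payloadScore / numPayloadsSeen) : 1;
    }

    public static void main(String[] args) {
        // Assumed sample values: two terms scored 3 and one term scored 1.
        float[] payloads = {3f, 3f, 1f};
        float sum = 0f;
        for (float p : payloads) {
            sum = accumulate(sum, p);
        }
        System.out.println("average payload factor: " + docScore(payloads.length, sum));
        System.out.println("no payloads seen:       " + docScore(0, 0f));
    }
}
```

Note the fallback of 1 when no payloads are seen: the factor then leaves the rest of the score unchanged instead of zeroing it.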

Performance improvements for fuzzy queries ?

2012-02-03 Thread Paul Taylor
Using Lucene 3.5, I created a query parser based on the dismax parser, but in order to get matches on misspellings etc. I additionally do a fuzzy search and a wildcard search

Re: Why read past EOF

2012-02-03 Thread Michael McCandless
Instead of .getVersion() you should use .getTimestamp()... version is not really a timestamp. (Though, really, you should store your own timestamp inside the commit userData, and retrieve that, instead... the getTimestamp API will be deprecated in 3.6.0). Also, you may need to implement onInit,

Re: PayloadNearQuery and AveragePayloadFunction

2012-02-03 Thread shyama
Hi Peter, thanks for your reply. I guess I found the problem. The scorePayload function is only called for query terms. The problem was that when I was retrieving payloads for each token in the token stream, it was returning misleading payloads due to the fact that I did not skip TermPositions that do not

Re: PayloadNearQuery and AveragePayloadFunction

2012-02-03 Thread Peter Keegan
All term queries, including payload queries, deal only with words from the query that exist in a document. They don't know what other terms are in a matching document, due to the inverted nature of the index. Peter On Fri, Feb 3, 2012 at 11:50 AM, shyama shyamasree_s...@yahoo.com wrote: Hi
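The point about the inverted index can be illustrated with a toy structure. This is a sketch under simple assumptions (whitespace tokenization, lowercasing, integer document IDs), and InvertedIndexSketch is a hypothetical class, not how Lucene stores postings. It shows that a lookup goes from a term to the documents containing it, and that nothing in the structure lists the other terms of a matching document.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.SortedSet;
import java.util.TreeSet;

// Toy inverted index: term -> sorted set of document IDs.
// Hypothetical illustration only; not Lucene's on-disk or in-memory format.
final class InvertedIndexSketch {
    private final Map<String, SortedSet<Integer>> postings = new HashMap<>();

    /** Splits the text on whitespace and records docId under each term. */
    void add(int docId, String text) {
        for (String term : text.toLowerCase(Locale.ROOT).split("\\s+")) {
            if (term.isEmpty()) {
                continue;
            }
            postings.computeIfAbsent(term, t -> new TreeSet<>()).add(docId);
        }
    }

    /** Documents containing the term; empty if the term was never indexed. */
    SortedSet<Integer> docsFor(String term) {
        return postings.getOrDefault(term.toLowerCase(Locale.ROOT),
                Collections.emptySortedSet());
    }

    public static void main(String[] args) {
        InvertedIndexSketch index = new InvertedIndexSketch();
        index.add(1, "quick brown fox");
        index.add(2, "lazy brown dog");
        // A term lookup yields document IDs only; to learn which other terms
        // document 1 holds, every postings list would have to be scanned.
        System.out.println(index.docsFor("brown"));
    }
}
```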

recording a universal ID from DocID in a CustomScoreQuery

2012-02-03 Thread Paul Allan Hill
My index does NOT have a simple UID; it uses the file PATH as the unique key. I was implementing a CustomScoreQuery which not only tweaked the score but also needed to record which documents had passed through this part of the overall rebuilt query, so that I could further mess with