Re: Indexing Term Frequency Vectors

2013-04-09 Thread Adrien Grand
Hi, On Tue, Apr 9, 2013 at 5:24 PM, Sharon Tam wrote: > I tried following following this payloads tutorial to attach the term > frequencies as payloads: > http://searchhub.org/2009/08/05/getting-started-with-payloads/ > > But I'm confused as to where I need to override the term frequency counter

Re: Indexing Term Frequency Vectors

2013-04-02 Thread Adrien Grand
On Tue, Apr 2, 2013 at 4:10 PM, Sharon W Tam wrote: > Are there any other ideas? Since scoring seems to be what you are interested in, you could have a look to payloads: there can store arbitrary data and can be used to score matches. -- Adrien

Re: Indexing Term Frequency Vectors

2013-04-02 Thread Sharon W Tam
Thanks for your help, Adrien. But unfortunately, my term frequencies will be partial counts so they won't be integers, And finding a common denominator and scaling the rest of the frequencies accordingly will affect the relative lengths of the documents which will affect the Lucene scoring becaus

Re: Indexing Term Frequency Vectors

2013-03-28 Thread Adrien Grand
Hi, On Thu, Mar 28, 2013 at 8:25 PM, Sharon Tam wrote: > I believe that when Lucene indexes documents, it generates counts for a > term by counting how many times the term appears in a particular document. > Instead of having Lucene do the counting, I want to do my own counting and > feed a term-

Indexing Term Frequency Vectors

2013-03-28 Thread Sharon Tam
I believe that when Lucene indexes documents, it generates counts for a term by counting how many times the term appears in a particular document. Instead of having Lucene do the counting, I want to do my own counting and feed a term-frequency vector representation of a document directly into the i