reusing the term-frequency count while indexing

2011-10-23 Thread prasenjit mukherjee
I already have the term-frequency-count for all the terms in a document. Is there a way I can re-use that info while indexing? I would like to use Solr for this.

Re: reusing the term-frequency count while indexing

2011-10-23 Thread ppp c
Of course, it can be reused. But from my point of view, it's meaningless, since the analysis process still has to be performed to collect things such as prox, offset, or syno, payload and so on. On Sun, Oct 23, 2011 at 11:22 PM, prasenjit mukherjee wrote: > I already have the term-frequency-count for all the t

Re: reusing the term-frequency count while indexing

2011-10-23 Thread prasenjit mukherjee
Can you tell me how I can feed the Lucene index by using the term frequency directly? Actually, I am getting the documents along with their term frequencies and don't want to write any additional code to expand them. On 10/23/11, ppp c wrote: > Of course, it can be reused. > But from my point of v

Re: reusing the term-frequency count while indexing

2011-10-24 Thread Simon Willnauer
So you are saying you got (uniqueTerm, freq) tuples and you want to make Lucene use this directly? I think the easiest way is to write a simple TokenFilter that emits the term X times, where X is the term frequency. There is no easy way to pass these tuples to Lucene directly. simon On Mon, Oct 24,
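
A minimal sketch of the expansion suggested above, assuming the tuples are available as (term, freq) pairs. It does the repetition in plain Python before the text reaches the analyzer, rather than inside a Lucene TokenFilter; the function name `expand_term_freqs` is hypothetical, and the result is only meaningful if the field is analyzed with a whitespace-style tokenizer and the terms themselves contain no whitespace.

```python
from typing import Iterable, Tuple


def expand_term_freqs(pairs: Iterable[Tuple[str, int]]) -> str:
    """Repeat each term `freq` times and join the tokens with spaces.

    Hypothetical helper: the output is intended as the text of a field
    that a whitespace tokenizer will split back into the same tokens.
    """
    tokens = []
    for term, freq in pairs:
        if freq < 1:
            raise ValueError(f"frequency for {term!r} must be >= 1, got {freq}")
        # Emit the term once per counted occurrence.
        tokens.extend([term] * freq)
    return " ".join(tokens)


field_text = expand_term_freqs([("solr", 3), ("rocks", 2), ("apache", 1)])
print(field_text)
```

Token order and positions produced this way are artificial, so phrase and proximity queries on such a field would not be meaningful; only the per-term counts carry information.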

Re: reusing the term-frequency count while indexing

2011-10-24 Thread prasenjit mukherjee
That's exactly what I was trying to avoid :( I can afford to do that at indexing time, but it will be time-consuming to do that at search time. On Mon, Oct 24, 2011 at 1:05 PM, Simon Willnauer wrote: > so you are saying you got (uniqueTerm, freq) tuples and you want to > make lucene use this direc

Re: reusing the term-frequency count while indexing

2011-10-25 Thread Simon Willnauer
On Tue, Oct 25, 2011 at 5:08 AM, prasenjit mukherjee wrote: > That's exactly what I was trying to avoid :( > > I can afford to do that at indexing time, but it will be > time-consuming to do that at search time. Huh? I don't understand; if you provide the terms at indexing time, Lucene keeps track of

Re: reusing the term-frequency count while indexing

2011-10-25 Thread prasenjit mukherjee
On Tue, Oct 25, 2011 at 1:17 PM, Simon Willnauer wrote: > On Tue, Oct 25, 2011 at 5:08 AM, prasenjit mukherjee > wrote: >> That's exactly what I was trying to avoid :( >> >> I can afford to do that at indexing time, but it will be >> time-consuming to do that at search time. > > Huh? I don't underst

Re: reusing the term-frequency count while indexing

2011-10-25 Thread Rene Hackl-Sommer
Use term boosts? "solr^3 rocks^2 apache" http://lucene.apache.org/java/3_4_0/queryparsersyntax.html#Boosting%20a%20Term On 25.10.2011 11:19, prasenjit mukherjee wrote: During search time I get the following input (only for 1 field) = "solr:3 rocks:2 apache:1". For this I have to create the
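
A small sketch of how the "term:weight" input quoted above could be turned into the boosted query string from this reply. The helper name `to_boosted_query` is hypothetical, and the sketch assumes integer weights and terms that contain no query-parser special characters (no escaping is done).

```python
def to_boosted_query(weighted: str) -> str:
    """Convert 'solr:3 rocks:2 apache:1' into 'solr^3 rocks^2 apache'.

    Hypothetical helper: uses the term^boost syntax of the classic
    Lucene query parser and drops the boost when the weight is 1.
    """
    clauses = []
    for item in weighted.split():
        # Split on the last colon so the weight is always the final part.
        term, sep, weight = item.rpartition(":")
        if not sep:
            # No weight given: treat the whole item as a term with weight 1.
            term, weight = item, "1"
        boost = int(weight)
        clauses.append(term if boost == 1 else f"{term}^{boost}")
    return " ".join(clauses)


query = to_boosted_query("solr:3 rocks:2 apache:1")
print(query)
```

The resulting string can be passed to the query parser as-is, so the terms do not need to be repeated at search time.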

Re: reusing the term-frequency count while indexing

2011-10-25 Thread prasenjit mukherjee
Thanks, this is helpful. Is the effect (in ranking) going to be the same as passing multiple terms? I will definitely try it out. On Tue, Oct 25, 2011 at 3:21 PM, Rene Hackl-Sommer wrote: > Use term boosts? "solr^3 rocks^2 apache" > > http://lucene.apache.org/java/3_4_0/queryparsersyntax.html#Bo