Get the total term frequency vector of a specific field from the hit results

2007-04-10 Thread Sengly Heng
Hello all, I would like to extract the term frequency vector from the hit results as a single total vector, not per document. I have searched the mailing list and found that many have discussed this issue, but I still could not find the right solution. Everyone just suggested looking at getTermFreqVe

Re: Get the total term frequency vector of a specific field from the hit results

2007-04-10 Thread thomas arni
Hello Sengly, First of all, you have to make sure that the new Fields you add to a Document are created with the appropriate constructor. You have to specify the usage of term vectors (Field.TermVector.YES): new Field("text", "your text...", Field.Store.YES, Field.Index.TOKENIZED, Field.Ter
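
A minimal sketch of the aggregation asked about in this thread, assuming each hit's per-document term vector has already been fetched. In Lucene 2.x that would typically come from IndexReader.getTermFreqVector(docNumber, field), whose TermFreqVector exposes getTerms() and getTermFrequencies(). The DocTermVector class and the sum method are hypothetical names used only for illustration; plain Java collections stand in for the Lucene types so the example runs without Lucene on the classpath.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

class TotalTermFrequency {

    /** Hypothetical stand-in for one document's term vector on a single field. */
    static final class DocTermVector {
        final String[] terms; // distinct terms in the field
        final int[] freqs;    // freqs[i] is the frequency of terms[i] in this document

        DocTermVector(String[] terms, int[] freqs) {
            if (terms.length != freqs.length) {
                throw new IllegalArgumentException("terms and freqs must have equal length");
            }
            this.terms = terms;
            this.freqs = freqs;
        }
    }

    /** Sums the per-document vectors of all hits into one total term -> frequency map. */
    static Map<String, Integer> sum(List<DocTermVector> hitVectors) {
        Map<String, Integer> total = new TreeMap<>(); // sorted by term for stable output
        for (DocTermVector v : hitVectors) {
            if (v == null) {
                continue; // assumption: a document without a stored term vector yields null
            }
            for (int i = 0; i < v.terms.length; i++) {
                total.merge(v.terms[i], v.freqs[i], Integer::sum);
            }
        }
        return total;
    }

    public static void main(String[] args) {
        List<DocTermVector> hits = Arrays.asList(
            new DocTermVector(new String[] {"lucene", "term"}, new int[] {2, 1}),
            new DocTermVector(new String[] {"term", "vector"}, new int[] {3, 1}));
        System.out.println(sum(hits)); // prints {lucene=2, term=4, vector=1}
    }
}
```

The cost is one pass over every term of every hit, which is the computational concern raised later in the thread for large documents.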

Re: Get the total term frequency vector of a specific field from the hit results

2007-04-10 Thread karl wettin
On 10 Apr 2007, at 16:58, Sengly Heng wrote: I wanted to do it this way as well, but I am a bit worried about computational time, as I have many documents and each document is rather large. I am looking for more solutions. We don't really know what your problem is. Explaining that rather than

Re: Get the total term frequency vector of a specific field from the hit results

2007-04-10 Thread Sengly Heng
Thanks so much, Thomas, for your prompt reply. First of all, you have to make sure that the new Fields you add to a Document are created with the appropriate constructor. You have to specify the usage of term vectors (Field.TermVector.YES): new Field("text", "your text...", Field.Store.YES,

Re: Get the total term frequency vector of a specific field from the hit results

2007-04-10 Thread Sengly Heng
Dear Karl, Thank you for taking the time to look at my problem. We don't really know what your problem is. Explaining that rather than the solution you have thought of might render a couple of alternate solutions. Perhaps something could be precalculated and stored in the documents. Perhaps feature

Re: Get the total term frequency vector of a specific field from the hit results

2007-04-10 Thread karl wettin
On 10 Apr 2007, at 17:48, Sengly Heng wrote: We don't really know what your problem is. Explaining that rather than the solution you have thought of might render a couple of alternate solutions. Perhaps something could be precalculated and stored in the documents. Perhaps feature selection (reduct

Re: Get the total term frequency vector of a specific field from the hit results

2007-04-10 Thread Sengly Heng
Once again, thank you for your help. >> We don't really know what your problem is. Explaining that rather >> than the solution you have thought of might render a couple of >> alternate solutions. Perhaps something could be precalculated and >> stored in the documents. Perhaps feature selection

Re: Get the total term frequency vector of a specific field from the hit results

2007-04-10 Thread Grant Ingersoll
Would some sort of caching strategy work? How big is your overall collection? Also, lately there have been a few threads on TV (term vector) performance. I don't recall anyone having actively profiled or examined it for improvements, so perhaps that would be helpful. Another thought: co
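
One possible shape for the caching idea raised here, sketched under assumptions: a small least-recently-used cache of per-document term frequency maps, so documents that recur across result sets are not re-read from the index. The TermVectorCache class, its loader callback and the loads counter are hypothetical; in a real setup the loader would wrap whatever call fetches the term vector for a document number.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.function.IntFunction;

class TermVectorCache {
    private final IntFunction<Map<String, Integer>> loader; // hypothetical fetch-from-index callback
    private final LinkedHashMap<Integer, Map<String, Integer>> lru;
    int loads = 0; // number of cache misses, kept only to make the behaviour observable

    TermVectorCache(final int capacity, IntFunction<Map<String, Integer>> loader) {
        this.loader = loader;
        // accessOrder = true makes iteration order least-recently-used first
        this.lru = new LinkedHashMap<Integer, Map<String, Integer>>(16, 0.75f, true) {
            @Override
            protected boolean removeEldestEntry(Map.Entry<Integer, Map<String, Integer>> eldest) {
                return size() > capacity; // evict once the cache grows past capacity
            }
        };
    }

    /** Returns the cached vector for docId, loading and caching it on a miss. */
    Map<String, Integer> get(int docId) {
        Map<String, Integer> vector = lru.get(docId);
        if (vector == null) {
            vector = loader.apply(docId);
            loads++;
            lru.put(docId, vector);
        }
        return vector;
    }
}
```

Whether this helps depends on how much the hit sets of successive queries overlap and on how large the overall collection is, which is the question posed in the message.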

Re: Get the total term frequency vector of a specific field from the hit results

2007-04-11 Thread karl wettin
On 11 Apr 2007, at 04:21, Grant Ingersoll wrote: Would some sort of caching strategy work? How big is your overall collection? Also, lately there have been a few threads on TV (term vector) performance. I don't recall anyone having actively profiled or examined it for improvements, so perh

Re: Get the total term frequency vector of a specific field from the hit results

2007-04-11 Thread Grant Ingersoll
On Apr 11, 2007, at 9:07 AM, karl wettin wrote: On 11 Apr 2007, at 04:21, Grant Ingersoll wrote: Would some sort of caching strategy work? How big is your overall collection? Also, lately there have been a few threads on TV (term vector) performance. I don't recall anyone having actively