Hello all,
I would like to extract the term freq vector from the hit results as a total
vector not by document.
I have searched the mailing and I found many have talked about this issue
but I still could not find the right solution to this matter. Everyone just
suggested to look at getTermFreqVe
Hello Sengly
First of all you have to make sure, that you create new Fields, which
you add to a Document, with the appropriate constructor. You have to
specify the usage of term vectors (Field.TermVector.YES):
new Field("text", "your text...", Field.Store.YES,
Field.Index.TOKENIZED,Field.Ter
10 apr 2007 kl. 16.58 skrev Sengly Heng:
I wanted to do this way as well but I am a bit worrying about
computational
time as I have many documents and each document is a bit large.
I am looking for more solutions.
We don't really know what your problem is. Explaining that rathern
than
Thanks so much Thomas for your prompt reply.
First of all you have to make sure, that you create new Fields, which
you add to a Document, with the appropriate constructor. You have to
specify the usage of term vectors (Field.TermVector.YES):
new Field("text", "your text...", Field.Store.YES,
Dear Karl,
Thank you for taking your time in my problem.
We don't really know what your problem is. Explaining that rathern
than the solution you have thought of might render a couple of
alternate solutions. Perhaps something could be precalculated and
stored in the documents. Perhaps feature
10 apr 2007 kl. 17.48 skrev Sengly Heng:
We don't really know what your problem is. Explaining that rathern
than the solution you have thought of might render a couple of
alternate solutions. Perhaps something could be precalculated and
stored in the documents. Perhaps feature selection (reduct
Once again, thank you for your help.
>> We don't really know what your problem is. Explaining that rathern
>> than the solution you have thought of might render a couple of
>> alternate solutions. Perhaps something could be precalculated and
>> stored in the documents. Perhaps feature selection
Would some sort of caching strategy work? How big is your overall
collection?
Also, lately there have been a few threads on TV (term vector)
performance. I don't recall anyone having actively profiled or
examined it for improvements, so perhaps that would be helpful.
Another thought: co
11 apr 2007 kl. 04.21 skrev Grant Ingersoll:
Would some sort of caching strategy work? How big is your overall
collection?
Also, lately there have been a few threads on TV (term vector)
performance. I don't recall anyone having actively profiled or
examined it for improvements, so perh
On Apr 11, 2007, at 9:07 AM, karl wettin wrote:
11 apr 2007 kl. 04.21 skrev Grant Ingersoll:
Would some sort of caching strategy work? How big is your overall
collection?
Also, lately there have been a few threads on TV (term vector)
performance. I don't recall anyone having actively
10 matches
Mail list logo