I believe that when Lucene indexes documents, it generates counts for a term by counting how many times the term appears in a particular document. Instead of having Lucene do the counting, I want to do my own counting and feed a term-frequency vector representation of a document directly into the indexer which will take my counts and proceed to do the other processing such as generating inverse document frequency. My term-frequencies may not all be integers. Is there a way to do this?
- Indexing Term Frequency Vectors Sharon Tam
- Re: Indexing Term Frequency Vectors Adrien Grand
- Re: Indexing Term Frequency Vectors Sharon W Tam
- Re: Indexing Term Frequency Vectors Adrien Grand
- Re: Indexing Term Frequency Vectors Adrien Grand