Have you considered using bi-grams and tri-grams? It might be useful to
index with NgramFilter and then search the text for N-grams.
You could also count the number of times a particular document contains
"Car Insurance Rate" for term frequency, etc.
-Hemant
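
To make the suggestion above concrete, here is a minimal sketch in plain Java (no Lucene dependency) of building word bi-grams and tri-grams and counting how often a phrase such as "Car Insurance Rate" occurs in a document. The class and method names (NgramCounter, tokenize, wordNgrams, phraseFrequency) are hypothetical and only for illustration; a real index would leave tokenization and N-gram generation to the analyzer chain, and the simple lowercase, split-on-punctuation tokenizer here is an assumption.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Illustrative only: this is not a Lucene class.
final class NgramCounter {

    // Lowercase the text and split it on runs of non-letter, non-digit characters.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    // Build word N-grams (n = 2 gives bi-grams, n = 3 gives tri-grams).
    // N-grams are formed across sentence boundaries, since punctuation is dropped.
    static List<String> wordNgrams(String text, int n) {
        if (n < 1) {
            throw new IllegalArgumentException("n must be >= 1");
        }
        List<String> tokens = tokenize(text);
        List<String> grams = new ArrayList<>();
        for (int i = 0; i + n <= tokens.size(); i++) {
            grams.add(String.join(" ", tokens.subList(i, i + n)));
        }
        return grams;
    }

    // Count how many times the phrase occurs in the text as a run of consecutive words.
    static int phraseFrequency(String text, String phrase) {
        List<String> phraseTokens = tokenize(phrase);
        if (phraseTokens.isEmpty()) {
            return 0;
        }
        String target = String.join(" ", phraseTokens);
        int count = 0;
        for (String gram : wordNgrams(text, phraseTokens.size())) {
            if (gram.equals(target)) {
                count++;
            }
        }
        return count;
    }

    public static void main(String[] args) {
        String doc = "Compare your car insurance rate today. "
                   + "A lower car insurance rate saves money.";
        System.out.println(wordNgrams("car insurance rate", 2)); // [car insurance, insurance rate]
        System.out.println(phraseFrequency(doc, "Car Insurance Rate")); // 2
    }
}
```

The count returned by phraseFrequency is the raw term frequency of the phrase in one document; how it is weighted against other documents is left open here.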
Hi,
I have looked at MoreLikeThis functionality. I would like to
add moreDisLikeThis functionality as well. It is important for me to
learn from similarity as well as dissimilarity with other documents. I
have done the basic groundwork of forming two queries (one with
MoreLikeThis c
You have to raise the limit with bq.setMaxClauseCount, because the default
maximum number of clauses BooleanQuery supports is 1024.
I am guessing you have categoryIDs between 1-3, which means more than
1024 clauses.
-Hemant
setMaxClauseCount
Ashley Rajaratnam wrote:
Hi,
Please forgive me if this comes a