subject:"bigram problem"

Re: bigram problem

2014-07-02 Thread Manjula Wijewickrema

Dear Parnab, Thanks a lot for your guidance. I prefer to follow the second method, as I have already indexed the bigrams using ShingleFilterWrapper. But, I have no any idea about how to use NGramTokenizer here. So, could you please write one or two lines of the code which shows how to use NGramTok

Re: bigram problem

2014-07-02 Thread parnab kumar

TF is straight forward, you can simply count the no of occurrences in the doc by simple string matching. For IDF you need to know total no of docs in the collection and the no. of docs having the bigram. reader.maxDoc() will give you the total no of docs in the collection. To calculate the number o

bigram problem

2014-07-02 Thread Manjula Wijewickrema

Hi, Could please explain me how to determine the tf-idf score for bigrams. My program is able to index and search bigrams correctly, but it does not calculate the tf-idf for bigrams. If someone can, please help me to resolve this. Regards, Manjula.

Re: bigram problem

Re: bigram problem

bigram problem

3 matches

Site Navigation

Mail list logo

Footer information