Hi,
The parameter 'unigram' may be what you want.

  --unigram (-u)              If set, unigrams will be emitted inthe
                      final output alongside collocations

https://mahout.apache.org/users/basics/collocations.html


Joe

2015-12-03 4:38 GMT+08:00 Ankur Desai (ankurdes) <ankur...@cisco.com>:

> Hi,
>
> I am running collocation on mahout and are having trouble understanding
> what the minsupport parameter is doing.
>
> I want to get the bigrams/trigrams that occur at least 5 times in the
> corpus.  I set the minsupport value to 5 and I am still getting results
> that are occur only one time in the entire corpus.
>
> Can someone please help me understand what this parameter is for or how I
> can get bigrams/trigrams that occur at least X number of times.
>
> Thanks,
> Ankur
>

Reply via email to