On Thu, Aug 7, 2014 at 11:34 AM, Ted Dunning <[email protected]> wrote:
>
> Can you say a bit more about what you are trying to do?
>

Thank you. I would like to customize the co-occurrence code so that it does not just take the top N scored co-occurrences, but also makes sure that each of them rejects the hypothesis of coincidence at a given confidence level (anywhere between, say, 60% and 99%). I guess it is OK even if this is only an asymptotic approximation, since in practice we will be testing far out in the right tail.

In other words, I don't want to bother accumulating obvious, pre-judged coincidences. In applications other than direct co-occurrence, a similar decision rule would be sensible.

> [1] https://dl.dropboxusercontent.com/u/36863361/llr-table.png
> [2] http://www.aclweb.org/anthology/J93-1003
>
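
To make the idea concrete, here is a minimal sketch of the filter I have in mind. It is not the existing co-occurrence code. It assumes the score is the usual 2x2 log-likelihood ratio from [2] and that, asymptotically, it follows a chi-squared distribution with one degree of freedom. The function names (llr, llr_threshold, keep_significant) are made up for illustration.

```python
from math import log
from statistics import NormalDist


def x_log_x(x):
    # Convention: 0 * log(0) is taken as 0.
    return 0.0 if x == 0 else x * log(x)


def entropy(*counts):
    # Unnormalized entropy of a set of counts.
    return x_log_x(sum(counts)) - sum(x_log_x(k) for k in counts)


def llr(k11, k12, k21, k22):
    # Log-likelihood ratio (G^2) for a 2x2 contingency table.
    row = entropy(k11 + k12, k21 + k22)
    col = entropy(k11 + k21, k12 + k22)
    mat = entropy(k11, k12, k21, k22)
    # Clamp tiny negative values caused by floating-point round-off.
    return max(0.0, 2.0 * (row + col - mat))


def llr_threshold(confidence):
    # Assumption: LLR ~ chi-squared with 1 degree of freedom, i.e. the
    # square of a standard normal, so the chi-squared quantile is the
    # square of the two-sided normal quantile.
    z = NormalDist().inv_cdf((1.0 + confidence) / 2.0)
    return z * z


def keep_significant(scored, confidence, top_n):
    # scored: iterable of (item, llr_score) pairs.
    # Drop pairs below the cutoff first, then keep the top N of the rest.
    cutoff = llr_threshold(confidence)
    kept = [(item, s) for item, s in scored if s >= cutoff]
    kept.sort(key=lambda pair: pair[1], reverse=True)
    return kept[:top_n]
```

Under that assumption, a 95% confidence level corresponds to a cutoff of about 3.84 and 99% to about 6.63, so anything scoring below the cutoff is dropped even if fewer than N candidates remain.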
