On Feb 16, 2010, at 3:18 PM, Ted Dunning wrote:

> On Tue, Feb 16, 2010 at 11:13 AM, Jason Rennie <[email protected]> wrote:
> 
>> Am I incorrect in thinking that the events used for LLR here are the
>> occurrences of the individual terms in a bigram?  I'm looking here:
>> 
>> 
>> http://svn.apache.org/viewvc/lucene/mahout/trunk/math/src/main/java/org/apache/mahout/math/stats/LogLikelihood.java?view=markup
>> 
> 
> Here is my take on the matter:
> http://tdunning.blogspot.com/2008/03/surprise-and-coincidence.html
> 
> The events are occurrences of word A (and complementarily, any non-A word)
> in the first position and word B (and non-B words) in the second position.

Jason, the Javadocs on the file you mentioned have more or less plagiarized 
Ted's most excellent blog post, so hopefully it explains what you need, but 
there may still be room for more clarification.

Reply via email to