I believe, and I haven't checked my notes from way back when, that it doesn't need to be calculated b/c P(C) is the same across all the comparisons, so P(D|C) is the only factor that matters b/c you only need relative scores for ordering purposes.
On Apr 29, 2010, at 2:25 AM, Gurudev Devanla wrote: > Hello All, > > This is my first post ever on any open source mailing list. So, please > excuse me if I am not following certain standards. > > I was walking through the code for Naive Bayes classifier and I notice that > in TestClassifier.java, at the point where the document wieghts are > calculated the probability of the class(label) is not taken into > consideration. My knowledge of document wt in Naive Bayes is : > > Pr(C|D ) = Pr(D|C) * P(C) , but in the implementation I have downloaded, I > don't see Pr(C) being used in the calculation. > > Any pointers would be great. I am probably overlooking this be considered > elsewhere in the program. > > Thanks > betacoder -------------------------- Grant Ingersoll http://www.lucidimagination.com/ Search the Lucene ecosystem using Solr/Lucene: http://www.lucidimagination.com/search
