Re: I believe the TanimotoSimilarity scorer actually uses the Jaccard similarity measure

2013-04-08 Thread Ted Dunning
I don't see the problem here. We only want to compare two items, so Jaccard and Tanimoto are identical. Could you file a JIRA and suggest a javadoc patch? Why did this take you to an ancient journal instead of Wikipedia? On Apr 7, 2013, at 6:54 AM, James Endicott wrote: As far as I can
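
A minimal sketch of why the two measures agree for a pairwise comparison, assuming each item is represented as a set (or, equivalently, a 0/1 vector over a shared universe). This is not Mahout's code; the function names `jaccard`, `tanimoto`, and `to_binary` are hypothetical and used only for illustration. The vector form of the Tanimoto coefficient used here is x·y / (|x|² + |y|² − x·y), which for binary vectors reduces to |A ∩ B| / |A ∪ B|.

```python
def jaccard(a, b):
    """Jaccard similarity of two sets: |A & B| / |A | B|."""
    a, b = set(a), set(b)
    union = a | b
    if not union:
        return 0.0  # convention chosen here for two empty sets
    return len(a & b) / len(union)


def tanimoto(x, y):
    """Tanimoto coefficient of two equal-length numeric vectors."""
    dot = sum(xi * yi for xi, yi in zip(x, y))
    denom = sum(xi * xi for xi in x) + sum(yi * yi for yi in y) - dot
    return dot / denom if denom else 0.0


def to_binary(items, universe):
    """Encode a set as a 0/1 vector over an ordered universe."""
    items = set(items)
    return [1 if u in items else 0 for u in universe]


# Hypothetical preference sets for two users.
universe = ["a", "b", "c", "d", "e"]
user1 = {"a", "b", "c"}
user2 = {"b", "c", "d"}

print(jaccard(user1, user2))
print(tanimoto(to_binary(user1, universe), to_binary(user2, universe)))
```

Both calls print the same value for any pair of sets, because on 0/1 vectors the dot product counts the intersection and the denominator counts the union. The two measures can differ only for non-binary vectors or for generalizations beyond two items.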

Re: I believe the TanimotoSimilarity scorer actually uses the Jaccard similarity measure

2013-04-08 Thread James Endicott
I didn't want to file a suggestion for a javadoc patch without hearing from someone who knows a bit more about the math history behind it because I didn't want to suggest something that may be in error. When I checked the Wikipedia article on it, the article noted that there was confusion an

Re: I believe the TanimotoSimilarity scorer actually uses the Jaccard similarity measure

2013-04-08 Thread Ted Dunning
To my mind, you as the reader have a major voice here. So if you were confused/not happy with the doc, then there is a problem. You will know best how to fix that when you get done. So let us know how! On Mon, Apr 8, 2013 at 2:16 PM, James Endicott endicott.ja...@gmail.com wrote: I didn't

I believe the TanimotoSimilarity scorer actually uses the Jaccard similarity measure

2013-04-07 Thread James Endicott
I might be wrong about this, because I'm just a grad student and my knowledge of statistics and ability to read French leave something to be desired, but I think that the TanimotoSimilarity scorer actually uses the Jaccard similarity measure instead of the Tanimoto similarity measure. The javadocs

Re: I believe the TanimotoSimilarity scorer actually uses the Jaccard similarity measure

2013-04-07 Thread Sean Owen
I had not heard of Tanimoto being generalized to n-way similarity, but then again, I can't say I know much at all authoritative about the term. The Wikipedia page says it's incorrectly used to describe a lot of things. Here, we're only looking at 2-way comparisons, pair-wise similarity. As far as