Jake Mannix wrote:
> Remember: we're not really doing cosine at all here.

This, I think, is fuzzy, right? It seems to be common to still call this cosine scoring loosely - pretty much every practical impl fudges things somewhat when doing the normalization (though we are on the heavy side of fudgers) - and I think it's pretty rare to do the true cosine because it's so expensive. It can be somewhat misleading though.
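To make the distinction concrete, here's a rough sketch of what I mean by "true cosine" versus a fudged normalization. This is illustration only, not Lucene's actual code: the class and method names are made up, the term weights are plain doubles in a map, and the shortcut assumes a norm of 1/sqrt(number of terms in the doc), precomputed without looking at the weights.

```java
import java.util.Map;

// Hypothetical illustration; none of these names come from Lucene.
final class CosineSketch {

    // Dot product over the terms the two sparse vectors share.
    static double dot(Map<String, Double> a, Map<String, Double> b) {
        double sum = 0.0;
        for (Map.Entry<String, Double> e : a.entrySet()) {
            Double w = b.get(e.getKey());
            if (w != null) {
                sum += e.getValue() * w;
            }
        }
        return sum;
    }

    // Euclidean length: needs every term weight in the vector,
    // which is what makes the true cosine expensive per document.
    static double euclideanNorm(Map<String, Double> v) {
        double sumSq = 0.0;
        for (double w : v.values()) {
            sumSq += w * w;
        }
        return Math.sqrt(sumSq);
    }

    // True cosine: dot product divided by both Euclidean lengths.
    static double trueCosine(Map<String, Double> q, Map<String, Double> d) {
        double denom = euclideanNorm(q) * euclideanNorm(d);
        return denom == 0.0 ? 0.0 : dot(q, d) / denom;
    }

    // Assumed shortcut: divide only by sqrt(term count of the doc).
    // The result is no longer bounded by 1, so it is not a cosine.
    static double lengthNormScore(Map<String, Double> q,
                                  Map<String, Double> d,
                                  int docTermCount) {
        return docTermCount <= 0 ? 0.0 : dot(q, d) / Math.sqrt(docTermCount);
    }
}
```

Scoring a vector against itself with trueCosine gives a value of 1 (up to rounding), while lengthNormScore can come out well above 1 for the same input, which is the sense in which calling it "cosine" is loose.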
Have you looked at the Similarity scoring explanation page that was recently improved? Do you have any suggestions for changes to it? Doron put a fair amount of work into improving it recently, but I think it could always get better. It's currently leaning towards presenting this as cosine - that seems in line with the few textbooks I've seen, but I'm admittedly not that deep into any of this.