Jake Mannix wrote:
> Remember: we're not really doing cosine at all here.
This, I think, is fuzzy right? It seems to be common to still call this
cosine scoring loosely - pretty much every practical impl fudges things
somewhat when doing the normalization (though we are on the heavy side
of fudgers) - I think its pretty rare to do the true cosine because its
so expensive. It can be somewhat misleading though.

Have you looked at the Similarity scoring explanation page that was
recently improved? Have any suggestions on changes to it? Doron put a
fair amount of work into improving it recently, but I think it could
always get better. Its currently leaning towards presenting this as
cosine - that seems in line with the few text books I've seen, but I'm
admittedly not that deep into any of this.

---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org

Reply via email to