The features all take on non-negative values here, right? Then the cosine can't be negative.
In another context, where features could be negative, cosine could indeed be negative. -1 means most dissimilar of all -- the feature vectors are exactly opposed. On Wed, Jun 15, 2011 at 10:10 AM, Stefan Wienert <[email protected]> wrote: > Ignoring is no option... so I have to interpret these values. > Can one say that documents with similarity = -1 are the less similar > documents? I don't think this is right. > Any other assumptions?
