Hi,

The similarity measures other than cosine - match, jacc, dice etc. do not use the frequency information for their computation, they just need binary information about features presence/absence which is supported by bitsimat.pl.

Thus if the requirement is to cluster in similarity space using similarity measure other than cosine then you only need binary context representation and then proceed with using bitsimat.pl.

Regards,
Anagha



Hi Ted,

Is there planned support for different measures of similarity in senseclusters' simat.pl? Right now there's the cosine function, but I'd be curious to compare results based on jaccard and other measures as well. I noticed that you have some choices when using CLUTO, but i'm working most of the time on raw similarities.

If there's evidence or experiences that suggests one measure over another, I'd also be interested finding out more about that

kind regards,

s.



-------------------------------------------------------
This SF.Net email is sponsored by xPML, a groundbreaking scripting language
that extends applications into web and mobile media. Attend the live webcast
and join the prime developer group breaking into this new coding territory!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642
_______________________________________________
senseclusters-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/senseclusters-users

Reply via email to