Hi, I find this project very interesting. But I'd like to check how you define contexts.
Am I right in thinking that the context "features" which are used to construct context vectors and then similarity matrices are bigram word pairs only, albeit bigrams defined flexibly (by the NSP package) to range over the space of a sampling window? I think I do something very similar, but I found for my purposes it has been necessary to have both the preceding and following context in a single calculation to get good results (perhaps because I relate both single word and multiple word "tokens"). -Rob Freeman ------------------------------------------------------- This SF.net email is sponsored by: IBM Linux Tutorials. Become an expert in LINUX or just sharpen your skills. Sign up for IBM's Free Linux Tutorials. Learn everything from the bash shell to sys admin. Click now! http://ads.osdn.com/?ad_id=1278&alloc_id=3371&op=click _______________________________________________ senseclusters-users mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/senseclusters-users
