On Mon, 2015-06-22 at 00:55 +0900, Anthony Beylerian wrote:
> Dear Jörn,
> Thank you for that.
> 
> After further surveying, I was thinking of beginning the implementation of an 
> approach based on context clustering as a next step.
> Maybe similar to the one in [1] which relies on a public (CC-A licensed) 
> dataset [2].Since clustering is usually done using K-means, which could take 
> some time with large data, this was already done previously and the results 
> were made publicly available in [3] with up to 20 closest clusters per 
> "phrase".
> The authors in [1] propose to subsequently apply a Naive Bayes classifier as 
> described in their paper.I believe this is straight-forward enough to 
> implement as another unsupervised approach for the proposed time-frame.
> Would like your opinion.

Your users can just download the dataset and do the clustering them
self. It should be possible to do that anyway. All the code necessary to
do that should be available as part of your contribution.

Jörn

Attachment: signature.asc
Description: This is a digitally signed message part

Reply via email to