Get (better) cluster labels using Log Likelihood Ratio
------------------------------------------------------
Key: MAHOUT-163
URL: https://issues.apache.org/jira/browse/MAHOUT-163
Project: Mahout
Issue Type: Improvement
Reporter: Shashikant Kore
Attachments: mahout-cluster-labels-llr.patch

Log Likelihood Ratio (LLR) is a better technique for identifying cluster labels than taking the top features of the centroid vector. LLR finds terms and phrases that are common within the cluster but rare outside it.
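
To make the idea concrete, here is a minimal Python sketch of LLR-based labelling. It is not the code in the attached patch. It assumes documents are given as lists of terms, counts document frequencies inside and outside the cluster, and scores each term with Dunning's log-likelihood ratio on the resulting 2x2 table. The function names (`llr`, `cluster_labels`) and the `top_n` parameter are hypothetical, chosen only for this illustration.

```python
import math
from collections import Counter


def x_log_x(x):
    """x * ln(x), with the usual convention that 0 * ln(0) = 0."""
    return 0.0 if x == 0 else x * math.log(x)


def entropy(*counts):
    """Unnormalized Shannon entropy of a set of counts."""
    return x_log_x(sum(counts)) - sum(x_log_x(c) for c in counts)


def llr(k11, k12, k21, k22):
    """Log-likelihood ratio (G^2) for a 2x2 contingency table.

    k11: cluster docs containing the term
    k12: cluster docs not containing the term
    k21: outside docs containing the term
    k22: outside docs not containing the term
    """
    row = entropy(k11 + k12, k21 + k22)
    col = entropy(k11 + k21, k12 + k22)
    mat = entropy(k11, k12, k21, k22)
    # Clamp tiny negative values caused by floating-point rounding.
    return max(0.0, 2.0 * (row + col - mat))


def cluster_labels(cluster_docs, other_docs, top_n=3):
    """Return up to top_n terms ranked by LLR (hypothetical helper)."""
    in_n, out_n = len(cluster_docs), len(other_docs)
    in_df = Counter(t for doc in cluster_docs for t in set(doc))
    out_df = Counter(t for doc in other_docs for t in set(doc))

    scored = []
    for term, k11 in in_df.items():
        k21 = out_df.get(term, 0)
        # LLR is symmetric, so skip terms that are not relatively
        # more frequent inside the cluster than outside it.
        if k11 * out_n <= k21 * in_n:
            continue
        scored.append((llr(k11, in_n - k11, k21, out_n - k21), term))

    # Highest score first; ties broken alphabetically for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [term for _, term in scored[:top_n]]


cluster_docs = [
    ["apache", "mahout", "cluster"],
    ["mahout", "cluster", "label"],
    ["mahout", "the"],
]
other_docs = [["the", "cat"], ["the", "dog"], ["the", "cluster"]]
labels = cluster_labels(cluster_docs, other_docs)
```

In this toy data, "mahout" appears in every cluster document and in none of the others, so it ranks first, while "the" is filtered out because it is more frequent outside the cluster. A centroid-based approach that simply takes the heaviest features has no such outside-the-cluster comparison.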