[ https://issues.apache.org/jira/browse/MAHOUT-863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Miroslav Pankov updated MAHOUT-863:
-----------------------------------
    Status: Patch Available  (was: Open)

Here is the solution to the problem. The originally proposed solution, with lines connecting all of the points in a cluster, is included, but it does not look very good with the sample data and makes it very hard to tell which points belong to which cluster.

We have added the following display options to the program (they are also described in the documentation of the class):

1. Highlight each cluster's points in a slide show: This presentation is enabled with the -p command-line parameter. It starts a slide show that highlights the clusters one after another, each for a configurable update period. The slide show can be paused and resumed with the space key. This is the default option. The update period is read from the second command-line parameter, which is expected to be an integer giving the number of seconds each cluster stays highlighted.

2. Display lines between all points in a cluster: This presentation is enabled with the -l command-line parameter. Each cluster's lines are drawn in a different color, because a point can belong to more than one cluster and its membership cannot be followed if the colors do not differ. This display does not look good with the sample data because the data set is very large. It is suited to viewing a small number of clusters and points.

3. Display clusters as symbols: This presentation is enabled with the -s command-line parameter. Each cluster is represented by a unique character symbol in a randomly chosen color. Next to each point, the symbols of all the clusters it belongs to are drawn. This presentation does not look good with the sample data either, because each point belongs to four or more clusters. It works well when points belong to one or at most two clusters.
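For readers who want to see how the three display options and the update period described above might be read from the command line, here is a minimal sketch. It is not taken from the attached patch: the class name DisplayOptionsSketch, the Mode enum, the parse method, and the default period of 2 seconds are all assumptions made for illustration. Only the flags -p, -l, -s, the slide show being the default, and the second parameter being an integer number of seconds come from the description above.

```java
// Hypothetical sketch; names and the default period are assumptions, not the patch's code.
public class DisplayOptionsSketch {

    /** The three presentations described in the comment. */
    public enum Mode { SLIDE_SHOW, LINES, SYMBOLS }

    /** Parsed command-line settings. */
    public static final class Options {
        public final Mode mode;
        public final int updatePeriodSeconds;

        Options(Mode mode, int updatePeriodSeconds) {
            this.mode = mode;
            this.updatePeriodSeconds = updatePeriodSeconds;
        }
    }

    // Assumed default; the original comment does not state one.
    static final int DEFAULT_PERIOD_SECONDS = 2;

    /** First argument selects the presentation, second is the update period in seconds. */
    public static Options parse(String[] args) {
        Mode mode = Mode.SLIDE_SHOW; // the slide show is the default option
        int period = DEFAULT_PERIOD_SECONDS;

        if (args.length > 0) {
            switch (args[0]) {
                case "-p": mode = Mode.SLIDE_SHOW; break;
                case "-l": mode = Mode.LINES; break;
                case "-s": mode = Mode.SYMBOLS; break;
                default:
                    throw new IllegalArgumentException("Unknown display option: " + args[0]);
            }
        }
        if (args.length > 1) {
            // Integer.parseInt throws NumberFormatException for non-integer input.
            period = Integer.parseInt(args[1]);
            if (period <= 0) {
                throw new IllegalArgumentException("Update period must be positive: " + period);
            }
        }
        return new Options(mode, period);
    }

    public static void main(String[] args) {
        Options options = parse(args);
        System.out.println("mode=" + options.mode
            + " updatePeriodSeconds=" + options.updatePeriodSeconds);
    }
}
```

In this sketch the update period is parsed for every mode, although per the description it only affects the slide show. Rejecting unknown flags early is a design choice of the sketch, not something the comment specifies.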
> Add DisplayMinhash clustering example
> -------------------------------------
>
>                 Key: MAHOUT-863
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-863
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Grant Ingersoll
>            Priority: Minor
>              Labels: MAHOUT_INTRO_CONTRIBUTE
>         Attachments: MAHOUT-863.patch
>
>
> We've got simple GUI tools for many of the clustering algorithms, we should
> add one for Minhash, too