[ 
https://issues.apache.org/jira/browse/MAHOUT-863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Miroslav Pankov updated MAHOUT-863:
-----------------------------------

    Status: Patch Available  (was: Open)

Here is the solution to the problem. The original proposed solution with the 
lines connecting all of the points in a cluster is included, but it doesn't 
look very good with the sample data and it is very hard to understand which 
points belong to which cluster. We have added the following display options to 
the program (they can be found in the documentation of the class too):
1. Highlight different cluster's points in a slide show: This presentation is 
turned by the -p command-line parameter. Then it starts a slide show 
highlighting the different clusters in a certain update period. The slide show 
can be paused/continued with the space key. This is the default option. The 
update period is configurable too and it is read from the second command-line 
parameter. It is expected to be an integer which represents the seconds for 
which each of the clusters will stay highlighted.
2. Display lines between all points in a cluster: This presentation is turned 
by the -l command-line parameter. Each cluster has lines with different colors 
because a point can belong to more than one cluster and it is not possible to 
track it if the colors do not differ. This display doesn't look good with the 
sample data because it is very big. It can be used to view low number of 
clusters and points.
3. Display clusters as symbols: This presentation is turned by the -s 
command-line parameter. Each cluster has a unique symbol representation which 
is a character symbol in a specific (randomly chosen) color. Near all of the 
points the symbols of the clusters in which they belong are drawn. However with 
the sample data this presentation doesn't look really good too because each 
point belongs to 4+ clusters. This presentation is good when the points belong 
to 1 or maximum 2 clusters.
                
> Add DisplayMinhash clustering example
> -------------------------------------
>
>                 Key: MAHOUT-863
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-863
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Grant Ingersoll
>            Priority: Minor
>              Labels: MAHOUT_INTRO_CONTRIBUTE
>         Attachments: MAHOUT-863.patch
>
>
> We've got simple GUI tools for many of the clustering algorithms, we should 
> add one for Minhash, too

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to