[ https://issues.apache.org/jira/browse/MAHOUT-863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Miroslav Pankov updated MAHOUT-863:
-----------------------------------
Status: Patch Available (was: Open)
Here is a patch for this issue. The originally proposed display, with lines
connecting all of the points in a cluster, is included, but it does not look
good with the sample data: it is very hard to tell which points belong to
which cluster. We have therefore added the following display options to the
program (they are also described in the documentation of the class):
1. Highlight each cluster's points in a slide show: This presentation is
enabled by the -p command-line parameter. It starts a slide show that
highlights the clusters one at a time, each for a fixed update period. The
slide show can be paused and resumed with the space key. This is the default
option. The update period is configurable and is read from the second
command-line parameter, which is expected to be an integer giving the number
of seconds each cluster stays highlighted.
2. Display lines between all points in a cluster: This presentation is enabled
by the -l command-line parameter. Each cluster's lines are drawn in a
different color because a point can belong to more than one cluster, and its
memberships cannot be traced if the colors do not differ. This display does
not look good with the sample data because the data set is very large. It is
better suited to viewing a small number of clusters and points.
3. Display clusters as symbols: This presentation is enabled by the -s
command-line parameter. Each cluster is represented by a unique symbol: a
character drawn in a specific (randomly chosen) color. Next to each point, the
symbols of the clusters to which it belongs are drawn. With the sample data,
however, this presentation does not look good either, because each point
belongs to 4+ clusters. It works well when each point belongs to one or at
most two clusters.
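
For readers who have not opened the patch, the option handling described above
could look roughly like the sketch below. This is not the code from
MAHOUT-863.patch: the class name DisplayOptionsSketch, the Mode enum, and the
default period of 3 seconds are all assumptions made for illustration. Only the
-p, -l and -s flags, the slide show being the default, and the integer number
of seconds in the second parameter come from the description above.

```java
/** Hypothetical sketch of the option parsing; not the class from the patch. */
final class DisplayOptionsSketch {

  /** The three presentations described in the message. */
  enum Mode { SLIDE_SHOW, LINES, SYMBOLS }

  /** Assumed default; the message does not state the actual value. */
  static final int DEFAULT_PERIOD_SECONDS = 3;

  final Mode mode;
  final int periodSeconds;

  DisplayOptionsSketch(Mode mode, int periodSeconds) {
    this.mode = mode;
    this.periodSeconds = periodSeconds;
  }

  /** Reads the display flag from args[0] and the update period from args[1]. */
  static DisplayOptionsSketch parse(String[] args) {
    // The slide show is the default when no flag is given.
    Mode mode = Mode.SLIDE_SHOW;
    if (args.length > 0) {
      switch (args[0]) {
        case "-p": mode = Mode.SLIDE_SHOW; break;
        case "-l": mode = Mode.LINES; break;
        case "-s": mode = Mode.SYMBOLS; break;
        default:
          throw new IllegalArgumentException("Unknown display option: " + args[0]);
      }
    }

    // The second parameter is the number of seconds each cluster stays highlighted.
    int period = DEFAULT_PERIOD_SECONDS;
    if (args.length > 1) {
      try {
        period = Integer.parseInt(args[1]);
      } catch (NumberFormatException e) {
        throw new IllegalArgumentException("Update period must be an integer: " + args[1], e);
      }
      if (period <= 0) {
        throw new IllegalArgumentException("Update period must be positive: " + period);
      }
    }
    return new DisplayOptionsSketch(mode, period);
  }
}
```

With this sketch, parse(new String[] {"-p", "5"}) selects the slide show with
a 5-second period, and an empty argument array falls back to the slide show
with the assumed default. The sketch parses the period for every mode, although
it only matters for the slide show; the patch may handle that differently.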
> Add DisplayMinhash clustering example
> -------------------------------------
>
> Key: MAHOUT-863
> URL: https://issues.apache.org/jira/browse/MAHOUT-863
> Project: Mahout
> Issue Type: Improvement
> Reporter: Grant Ingersoll
> Priority: Minor
> Labels: MAHOUT_INTRO_CONTRIBUTE
> Attachments: MAHOUT-863.patch
>
>
> We've got simple GUI tools for many of the clustering algorithms; we should
> add one for Minhash, too.