[ https://issues.apache.org/jira/browse/MAHOUT-552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12935407#action_12935407 ]
Pere Ferrera Bertran commented on MAHOUT-552:
---------------------------------------------
Thanks for your observations, Jeff. In that case, I guess the problem I am
reporting is specific to certain clustering algorithms. Concretely, I am using
Mean Shift Clustering, and I have found no way to preserve vector names in -cl
mode. I am using the latest code (0.5 snapshot).
In MeanShiftCanopyClusterMapper there seems to be some sort of equivalence
between input vectors and canopies. I can see that the vector written to
clusteredPoints is canopy.getCenter(). Is this right?
> AbstractCluster eliminates NamedVectors by replacing them with
> RandomAccessSparseVector always
> ----------------------------------------------------------------------------------------------
>
> Key: MAHOUT-552
> URL: https://issues.apache.org/jira/browse/MAHOUT-552
> Project: Mahout
> Issue Type: Bug
> Components: Clustering
> Affects Versions: 0.5
> Reporter: Pere Ferrera Bertran
> Fix For: 0.5
>
> Attachments: MAHOUT-552.patch
>
>
> When clustering using NamedVectors as input - after running seq2sparse with
> patch https://issues.apache.org/jira/browse/MAHOUT-401 - names are lost
> because AbstractCluster replaces vectors coming in the constructor with
> RandomAccessSparseVector.
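The name loss described above can be illustrated with a minimal, self-contained
sketch. It deliberately does not use Mahout's classes: Vec, PlainVec, NamedVec
and NameDemo are hypothetical stand-ins (PlainVec plays the role of
RandomAccessSparseVector, NamedVec the role of NamedVector), and
copyPreservingName is only one possible way to keep the name. It is not
necessarily what the attached patch does.

```java
// Stand-in types for illustration only; these are NOT Mahout's classes.
interface Vec {
    int size();
    double get(int index);
}

// Plain vector, playing the role of RandomAccessSparseVector in the report.
final class PlainVec implements Vec {
    private final double[] values;

    PlainVec(double[] values) { this.values = values.clone(); }

    public int size() { return values.length; }
    public double get(int index) { return values[index]; }
}

// Wrapper that attaches a name, playing the role of NamedVector.
final class NamedVec implements Vec {
    private final Vec delegate;
    private final String name;

    NamedVec(Vec delegate, String name) {
        this.delegate = delegate;
        this.name = name;
    }

    String getName() { return name; }
    public int size() { return delegate.size(); }
    public double get(int index) { return delegate.get(index); }
}

final class NameDemo {
    private NameDemo() {}

    // Copies the values into a fresh plain vector; any name on the input is dropped.
    static Vec copyLosingName(Vec input) {
        double[] values = new double[input.size()];
        for (int i = 0; i < values.length; i++) {
            values[i] = input.get(i);
        }
        return new PlainVec(values);
    }

    // Same copy, but re-wraps the result when the input carried a name.
    static Vec copyPreservingName(Vec input) {
        Vec copy = copyLosingName(input);
        if (input instanceof NamedVec) {
            return new NamedVec(copy, ((NamedVec) input).getName());
        }
        return copy;
    }
}
```

Passing a NamedVec through copyLosingName yields a PlainVec with the same
values but no name, which mirrors the behaviour reported here. Passing it
through copyPreservingName yields a NamedVec with the original name.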