That's correct. I reported this last summer but didn't fix it in M-1030,
since that didn't seem to be what the group wanted from that bug.

I see you're filing another bug, thanks.


On Fri, Jan 24, 2014 at 10:29 AM, Pat Ferrel <p...@occamsmachete.com> wrote:

> I can’t believe I haven’t noticed this before and so am hoping I’m
> mistaken…
>
> When you are using kmeans to cluster data where there is no “named”
> vector, clusteredPoints do not contain the vector ids so the cluster id,
> pdf, “distance-squared”, and vector dimensions are not tied to any known
> vector and so are, well, pretty much useless afaict.
>
> This means you have to loop through all your input vectors, recalculate
> any of the above values you need and categorize them yourself, right? Is
> this how it’s meant to work?
>
> I have used clustering before but had named vectors (text docs). Anyone
> clustering some intermediate Mahout DRM or vectors with no names will have
> this problem.
>
> Someone please tell me I’ve slipped a gear...
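
For anyone hitting this in the meantime, the workaround Pat describes
(loop over the input vectors and categorize them yourself) might look
roughly like the sketch below. It is plain Java and deliberately does not
use any Mahout classes. It assumes the final centroids have already been
read into double[] arrays, that the row index of each input vector serves
as its id, and that squared Euclidean distance is the measure in use. The
class and method names are made up for illustration.

```java
import java.util.Arrays;

// Hypothetical helper, not part of Mahout: assigns unnamed vectors to
// the nearest of a set of already-computed centroids.
public class AssignUnnamedVectors {

    /** Squared Euclidean distance between two dense vectors of equal length. */
    public static double distanceSquared(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            sum += d * d;
        }
        return sum;
    }

    /**
     * For each input row, returns the index of the nearest centroid.
     * The position in the returned array is the row index of the input
     * vector, which stands in for the missing vector id.
     */
    public static int[] assign(double[][] vectors, double[][] centroids) {
        int[] clusterOf = new int[vectors.length];
        for (int row = 0; row < vectors.length; row++) {
            int best = 0;
            double bestDist = Double.POSITIVE_INFINITY;
            for (int c = 0; c < centroids.length; c++) {
                double d = distanceSquared(vectors[row], centroids[c]);
                if (d < bestDist) {
                    bestDist = d;
                    best = c;
                }
            }
            clusterOf[row] = best;
        }
        return clusterOf;
    }

    public static void main(String[] args) {
        // Toy data: two obvious groups, near (0,0) and near (10,10).
        double[][] vectors = {{0, 0}, {1, 0}, {9, 9}, {10, 10}};
        double[][] centroids = {{0.5, 0}, {9.5, 9.5}};
        // Prints [0, 0, 1, 1]: rows 0 and 1 go to centroid 0, rows 2 and 3 to centroid 1.
        System.out.println(Arrays.toString(assign(vectors, centroids)));
    }
}
```

This only recovers the cluster assignment and the distance. Anything else
you need from clusteredPoints would have to be recomputed the same way.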
