That's correct; I reported that last summer and didn't fix it in M-1030 since it didn't seem like that's what the group wanted in that bug.
I see you're filing another bug, thanks. On Fri, Jan 24, 2014 at 10:29 AM, Pat Ferrel <p...@occamsmachete.com> wrote: > I can’t believe I haven’t noticed this before and so am hoping I’m > mistaken… > > When you are using kmeans to cluster data where there is no “named” > vector, clusteredPoints do not contain the vector ids so the cluster id, > pdf, “distance-squared”, and vector dimensions are not tied to any known > vector and so are, well, pretty much useless afaict. > > This means you have to loop through all your input vectors, recalculate > any of the above values you need and categorize them yourself, right? Is > this how it’s meant to work? > > I have used clustering before but had named vectors (text docs). Anyone > clustering some intermediate Mahout DRM or vectors with no names will have > this problem. > > Someone please tell me I’ve slipped a gear...