Clustering uses sparse vectors by default. The missing coordinate values must 
be zeros.

-----Original Message-----
From: rmx [mailto:ruimax...@hotmail.com] 
Sent: Wednesday, November 17, 2010 1:29 PM
To: mahout-u...@lucene.apache.org
Subject: k-means output missing some cluster centers coordinates


Hi.

I am running k-means over a numerical dataset with 41 variables.

The output is missing  some cluster centers coordinates.

For example in this center it misses the coordinates number 6, 8, 10, 12,
13, 14,...:
"VL-489651{n=290923 c=[0:0.081, 1:2.931, 2:8.846, 3:1.000, 4:1009.936,
5:10.337, 7:0.000, 9:0.000, 11:0.033, 15:0.000, 16:0.001, 17:0.000,
18:0.000, 22:489.476, 23:489.492, 24:0.000, 25:0.000, 26:0.000, 27:0.000,
28:0.999, 29:0.001, 30:0.013, 31:250.361, 32:250.280, 33:0.986, 34:0.002,
35:0.969, 36:0.000, 37:0.000, 38:0.000, 39:0.000, 40:0.000, 41:8.723]
r=[0:2.377, 1:0.366, 2:1.750, 3:0.044, 4:949.196, 5:69.735, 7:0.009,
9:0.010, 11:0.179, 15:0.009, 16:0.031, 17:0.011, 18:0.020, 22:94.125,
23:94.032, 24:0.005, 25:0.005, 26:0.003, 27:0.003, 28:0.019, 29:0.031,
30:0.112, 31:28.323, 32:26.883, 33:0.087, 34:0.024, 35:0.169, 36:0.004,
37:0.011, 38:0.005, 39:0.014, 40:0.002, 41:1.470]}"

Thank you in advance
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/k-means-output-missing-some-cluster-centers-coordinates-tp1919928p1919928.html
Sent from the Mahout User List mailing list archive at Nabble.com.

Reply via email to