Hi All,

I am trying to create a vector file to use as input to the KMeans clustering algorithm. My data is in Solr, and I followed this tutorial: https://cwiki.apache.org/confluence/display/MAHOUT/Creating+Vectors+from+Text

I ran this command:

bin/mahout lucene.vector --dir <path>/solr/data/index --field body \
--dictOut /tmp/dict.txt --output /tmp/out.txt --max 50 \
--norm 2

This produces two files. I then tried to use the output file with this command:

bin/mahout org.apache.mahout.clustering.kmeans.KMeansDriver \
-i out.txt -o input-data-kmeans-clusters -c clusters \
-m org.apache.mahout.common.distance.CosineDistanceMeasure \
-v org.apache.mahout.matrix.SparseVector -x 50

I get an error saying no clusters were found. Am I even using the right vector file?


Regards

Dave

