Hi All,
I am trying to create a vector file to use as input to the KMeans
clustering algorithm. My data is in Solr, and I followed this tutorial:
https://cwiki.apache.org/confluence/display/MAHOUT/Creating+Vectors+from+Text
I then ran this command:
bin/mahout lucene.vector --dir <path>/solr/data/index --field body \
--dictOut /tmp/dict.txt --output /tmp/out.txt --max 50 \
--norm 2
This produces two files. I then tried to use the output file with this command:
bin/mahout org.apache.mahout.clustering.kmeans.KMeansDriver \
-i out.txt -o input-data-kmeans-clusters -c clusters \
-m org.apache.mahout.common.distance.CosineDistanceMeasure \
-v org.apache.mahout.matrix.SparseVector -x 50
I get an error saying that no clusters were found. Am I even using the
right vector file?
Regards
Dave