Hi Antonio, Are you using the new LDA (invoked via "$MAHOUT_HOME/bin/mahout cvb <args>", or by invoking the class org.apache.mahout.clustering.lda.cvb.CVB0Driver manually)?
If so, then your first command should work fine: mahout vectordump -i DB-LDA-clusters/docTopics/part-m-00000 -o output/cluster_lda_topics.txt What error do you get? On Thu, Apr 12, 2012 at 6:21 AM, antonio d'agata <antoniodag...@gmail.com>wrote: > Dear users, > > I'm trying to use lda clustering algorithm by command line (using > mahout-07-snapshot) and I was able to get the topics (as text file > containing the top words) but I need also to get the documents id > associated to the calculated topics. > > I tried this commands: > mahout vectordump -i DB-LDA-clusters/docTopics/part-m-00000 -o > output/cluster_lda_topics.txt > mahout vectordump -i DB-LDA-clusters/docTopics/part-m-00000 -o > output/cluster_lda_topics.txt -dt text(or sequencefile) > but without success. > > Is there a way to do such work? > > Thanks > > Antonio Michelangelo D'Agata > -- -jake