If u r using Mahout 0.8, suggest that you look at the CVB invocation in 
examples/bin/cluster-reuters.sh as reference for the sequence of steps (and 
other command line options for each step).

ldatopics has been deprecated (in 0.8) and removed completely (in 0.9).

Anyways, the input vectors directory in ur case would be - 
'/opt/mahout/cvb-output/topic_dist.out', but I would desist from using it as 
its been deprecated.





________________________________
 From: Christopher Schindler <ideab...@hotmail.com>
To: "user@mahout.apache.org" <user@mahout.apache.org> 
Sent: Wednesday, August 7, 2013 2:34 AM
Subject: Using CVB; LdaTopics confusion
 

Hi all,
A noob question I'm sure but I'm stuck. I'm using CVB to cluster a text index 
of articles. 
Here's the CVB call:
bin/mahout cvb \ -i /opt/mahout/lucene-sparse-vectors-cvb/matrix \ -dict 
/opt/mahout/cvb-output/dict.file-* \ -o /opt/mahout/cvb-output/topic_terms.out 
\ -dt /opt/mahout/cvb-output/topic_dist.out \ -k 200 \-mt 
/opt/mahout/output/iterations/ \-x 20 -a .25 -ow
I'm trying to access the topics using ldatopics per 
https://cwiki.apache.org/confluence/display/MAHOUT/Latent+Dirichlet+Allocation. 
My latest combination was: bin/mahout ldatopics -i opt/mahout/cvb-output/ -d 
/opt/mahout/cvb-output/dict.file-*
However, it returns an error stating: ERROR driver.MahoutDriver: : Try the new 
Collapsed Variation Bayes LDA, try bin/mahout cvb or bin/mahout cvb0_local
The spec is:bin/mahout ldatopics \    -i <input vectors directory> \    -d 
<input dictionary file> \
What is the vectors directory supposed to be? Many thanks in advance.
Cheers!
Chris 

Reply via email to