Re: mahout 0.5 to 0.7 commandline parameter of lda

2012-10-18 Thread Jake Mannix
On Thu, Oct 18, 2012 at 9:16 AM, Vineeth wrote: > I am running the lda for the first time. I gave the following command to > test over the Reuters dataset but i got the error > > lda -i reuters-vectors/tf-vectors -o reuters-lda-sparse -k 10 -v 7000 -x > 20 -ow > > hadoop binary is not in PATH,HAD

Re: mahout 0.5 to 0.7 commandline parameter of lda

2012-10-18 Thread Vineeth
I am running the lda for the first time. I gave the following command to test over the Reuters dataset but i got the error lda -i reuters-vectors/tf-vectors -o reuters-lda-sparse -k 10 -v 7000 -x 20 -ow hadoop binary is not in PATH,HADOOP_HOME/bin,HADOOP_PREFIX/bin, running locally SLF4J: C

Re: mahout 0.5 to 0.7 commandline parameter of lda

2012-10-18 Thread Jake Mannix
For Mahout 0.7, the format of the model files for LDA are just a SequenceFile, with the row numbers being the topicIds, and the entries being the (un-normalized) probabilities for each termId. bin/vectordump --dictionary \ --dictioanryType \ --in

mahout 0.5 to 0.7 commandline parameter of lda

2012-10-17 Thread vineeth
Hello, I am seeing from this website http://theglassicon.com/computing/machine-learning/running-lda-algorithm-mahout (Mahout 0.5). This website give the complete procedure to get probabilities of word and topics using LDA. However, these steps donot work on Mahout 0.7. Can some one give an up