On 5/19/10 3:19 PM, Jeff Eastman wrote:
I tried propagating numReducers into its makePartialVectors driver; however, but a single reducer is still all I get. I need to figure out how to tickle the elephant to give me more.
Note to self: Use a real elephant. Running Hadoop in Eclipse is great for debugging but it does not launch multiple mappers or reducers. Running on a single-host Hadoop cluster; however, does and the elephant is now dancing nicely.

./bin/mahout seq2sparse -i ./examples/bin/work/reuters-out-seqdir/ -o ./examples/bin/work/reuters-out-seqdir-sparse -wt tf -seq -nr 2

now produces two input vector files for LDA to munch on. Now to try it on a real cluster...

Reply via email to