On 5/19/10 3:19 PM, Jeff Eastman wrote:
I tried propagating numReducers into its makePartialVectors driver;
however, but a single reducer is still all I get. I need to figure out
how to tickle the elephant to give me more.
Note to self: Use a real elephant. Running Hadoop in Eclipse is great
for debugging but it does not launch multiple mappers or reducers.
Running on a single-host Hadoop cluster; however, does and the elephant
is now dancing nicely.
./bin/mahout seq2sparse -i ./examples/bin/work/reuters-out-seqdir/ -o
./examples/bin/work/reuters-out-seqdir-sparse -wt tf -seq -nr 2
now produces two input vector files for LDA to munch on. Now to try it
on a real cluster...