Thanks Drew,
I started a new EC2 instance with the mahout trunk and got it working.
There is a problem with the last line though.
The last line in the script gave an error:
../bin/mahout kmeans -i ./work/reuters-out-seqdir-sparse/tfidf/vectors/
-c ./work/clusters -o ./work/reuters-kmeans -k 20 -w
org.apache.commons.cli2.OptionException: Unexpected -w while processing
Options
Removing the -w and adding the -maxIter fixes it.
../bin/mahout kmeans -i ./work/reuters-out-seqdir-sparse/tfidf-vectors/
-c ./work/clusters -o ./work/reuters-kmeans -k 20 --maxIter 20
I added a comment to
https://issues.apache.org/jira/browse/MAHOUT-390
@tommychheng
Programmer and UC Irvine Graduate Student
Find a great grad school based on research interests: http://gradschoolnow.com
On 6/2/10 8:27 PM, Drew Farris wrote:
Very strange:
d...@skirnir:~/mahout/svn-trunk$ svn info
Path: .
URL: https://svn.apache.org/repos/asf/mahout/trunk
Repository Root: https://svn.apache.org/repos/asf
Repository UUID: 13f79535-47bb-0310-9956-ffa450edef68
Revision: 950859
[...]
d...@skirnir:~/mahout/svn-trunk$ ./bin/mahout seqdirectory -i
./work/reuters-out -o ./work/reuters-out-seqdir -c UTF-8
no HADOOP_CONF_DIR or HADOOP_HOME set, running locally
[..]
d...@skirnir:~/mahout/svn-trunk$ ls ./work/reuters-out-seqdir
chunk-0
To be absolutely certain nothing old is lurking in your target directories,
try 'mvn clean install' to rebuild and see if your results differ. If you
prefer, you can skip test execution 'mvn clean install -DskipTests=true'
IF that doesn't work, run 'mvn -v' and post the results -- that might
provide some clues.
- Drew
On Tue, Jun 1, 2010 at 9:39 PM, Tommy Chheng<[email protected]> wrote:
I updated the svn and did a mvn install but still getting a parsing
command line error on the seqdirectory command.
$svn info
Path: .
URL: http://svn.apache.org/repos/asf/mahout/trunk
Repository Root: http://svn.apache.org/repos/asf
Repository UUID: 13f79535-47bb-0310-9956-ffa450edef68
Revision: 950329
Node Kind: directory
Schedule: normal
Last Changed Author: srowen
Last Changed Rev: 950049
Last Changed Date: 2010-06-01 05:55:49 -0700 (Tue, 01 Jun 2010)
$./bin/mahout seqdirectory -i ./work/reuters-out/ -o
./work/reuters-out-seqdir -c UTF-8
no HADOOP_CONF_DIR or HADOOP_HOME set, running locally
Exception in thread "main" org.apache.commons.cli2.OptionException:
Unexpected -i while processing Options
at org.apache.commons.cli2.commandline.Parser.parse(Parser.java:99)
at
org.apache.mahout.text.SequenceFilesFromDirectory.main(SequenceFilesFromDirectory.java:205)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at
org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
at
org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:174)
@tommychheng
Programmer and UC Irvine Graduate Student
Find a great grad school based on research interests:
http://gradschoolnow.com
On 6/1/10 12:43 PM, Grant Ingersoll wrote:
Can you try doing an SVN update and then "mvn install" and then run again?
On May 31, 2010, at 12:28 PM, Tommy Chheng wrote:
Hi,
I'm using the quickstart-kmeans.sh script from
https://issues.apache.org/jira/browse/MAHOUT-390 to run the example
kmeans. I'm on mahout trunk.
It fails on the SequenceFile generation step:
$./bin/mahout seqdirectory -i ./work/reuters-out/ -o
./work/reuters-out-seqdir -c UTF-8
no HADOOP_CONF_DIR or HADOOP_HOME set, running locally
Exception in thread "main" org.apache.commons.cli2.OptionException:
Unexpected -i while processing Options
at
org.apache.commons.cli2.commandline.Parser.parse(Parser.java:99)
at
org.apache.mahout.text.SequenceFilesFromDirectory.main(SequenceFilesFromDirectory.java:205)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at
org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
at
org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
at
org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:174)
Alternatively, I tried ./bin/mahout seqdirectory --input
./work/reuters-out/ -o ./work/reuters-out-seqdir -c UTF-8 but the get the
same unexpected --input error.
--
@tommychheng
Programmer and UC Irvine Graduate Student
Find a great grad school based on research interests:
http://gradschoolnow.com