It should be org.apache.lucene.analysis.core.WhitespaceAnalyzer ( u were missing the 'core')
Mahout trunk's presently at Lucene 4.2.1. Lucene's has gone through a major refactor in 4.x. Check Lucene 4.2.1 docs for the correct package name. ________________________________ From: 万代豊 <20525entrad...@gmail.com> To: "user@mahout.apache.org" <user@mahout.apache.org> Sent: Tuesday, May 7, 2013 3:20 AM Subject: Class Not Found from 0.8-SNAPSHOT for org.apache.lucene.analysis.WhitespaceAnalyzer Hi all I guest I must've seen somewhere on very similar topics on classname change in Mahout-0.8-SNAPSHOT for some of the Lucene analyzer and here is another one that I need to be solved. Mahout gave me an error for seq2sparse with Lucene analyzer option as follows, which of cource had been working in at least Mahout 0.7. $MAHOUT_HOME/bin/mahout seq2sparse --namedVector -i NHTSA-seqfile01/ -o NHTSA-namedVector -ow -a org.apache.lucene.analysis.WhitespaceAnalyzer -chunk 200 -wt tfidf -s 5 -md 3 -x 90 -ng 2 -ml 50 -seq -n 2 Running on hadoop, using /usr/local/hadoop/bin/hadoop and HADOOP_CONF_DIR= MAHOUT-JOB: /usr/local/trunk/examples/target/mahout-examples-0.8-SNAPSHOT-job.jar 13/05/07 15:41:12 INFO vectorizer.SparseVectorsFromSequenceFiles: Maximum n-gram size is: 2 13/05/07 15:41:18 INFO vectorizer.SparseVectorsFromSequenceFiles: Minimum LLR value: 50.0 13/05/07 15:41:18 INFO vectorizer.SparseVectorsFromSequenceFiles: Number of reduce tasks: 1 Exception in thread "main" java.lang.ClassNotFoundException: org.apache.lucene.analysis.WhitespaceAnalyzer I have confirmed what classpath Mahout is refering to as; $ $MAHOUT_HOME/bin/mahout classpath and obtained Lucene related classpath as below. /usr/local/trunk/examples/target/dependency/lucene-analyzers-common-4.2.1.jar /usr/local/trunk/examples/target/dependency/lucene-benchmark-4.2.1.jar: /usr/local/trunk/examples/target/dependency/lucene-core-4.2.1.jar /usr/local/trunk/examples/target/dependency/lucene-facet-4.2.1.jar /usr/local/trunk/examples/target/dependency/lucene-highlighter-4.2.1.jar /usr/local/trunk/examples/target/dependency/lucene-memory-4.2.1.jar /usr/local/trunk/examples/target/dependency/lucene-queries-4.2.1.jar /usr/local/trunk/examples/target/dependency/lucene-queryparser-4.2.1.jar /usr/local/trunk/examples/target/dependency/lucene-sandbox-4.2.1.jar I want to believe this to be simple classname change related issue. Please let me be advised. Regards,,, Y.Mandai