On Sep 11, 2010, at 8:02 PM, Grant Ingersoll wrote:

> I've made a little bit of progress here, but not much.  Here's what I ran:
> 
> elastic-mapreduce -j <JOB>  --jar 
> s3n://news-vecs/mahout-core-0.4-SNAPSHOT.job  --main-class 
> org.apache.mahout.clustering.kmeans.KMeansDriver --arg --input --arg 
> s3n://news-vecs/part-out.vec --arg --clusters --arg 
> s3n://news-vecs/kmeans/clusters/ --arg


> --k

Ugh.  It's -k, not --k.  

So, this bit of code could likely be more useful:
} catch (IllegalArgumentException e) {
      log.error(e.getMessage());
      CommandLineUtil.printHelpWithGenericOptions(group);
      return null;
    }

Since, at least on EMR, the logs tend to get buried and it writes it out to 
syslog, not stderr or stdout.

I have it running now by logging into the EMR instance using SSH and then I 
also specifically uploaded my Vector file to HDFS by hand.  In other words, I'm 
not using the remote Ruby CLI just yet.

Progress.  Sigh.

-Grant

Reply via email to