Re: Using Mahout 1.0-SNAPSHOT with yarn cluster continued

2015-01-08 Thread mw
Hello again, maybe my question was misleading. I am asking whether the intended usage is to provide the job with the required libraries and send those together with the job to YARN (if yes, how can this be done?), or to add the required classes to the classpath of every node in the cluster. Wha

DTW distance measure and K-medioids, Hierarchical clustering

2015-01-08 Thread Marko Dinic
Hello everyone. I have a couple of questions. 1) Is there an implementation of DTW (Dynamic Time Warping) in Mahout that could be used as a distance measure for clustering? 2) Why isn't there an implementation of K-medoids in Mahout? I'm guessing that it could not be implemented efficiently

Re: Using Mahout 1.0-SNAPSHOT with yarn cluster continued

2015-01-08 Thread Pat Ferrel
That sounds like a Hadoop list question. All I can say is there is a job.jar in mrlegacy/target with all dependencies packaged. This should have everything needed for LDA. On Jan 8, 2015, at 5:50 AM, mw wrote: Hello again, maybe my question was misleading. I am asking whether the intended us

Re: consistency of StaticWordValueEncoder

2015-01-08 Thread chirag lakhani
Thanks! Is that standard practice or do people typically serialize their encoders and then load the binaries later? On Wed, Jan 7, 2015 at 5:25 PM, Ted Dunning wrote: > On Wed, Jan 7, 2015 at 2:20 PM, chirag lakhani > wrote: > > > In the Mahout in Action book I got the impression that the term

output of Naive Bayes Classifer

2015-01-08 Thread chirag lakhani
I find the Java documentation for the classifyFull method in Naive Bayes. I have instantiated a Naive Bayes classifier: StandardNaiveBayesClassifier classifier = new StandardNaiveBayesClassifier(model); and then I try to evaluate a particular vector: Vector resultVector = classifier.classifyFull(v

Re: output of Naive Bayes Classifer

2015-01-08 Thread chirag lakhani
I meant to say that I found the Java documentation to be confusing. On Thu, Jan 8, 2015 at 1:49 PM, chirag lakhani wrote: > I find the java documentation for the classifyfull method in Naive Bayes > > I have instantiated a Naive Bayes classifier > > StandardNaiveBayesClassifier classifier = new >

Re: DTW distance measure and K-medioids, Hierarchical clustering

2015-01-08 Thread Ted Dunning
On Thu, Jan 8, 2015 at 7:00 AM, Marko Dinic wrote: > 1) Is there an implementation of DTW (Dynamic Time Warping) in Mahout that > could be used as a distance measure for clustering? > No. > > 2) Why isn't there an implementation of K-medoids in Mahout? I'm guessing > that it could not be imple
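
For readers following the DTW thread: since the reply says Mahout has no DTW implementation, here is a minimal sketch of the classic dynamic-programming DTW distance in plain Java. It is not Mahout code. The class name DtwSketch, the use of absolute difference as the local cost, and the lack of any warping-window constraint are all assumptions made for illustration. Using it for clustering in Mahout would still require adapting it to Mahout's own distance-measure interface, which is not shown here.

```java
import java.util.Arrays;

/** Hypothetical helper (not part of Mahout): textbook O(n*m) DTW. */
public final class DtwSketch {
    private DtwSketch() {}

    /**
     * Returns the DTW distance between two non-empty series, using
     * |a[i] - b[j]| as the local cost (an assumption; other costs work too).
     */
    public static double distance(double[] a, double[] b) {
        if (a.length == 0 || b.length == 0) {
            throw new IllegalArgumentException("series must be non-empty");
        }
        int n = a.length;
        int m = b.length;
        // Two rolling rows of the (n+1) x (m+1) cumulative-cost table.
        double[] prev = new double[m + 1];
        double[] curr = new double[m + 1];
        Arrays.fill(prev, Double.POSITIVE_INFINITY);
        prev[0] = 0.0; // empty prefix aligned with empty prefix costs nothing
        for (int i = 1; i <= n; i++) {
            curr[0] = Double.POSITIVE_INFINITY;
            for (int j = 1; j <= m; j++) {
                double cost = Math.abs(a[i - 1] - b[j - 1]);
                // Cheapest of: step in a only, step in b only, step in both.
                double best = Math.min(prev[j], Math.min(curr[j - 1], prev[j - 1]));
                curr[j] = cost + best;
            }
            double[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[m];
    }
}
```

Note that DTW does not satisfy the triangle inequality, so it is not a true metric; algorithms that rely on metric properties may behave differently with it.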