Re:Re: Re: how to add -mapred.job.queue.name support for mahout modules ?

2011-12-13 Thread enyun
Solve this issue by change source code of trunk : add "conf.set("mapred.job.queue.name", "xxx"), then it's ok. but anybody knows the method "how to specify without changing source code?" thanks, At 2011-12-14 03:28:29,"Lance Norskog" wrote: >It was about how "-Dproperty=value" arguments ar

Re:Re: Re: how to add -mapred.job.queue.name support for mahout modules ?

2011-12-13 Thread enyun
hi all, I'm still blocked on this issue. I can't specify "mapred.job.queue.name" when I try to submit the naivebayes example job of mahout. I try to change source code, try different version of mahout, but not find anything useful. Could anybody help me on this issue : how to specify mapred.queu

Re: Austin SIGKDD - Next Meeting Wednesday, December 14, 2011, 7:00 - 8:00 pm

2011-12-13 Thread Raphael Cendrillon
Sounds like fun, perhaps we should start something similar in the Bay Area :) On Dec 13, 2011, at 8:04 PM, David Boney wrote: > Sure, we are studying machine learning using Mahout. We have started a > weekly hackers dojo to learn how to implement Hadoop based machine learning > programs usin

Re: Austin SIGKDD - Next Meeting Wednesday, December 14, 2011, 7:00 - 8:00 pm

2011-12-13 Thread David Boney
Sure, we are studying machine learning using Mahout. We have started a weekly hackers dojo to learn how to implement Hadoop based machine learning programs using Mahout. Once the group get some experience using Mahout, we are going to focus on projects to add functionality to Mahout. If you ar

Re: Austin SIGKDD - Next Meeting Wednesday, December 14, 2011, 7:00 - 8:00 pm

2011-12-13 Thread Raphael Cendrillon
Is there a reason why user@mahout.apache.org is part of this list? On 13 Dec, 2011, at 7:38 PM, David Boney wrote: > The next meeting of Austin SIGKDD, Big Data Machine Learning Hackers Dojo, is > Wednesday, December 14, 2011, 7:00 - 8:00 pm at Northwest Recreation Center. > We have the meeting

Austin SIGKDD - Next Meeting Wednesday, December 14, 2011, 7:00 - 8:00 pm

2011-12-13 Thread David Boney
The next meeting of Austin SIGKDD, Big Data Machine Learning Hackers Dojo, is Wednesday, December 14, 2011, 7:00 - 8:00 pm at Northwest Recreation Center. We have the meeting room from 6-9. There will be a presentation on how Hadoop works from 7-8, with hacking from 6-7 and 8-9. Please try to h

Re: SequenceFile cast problems

2011-12-13 Thread Ted Dunning
Which version of Mahout? And what happens when you train the classifier from the command line? On Tue, Dec 13, 2011 at 2:27 PM, Daniele Volpi wrote: > First of all i've converted the train files in the format: > target[\t]terms > through the BayesFileFormatter class. > Then i've converted these

Re: Understanding TrainLogistic's output

2011-12-13 Thread Ted Dunning
This is pretty confusing. What has happened is that you have encoded a single categorical variable that has four states as four numerical variables. Unfortunately, Mahout has gotten the message that you are using four categorical variables each with two states. Can you say a bit about how you ra

Understanding TrainLogistic's output

2011-12-13 Thread magicalo
Hello, I am trying to understand the output of TrainLogic. Mahout in Action only has an example of running the classifier on numeric predictor variables. However, my model uses categorical predictor variables only (a,b,c,d) and each can only hold a value of 0 or 1 only. The output I get is this:

Re: Re: how to add -mapred.job.queue.name support for mahout modules ?

2011-12-13 Thread Lance Norskog
It was about how "-Dproperty=value" arguments are sorted. The issue added a change so that these arguments were handled first. On Tue, Dec 13, 2011 at 2:24 AM, Sean Owen wrote: > What are you referring to here? I don't know of anything that would be > relevant. > > On Tue, Dec 13, 2011 at 3:45 AM

Re: SequenceFile cast problems

2011-12-13 Thread Daniele Volpi
First of all i've converted the train files in the format: target[\t]terms through the BayesFileFormatter class. Then i've converted these files (one per category) in SequenceFile using the seqdirectory program. After that I ran this code: TrainNaiveBayesJob trainer = new TrainNaiveBayesJob(); tra

Re: SequenceFile cast problems

2011-12-13 Thread Grant Ingersoll
What steps have you done? On Dec 13, 2011, at 12:29 PM, Daniele Volpi wrote: > Hi everyone, > I'm trying to implement the Naive Bayes classifier through the > TrainNaiveBayesJob class. > After convert the text files in the required sequencefile for the "run" > method through the seqdirectory prog

SequenceFile cast problems

2011-12-13 Thread Daniele Volpi
Hi everyone, I'm trying to implement the Naive Bayes classifier through the TrainNaiveBayesJob class. After convert the text files in the required sequencefile for the "run" method through the seqdirectory program i get this error: java.lang.ClassCastException: org.apache.hadoop.io.Text cannot be

Re: Random forest on Mahout

2011-12-13 Thread deneche abdelhakim
the wiki has several pages dedicated to random forests: https://cwiki.apache.org/MAHOUT/random-forests.html this page describe the basics and has interesting links explaining random forests https://cwiki.apache.org/MAHOUT/breiman-example.html https://cwiki.apache.org/MAHOUT/partial-implementation

Random forest on Mahout

2011-12-13 Thread praveenesh kumar
I want to learn mahout's Random Forest Algorithm and want to implement it. Can someone guide me on this ? I already went to mahout's wiki, couldn't get any info on how mahout has implemented it or how to run it from mahout. Any help would be highly appreciable. Thanks, Praveenesh

Re: Re: how to add -mapred.job.queue.name support for mahout modules ?

2011-12-13 Thread Sean Owen
What are you referring to here? I don't know of anything that would be relevant. On Tue, Dec 13, 2011 at 3:45 AM, Lance Norskog wrote: > There was a bug in Mahout 0.5 which was fixed. Are you using Mahout > 0.5, or trunk? > >