The trainlogistic command is (as Stanley says) only a simple example. You will need to write a program something like TrainNewsGroups for your modelers to use.
I agree that the API oriented code in Mahout is not what those users need. I was, however, what my users needed. It would be great if you would like to contribute a good command line for the more advanced SGD classifier training API. On Tue, Apr 19, 2011 at 10:51 PM, Stanley Xu <wenhao...@gmail.com> wrote: > Hi Xiaobo, > > You could check the chapter 13-16 from <Mahout In Action>, it provided all > the parameters the command line tool of 'mahout trainlogistic' could use. > But the trainlogistic command is still only a simple example. If you wanted > to use that in a production environment, you still have to write the feature > encode code by yourself. The code you need to write is pretty easy, just > parse the input and put that in a Vector and let the LR train the data. > > Best wishes, > Stanley Xu > > > > > On Tue, Apr 19, 2011 at 9:09 PM, XiaoboGu <guxiaobo1...@gmail.com> wrote: > >> Hi, >> >> Thanks for your reply, after some reading of the wiki pages, I think what >> I want is a Logistic Regression command-line, since the target users of >> Mahout are data analysts, who can't write Java code, a command line is more >> convenient. Some specific questions are : >> 1. What format should we apply when preparing data for logistic >> regression, can we use csv, and should we put the value for the target >> variable as the first column in every row the csv file. >> 2. What options can we support to the command line if there is one. >> 3. How can interpret the results. >> >> Because Logistic Regression is the working horse of credit scoring in >> industry, I think it will make Mahout friends of more analysts if LR support >> is smooth. >> >> Regards, >> >> Xiaobo Gu >> >> From: Ted Dunning [mailto:ted.dunn...@gmail.com] >> Sent: Wednesday, April 13, 2011 1:02 AM >> To: user@mahout.apache.org >> Cc: Xiaobo Gu >> Subject: Re: Is any more detailed documentation aout the sgd logistic >> regression example. >> >> Can you be more specific about what you have and what you want? >> >> The book Mahout in Action provides quite a lot of details with sample code >> for a server farm. >> >> The TrainNewsGroups example provides code that you can copy. >> >> Do you have these resources? Do you want more? Did you want more theory? >> >> On Tue, Apr 12, 2011 at 9:11 AM, Xiaobo Gu <guxiaobo1...@gmail.com> >> wrote: >> Hi, >> Documents about sgd logistic regression itself are welcome too. >> Regards, >> >> Xiaobo Gu >> >> >> >