[
https://issues.apache.org/jira/browse/MAHOUT-228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ted Dunning updated MAHOUT-228:
-------------------------------
Attachment: MAHOUT-228.patch
Now has a working and almost useful version of TrainLogistic command line.
This command line will solve a simple example case that I am working out for
the Mahout in Action book:
java -cp<mumble> org.apache.mahout.classifier.sgd.TrainLogistic \
--passes 100 --rate 50 --lambda 0.001 \
--input donut.csv --features 21 --output foo \
--target color --categories 2 \
--predictors x y xx xy yy a b c --types n n
I still need to
- output the model
- change prints into log statements
- build the book-end TestLogistic function
- integrate into the mahout command line driver framework and
- build a DumpResourceData program.
Otherwise, this is beginning to coalesce.
> Need sequential logistic regression implementation using SGD techniques
> -----------------------------------------------------------------------
>
> Key: MAHOUT-228
> URL: https://issues.apache.org/jira/browse/MAHOUT-228
> Project: Mahout
> Issue Type: New Feature
> Components: Classification
> Reporter: Ted Dunning
> Fix For: 0.4
>
> Attachments: logP.csv, MAHOUT-228-3.patch, MAHOUT-228.patch,
> MAHOUT-228.patch, MAHOUT-228.patch, r.csv, sgd-derivation.pdf,
> sgd-derivation.tex, sgd.csv
>
>
> Stochastic gradient descent (SGD) is often fast enough for highly scalable
> learning (see Vowpal Wabbit, http://hunch.net/~vw/).
> I often need to have a logistic regression in Java as well, so that is a
> reasonable place to start.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.