[jira] [Issue Comment Edited] (MAHOUT-976) Implement Multilayer Perceptron

Issue Comment Edited Tue, 28 Feb 2012 11:26:11 -0800

    [ 
https://issues.apache.org/jira/browse/MAHOUT-976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13218511#comment-13218511
 ]


Dirk Weißenborn edited comment on MAHOUT-976 at 2/28/12 7:24 PM:
-----------------------------------------------------------------

You can also take a look at the training itself in this patch, since it is 
actually also a batch learning algorithm. I also implemented a non-map/reduce 
approach that uses multiple threads. I think you can save a lot of time by 
reusing already tested code, since it is pretty similar to this task.
                
      was (Author: dirk.weissenborn):
    You can also take a look at the training itself in this patch since it is 
actually also a batch learning algorithm. I also implemented a not map/reduce 
based approach using multiple threads. I think you can take you can save a lot 
of time reusing already tested code since it is pretty similar to this task.
                  
> Implement Multilayer Perceptron
> -------------------------------
>
>                 Key: MAHOUT-976
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-976
>             Project: Mahout
>          Issue Type: New Feature
>    Affects Versions: 0.7
>            Reporter: Christian Herta
>            Priority: Minor
>              Labels: multilayer, networks, neural, perceptron
>         Attachments: MAHOUT-976.patch, MAHOUT-976.patch, MAHOUT-976.patch, 
> MAHOUT-976.patch
>
>   Original Estimate: 80h
>  Remaining Estimate: 80h
>
> Implement a multi layer perceptron
>  * via Matrix Multiplication
>  * Learning by Backpropagation; implementing tricks by Yann LeCun et al.: 
> "Efficient BackProp"
>  * arbitrary number of hidden layers (also 0  - just the linear model)
>  * connection between proximate layers only 
>  * different cost and activation functions (different activation function in 
> each layer) 
>  * test of backprop by gradient checking 
>  * normalization of the inputs (storeable) as part of the model
>  
> First:
>  * implementation of "stochastic gradient descent", like the gradient machine
>  * simple gradient descent incl. momentum
> Later (new jira issues):  
>  * Distributed Batch learning (see below)  
>  * "Stacked (Denoising) Autoencoder" - Feature Learning
>  * advanced cost minimization like 2nd order methods, conjugate gradient etc.
> Distribution of learning can be done by (batch learning):
>  1 Partitioning of the data in x chunks 
>  2 Learning the weight changes as matrices in each chunk
>  3 Combining the matrices and updating the weights - back to 2
> Maybe this procedure can be done with random parts of the chunks (distributed 
> quasi online learning). 
> Batch learning with delta-bar-delta heuristics for adapting the learning 
> rates.    
>  
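
The three batch-learning steps in the description above can be sketched
roughly as follows. This is only an illustration under stated assumptions, not
code from any of the attached patches: it uses the zero-hidden-layer (linear)
case with a squared-error cost, plain Python lists instead of Mahout matrices,
and a sequential loop where a real implementation would process the chunks in
parallel. The names partition, chunk_weight_change and train are hypothetical.

```python
def partition(data, x):
    """Step 1: split the data into x roughly equal chunks."""
    return [data[i::x] for i in range(x)]


def chunk_weight_change(weights, chunk):
    """Step 2: accumulate the squared-error gradient over one chunk."""
    grad = [0.0] * len(weights)
    for inputs, target in chunk:
        error = sum(w * v for w, v in zip(weights, inputs)) - target
        for j, v in enumerate(inputs):
            grad[j] += error * v
    return grad


def train(data, n_weights, x=4, learning_rate=0.1, epochs=500):
    """Repeat steps 2 and 3 for a fixed number of epochs."""
    weights = [0.0] * n_weights
    chunks = partition(data, x)
    for _ in range(epochs):
        # Step 2: one weight change per chunk (sequential here; these calls
        # are independent, so they could run as parallel tasks).
        changes = [chunk_weight_change(weights, c) for c in chunks]
        # Step 3: combine the per-chunk changes and update the weights.
        total = [sum(column) for column in zip(*changes)]
        weights = [w - learning_rate * g / len(data)
                   for w, g in zip(weights, total)]
    return weights


# Toy data generated from y = 2*a - 3*b + 1; the constant 1.0 is a bias input.
data = [((a, b, 1.0), 2 * a - 3 * b + 1)
        for a in (-1.0, 0.0, 1.0) for b in (-1.0, 0.0, 1.0)]
weights = train(data, n_weights=3)
```

Because the per-chunk gradients are simply summed, the combined update is the
same as a full-batch gradient step, whatever the number of chunks.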

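The "test of backprop by gradient checking" item in the description could look
something like the sketch below. It is an assumption-laden toy, not the test
from the patch: a scalar-input network with one tanh hidden layer of H units,
a linear output and a squared-error cost, with all parameters kept in one flat
list. The analytic gradient from backpropagation is compared with a central
finite-difference estimate. All names here are hypothetical.

```python
import math
import random

H = 3  # number of hidden units (arbitrary choice for this sketch)


def unpack(theta):
    """Split the flat parameter list into w1, b1, w2 and the scalar b2."""
    return theta[:H], theta[H:2 * H], theta[2 * H:3 * H], theta[3 * H]


def forward(theta, x):
    w1, b1, w2, b2 = unpack(theta)
    hidden = [math.tanh(w * x + b) for w, b in zip(w1, b1)]
    output = sum(w * h for w, h in zip(w2, hidden)) + b2
    return hidden, output


def loss(theta, x, target):
    _, output = forward(theta, x)
    return 0.5 * (output - target) ** 2


def backprop_gradient(theta, x, target):
    """Analytic gradient of the loss with respect to every parameter."""
    _, _, w2, _ = unpack(theta)
    hidden, output = forward(theta, x)
    delta_out = output - target
    # tanh'(z) = 1 - tanh(z)^2, expressed through the hidden activations.
    delta_hidden = [delta_out * w * (1.0 - h * h) for w, h in zip(w2, hidden)]
    grad_w1 = [d * x for d in delta_hidden]
    grad_b1 = list(delta_hidden)
    grad_w2 = [delta_out * h for h in hidden]
    return grad_w1 + grad_b1 + grad_w2 + [delta_out]


def numerical_gradient(theta, x, target, eps=1e-5):
    """Central finite differences, one parameter at a time."""
    grad = []
    for i in range(len(theta)):
        plus, minus = list(theta), list(theta)
        plus[i] += eps
        minus[i] -= eps
        grad.append((loss(plus, x, target) - loss(minus, x, target)) / (2 * eps))
    return grad


rng = random.Random(0)
theta = [rng.uniform(-1.0, 1.0) for _ in range(3 * H + 1)]
analytic = backprop_gradient(theta, 0.7, 0.2)
numeric = numerical_gradient(theta, 0.7, 0.2)
max_diff = max(abs(a - n) for a, n in zip(analytic, numeric))
```

A small max_diff suggests the backprop code agrees with the cost function; a
large one usually points to a bug in one of the derivative terms.
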
