[ 
https://issues.apache.org/jira/browse/MAHOUT-1626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306888#comment-14306888
 ] 

ASF GitHub Bot commented on MAHOUT-1626:
----------------------------------------

Github user gcapan commented on the pull request:

    https://github.com/apache/mahout/pull/62#issuecomment-73015659
  
    * The first and simplest will be Zinkevich et al.'s Parallelized Stochastic 
Gradient Descent [1]: the algorithm essentially runs multiple local SGDs in 
parallel, then averages the resulting models. For the implementation, I was 
thinking of running SGD locally on blocks of rows and averaging the results.
    
    * Further, I hope to implement distributed stratified SGD for matrix 
factorization (Gemulla et al.) [2]: the algorithm forms strata (where each 
stratum consists of a set of blocks that do not share any rows or columns), 
then, for each stratum, performs the SGD updates on its blocks in parallel.
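
    To make the first idea concrete, here is a minimal sketch in plain Python 
(not the Mahout DSL) of running SGD on blocks of rows and averaging the models. 
The names `local_sgd` and `parallel_sgd`, the least-squares loss, the contiguous 
blocking, and the step size are assumptions made for illustration only; the 
blocks are processed one after another here, whereas a distributed backend 
would presumably run them on separate workers.

```python
import random


def local_sgd(rows, dim, lr=0.1, epochs=20, seed=0):
    """Plain SGD for least-squares regression on one block of (x, y) rows."""
    rng = random.Random(seed)
    w = [0.0] * dim
    order = list(range(len(rows)))
    for _ in range(epochs):
        rng.shuffle(order)  # visit the block's rows in a fresh random order
        for i in order:
            x, y = rows[i]
            err = sum(wj * xj for wj, xj in zip(w, x)) - y
            # The gradient of 0.5 * err**2 with respect to w is err * x.
            w = [wj - lr * err * xj for wj, xj in zip(w, x)]
    return w


def parallel_sgd(rows, dim, num_blocks=4, **sgd_args):
    """Split rows into contiguous blocks, run SGD per block, average the models."""
    if not rows:
        return [0.0] * dim
    size = -(-len(rows) // num_blocks)  # ceiling division
    blocks = [rows[s:s + size] for s in range(0, len(rows), size)]
    # Each call is independent of the others, so a real backend could run
    # them on separate workers; this sketch runs them sequentially.
    models = [local_sgd(block, dim, seed=b, **sgd_args)
              for b, block in enumerate(blocks)]
    return [sum(m[j] for m in models) / len(models) for j in range(dim)]
```

    Calling `parallel_sgd(data, dim=2)` on a list of `(x, y)` pairs returns the 
averaged weight vector. Note that [1] assumes the data is randomly partitioned 
across workers, so contiguous row blocks are only a reasonable stand-in when 
the rows are not ordered in some systematic way.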
    
    I am not yet sure whether the latter would require additional operations 
outside the DSL. I will raise my concerns once I get to it.
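
    For the stratified variant, the following plain-Python sketch (again not 
the Mahout DSL) shows one way to form strata on a d x d block grid and apply 
the SGD updates stratum by stratum. The function names, the diagonal-shift 
construction of the strata, the unregularized squared loss, and the fixed 
visiting order are all assumptions made for illustration; [2] additionally 
randomizes the order of strata and of the entries within a block.

```python
import random


def strata(d):
    """Strata of a d x d block grid: stratum s holds blocks (i, (i + s) % d)."""
    return [[(i, (i + s) % d) for i in range(d)] for s in range(d)]


def squared_error(ratings, W, H):
    """Sum of squared residuals of W[i] . H[j] against the observed values."""
    return sum((sum(a * b for a, b in zip(W[i], H[j])) - v) ** 2
               for i, j, v in ratings)


def stratified_sgd(ratings, n_rows, n_cols, rank=2, d=2, lr=0.02,
                   epochs=200, seed=0):
    """Factorize observed entries (i, j, v) as W[i] . H[j], stratum by stratum."""
    rng = random.Random(seed)
    W = [[rng.uniform(-0.5, 0.5) for _ in range(rank)] for _ in range(n_rows)]
    H = [[rng.uniform(-0.5, 0.5) for _ in range(rank)] for _ in range(n_cols)]
    # Assign every observed entry to its block in the d x d grid.
    blocks = {}
    for i, j, v in ratings:
        key = (i * d // n_rows, j * d // n_cols)
        blocks.setdefault(key, []).append((i, j, v))
    for _ in range(epochs):
        for stratum in strata(d):
            # Blocks of one stratum touch disjoint rows of W and of H, so they
            # could be handed to separate workers; this sketch loops over them.
            for block in stratum:
                for i, j, v in blocks.get(block, []):
                    err = sum(a * b for a, b in zip(W[i], H[j])) - v
                    for k in range(rank):
                        w, h = W[i][k], H[j][k]
                        W[i][k] = w - lr * err * h
                        H[j][k] = h - lr * err * w
    return W, H
```

    Because no two blocks in a stratum share a row block or a column block, 
the updates inside a stratum never write to the same factor rows, which is 
what makes the per-stratum parallelism safe.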
    
    [1] http://martin.zinkevich.org/publications/nips2010.pdf
    [2] http://dl.acm.org/citation.cfm?id=2020426
    
     


> Support for required quasi-algebraic operations and starting with aggregating 
> rows/blocks
> -----------------------------------------------------------------------------------------
>
>                 Key: MAHOUT-1626
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1626
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Math
>    Affects Versions: 1.0
>            Reporter: Gokhan Capan
>             Fix For: 1.0
>
>




