[ https://issues.apache.org/jira/browse/MAHOUT-1626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306888#comment-14306888 ]
ASF GitHub Bot commented on MAHOUT-1626: ---------------------------------------- Github user gcapan commented on the pull request: https://github.com/apache/mahout/pull/62#issuecomment-73015659 * The first and simplest will be Zinkevich et al.'s Parallelized Stochastic Gradient Descent [1]: The algorithm is basically running multiple local SGD's in parallel, then averaging them. For implementation, I was thinking of running SGD's locally in blocks of rows and averaging them. * Further, I hope to implement distributed stratified SGD for matrix factorization (Gemulla et al.)[2]: The algorithm is forming strata (where each stratum consists of a set of blocks that do not share any rows or columns), then for each stratum, performing SGD updates in parallel. I am not yet sure if the latter would require additional non-DSL stuff. I will raise my concerns once I get to it. [1] http://martin.zinkevich.org/publications/nips2010.pdf [2] http://dl.acm.org/citation.cfm?id=2020426 > Support for required quasi-algebraic operations and starting with aggregating > rows/blocks > ----------------------------------------------------------------------------------------- > > Key: MAHOUT-1626 > URL: https://issues.apache.org/jira/browse/MAHOUT-1626 > Project: Mahout > Issue Type: New Feature > Components: Math > Affects Versions: 1.0 > Reporter: Gokhan Capan > Fix For: 1.0 > > -- This message was sent by Atlassian JIRA (v6.3.4#6332)