[
https://issues.apache.org/jira/browse/MAHOUT-1626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306888#comment-14306888
]
ASF GitHub Bot commented on MAHOUT-1626:
----------------------------------------
Github user gcapan commented on the pull request:
https://github.com/apache/mahout/pull/62#issuecomment-73015659
* The first and simplest will be Zinkevich et al.'s Parallelized Stochastic
Gradient Descent [1]: The algorithm is basically running multiple local SGD's
in parallel, then averaging them. For implementation, I was thinking of running
SGD's locally in blocks of rows and averaging them.
* Further, I hope to implement distributed stratified SGD for matrix
factorization (Gemulla et al.)[2]: The algorithm is forming strata (where each
stratum consists of a set of blocks that do not share any rows or columns),
then for each stratum, performing SGD updates in parallel.
I am not yet sure if the latter would require additional non-DSL stuff. I
will raise my concerns once I get to it.
[1] http://martin.zinkevich.org/publications/nips2010.pdf
[2] http://dl.acm.org/citation.cfm?id=2020426
> Support for required quasi-algebraic operations and starting with aggregating
> rows/blocks
> -----------------------------------------------------------------------------------------
>
> Key: MAHOUT-1626
> URL: https://issues.apache.org/jira/browse/MAHOUT-1626
> Project: Mahout
> Issue Type: New Feature
> Components: Math
> Affects Versions: 1.0
> Reporter: Gokhan Capan
> Fix For: 1.0
>
>
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)