[ https://issues.apache.org/jira/browse/MAHOUT-1567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14015014#comment-14015014 ]
Maciej Kula commented on MAHOUT-1567: ------------------------------------- Thanks, I'll have a look. My implementation is reasonably fast and very sparse-friendly, but definitely single machine and not very suitable to distributed computation. (I guess I could do distributed matrix multiplication but that does not seem like a very good idea.) > Add online sparse dictionary learning (dimensionality reduction) > ---------------------------------------------------------------- > > Key: MAHOUT-1567 > URL: https://issues.apache.org/jira/browse/MAHOUT-1567 > Project: Mahout > Issue Type: Improvement > Components: Collaborative Filtering > Reporter: Maciej Kula > > I have recently implemented a sparse online dictionary learning algorithm, > with an emphasis on learning very high-dimensional and very sparse > dictionaries. It is based on J. Mairal et al 'Online Dictionary Learning for > Sparse Coding' (http://www.di.ens.fr/willow/pdfs/icml09.pdf). It's an online > variant of low-rank matrix factorization, suitable for sparse binary matrices > (such as implicit feedback matrices). > I would be very happy to bring this up to the Mahout standard and contribute > to the main codebase --- is this something you would in principle be > interested in having? > The code (as well as some examples) are here: > https://github.com/maciejkula/dictionarylearning -- This message was sent by Atlassian JIRA (v6.2#6252)