[ 
https://issues.apache.org/jira/browse/MAHOUT-880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13165469#comment-13165469
 ] 

Dmitriy Lyubimov edited comment on MAHOUT-880 at 12/8/11 7:36 PM:
------------------------------------------------------------------

Ideally to optimize this i guess DRM better have a notion that dimensions (or 
whatever other parameters inside solver) may not be initially known. When this 
happens, first operation in pipeline (whatever it happens to be) may also 
employ standard strategies to come up with those in the end. 

Similarly, there's a "post-step" strategy concept: using output and some 
additional parameters you can re-assemble required knowledge (such as mean or 
small result of multiplication) in post step by re-combining result of all 
reducers or separate factors of computation (if it happens to be a small 
product in the end). 

this is a fundamental technique in SSVD (and seems to become even more 
prominent with PCA efficiency tricks).
                
      was (Author: dlyubimov):
    Ideally to optimize this i guess DRM better have a notion that dimensions 
(or whatever other parameters inside solver) may not be initially known. When 
this happens, first operation in pipeline (whatever it happens to be) may also 
employ standard strategies to come up with those in the end. 

Similarly, there's a "post-step" strategy concept: using output and some 
additional parameters you can re-assemble required knowledge (such as mean or 
small result of multiplication) in post step by re-combining result of all 
reducers or separate factors of computation (if it happens to be a small 
product in the end). 

this is a fundamental technique and SSVD (and seems to become even more 
prominent with PCA efficiency tricks).
                  
> Add some matrix method(like addition, subtraction, norm ... etc) to 
> DistributedRowMatrix
> ----------------------------------------------------------------------------------------
>
>                 Key: MAHOUT-880
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-880
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Math
>    Affects Versions: 0.6
>            Reporter: Wangda Tan
>            Priority: Minor
>              Labels: DistributedRowMatrix
>         Attachments: MAHOUT-880.patch, MAHOUT-880.patch, MAHOUT-880.patch
>
>
> I'm a new to Mahout, I didn't find some basic matrix functions. This make 
> users cannot do many tasks by CLI or API, if user get some result through 
> existing map-reduce matrix operation (like svd), he cannot do farther steps. 
> I make a list for it:
> 1) Addition, Subtraction 
> 2) Norm (like norm-1, norm-2, norm-frobenius)
> 3) Matrix compare
> 4) Get lower triangle, upper triangle and diagonal
> 5) Get identity and zero matrix
> 6) Put two or matrix to together: A = [A1, A2]
> 7) More linear equations solver method, like Gaussian elimination (maybe it's 
> hard to implement)
> 8) import and export CSV, ARFF ... (this will very useful when user want to 
> reuse result from or to other applications like MATLAB)
> I want to know is there any plan to do this, if so, I can make some efforts 
> to implement these.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to