Hi Dmitry,

I've pulled this out as a separate issue under MAHOUT-923. Could you please 
take a look?

Thanks!

On Dec 8, 2011, at 11:38 AM, "Dmitriy Lyubimov (Issue Comment Edited) (JIRA)" 
<j...@apache.org> wrote:

> 
>    [ 
> https://issues.apache.org/jira/browse/MAHOUT-880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13165469#comment-13165469
>  ] 
> 
> Dmitriy Lyubimov edited comment on MAHOUT-880 at 12/8/11 7:36 PM:
> ------------------------------------------------------------------
> 
> Ideally to optimize this i guess DRM better have a notion that dimensions (or 
> whatever other parameters inside solver) may not be initially known. When 
> this happens, first operation in pipeline (whatever it happens to be) may 
> also employ standard strategies to come up with those in the end. 
> 
> Similarly, there's a "post-step" strategy concept: using output and some 
> additional parameters you can re-assemble required knowledge (such as mean or 
> small result of multiplication) in post step by re-combining result of all 
> reducers or separate factors of computation (if it happens to be a small 
> product in the end). 
> 
> this is a fundamental technique in SSVD (and seems to become even more 
> prominent with PCA efficiency tricks).
> 
>      was (Author: dlyubimov):
>    Ideally to optimize this i guess DRM better have a notion that dimensions 
> (or whatever other parameters inside solver) may not be initially known. When 
> this happens, first operation in pipeline (whatever it happens to be) may 
> also employ standard strategies to come up with those in the end. 
> 
> Similarly, there's a "post-step" strategy concept: using output and some 
> additional parameters you can re-assemble required knowledge (such as mean or 
> small result of multiplication) in post step by re-combining result of all 
> reducers or separate factors of computation (if it happens to be a small 
> product in the end). 
> 
> this is a fundamental technique and SSVD (and seems to become even more 
> prominent with PCA efficiency tricks).
> 
>> Add some matrix method(like addition, subtraction, norm ... etc) to 
>> DistributedRowMatrix
>> ----------------------------------------------------------------------------------------
>> 
>>                Key: MAHOUT-880
>>                URL: https://issues.apache.org/jira/browse/MAHOUT-880
>>            Project: Mahout
>>         Issue Type: New Feature
>>         Components: Math
>>   Affects Versions: 0.6
>>           Reporter: Wangda Tan
>>           Priority: Minor
>>             Labels: DistributedRowMatrix
>>        Attachments: MAHOUT-880.patch, MAHOUT-880.patch, MAHOUT-880.patch
>> 
>> 
>> I'm a new to Mahout, I didn't find some basic matrix functions. This make 
>> users cannot do many tasks by CLI or API, if user get some result through 
>> existing map-reduce matrix operation (like svd), he cannot do farther steps. 
>> I make a list for it:
>> 1) Addition, Subtraction 
>> 2) Norm (like norm-1, norm-2, norm-frobenius)
>> 3) Matrix compare
>> 4) Get lower triangle, upper triangle and diagonal
>> 5) Get identity and zero matrix
>> 6) Put two or matrix to together: A = [A1, A2]
>> 7) More linear equations solver method, like Gaussian elimination (maybe 
>> it's hard to implement)
>> 8) import and export CSV, ARFF ... (this will very useful when user want to 
>> reuse result from or to other applications like MATLAB)
>> I want to know is there any plan to do this, if so, I can make some efforts 
>> to implement these.
> 
> --
> This message is automatically generated by JIRA.
> If you think it was sent incorrectly, please contact your JIRA 
> administrators: 
> https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
> For more information on JIRA, see: http://www.atlassian.com/software/jira
> 
> 

Reply via email to