[ https://issues.apache.org/jira/browse/MAHOUT-923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13167292#comment-13167292 ]
jirapos...@reviews.apache.org commented on MAHOUT-923:
------------------------------------------------------

-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/3147/
-----------------------------------------------------------

Review request for mahout.


Summary
-------

Here's a patch with a simple job to calculate the row mean (column-wise mean). One outstanding issue is the combiner: it requires a Writable class, IntVectorTupleWritable, where the Int stores the number of rows and the Vector stores the column-wise sum.


This addresses bug MAHOUT-923.
    https://issues.apache.org/jira/browse/MAHOUT-923


Diffs
-----

  /trunk/core/src/main/java/org/apache/mahout/math/hadoop/DistributedRowMatrix.java 1213095
  /trunk/core/src/main/java/org/apache/mahout/math/hadoop/MatrixRowMeanJob.java PRE-CREATION
  /trunk/core/src/test/java/org/apache/mahout/math/hadoop/TestDistributedRowMatrix.java 1213095

Diff: https://reviews.apache.org/r/3147/diff


Testing
-------

JUnit test


Thanks,

Raphael


> Row mean job for PCA
> --------------------
>
>                 Key: MAHOUT-923
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-923
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Math
>    Affects Versions: 0.6
>            Reporter: Raphael Cendrillon
>            Assignee: Raphael Cendrillon
>             Fix For: Backlog
>
>         Attachments: MAHOUT-923.patch
>
>
> Add map reduce job for calculating mean row (column-wise mean) of a
> Distributed Row Matrix for use in PCA.
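
For readers following the combiner question: a combiner cannot simply average partial means, because each partial result may cover a different number of rows. Carrying a (row count, column-wise sum) pair keeps the merge exact. Below is a minimal sketch of that idea only, not the code in the patch. The class name PartialRowSum and its methods are hypothetical, plain double[] stands in for Mahout's Vector, and the Hadoop Writable serialization that IntVectorTupleWritable would need is left out.

```java
// Hypothetical stand-in for the (row count, column-wise sum) tuple.
// Assumption: double[] replaces Mahout's Vector; no Writable plumbing is shown.
final class PartialRowSum {
    final int rowCount;        // number of rows folded into this partial result
    final double[] columnSum;  // column-wise sum of those rows

    PartialRowSum(int rowCount, double[] columnSum) {
        this.rowCount = rowCount;
        this.columnSum = columnSum;
    }

    // Map step: a single matrix row becomes a partial sum with a count of 1.
    static PartialRowSum ofRow(double[] row) {
        return new PartialRowSum(1, row.clone());
    }

    // Combine/reduce step: add the counts and add the sums element-wise.
    PartialRowSum merge(PartialRowSum other) {
        if (columnSum.length != other.columnSum.length) {
            throw new IllegalArgumentException("column counts differ");
        }
        double[] sum = columnSum.clone();
        for (int i = 0; i < sum.length; i++) {
            sum[i] += other.columnSum[i];
        }
        return new PartialRowSum(rowCount + other.rowCount, sum);
    }

    // Final step: divide the total sum by the total number of rows.
    double[] mean() {
        if (rowCount == 0) {
            throw new IllegalStateException("no rows accumulated");
        }
        double[] mean = new double[columnSum.length];
        for (int i = 0; i < mean.length; i++) {
            mean[i] = columnSum[i] / rowCount;
        }
        return mean;
    }
}
```

Because merge is associative and commutative, the same operation could serve as both combiner and reducer, with the division deferred until the very end.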