It's from this thread: http://www.lucidimagination.com/search/document/117d2f370e925cf9#ca4271b66a19bf9a
And somehow I managed to promise to write it.

On Sun, Nov 13, 2011 at 10:59 PM, Jake Mannix (Commented) (JIRA) <[email protected]> wrote:

>
> [ https://issues.apache.org/jira/browse/MAHOUT-884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149470#comment-13149470 ]
>
> Jake Mannix commented on MAHOUT-884:
> ------------------------------------
>
> why do we want the part files squished into one? If the result is small
> enough to read into memory somewhere, then you can easily just iterate over
> the part files, reading each one into memory, right?
>
> > Matrix Concatenate utility
> > --------------------------
> >
> > Key: MAHOUT-884
> > URL: https://issues.apache.org/jira/browse/MAHOUT-884
> > Project: Mahout
> > Issue Type: New Feature
> > Components: Integration
> > Reporter: Lance Norskog
> > Priority: Minor
> > Attachments: MAHOUT-884.patch
> >
> >
> > Utility to concatenate matrices stored as SequenceFiles of vectors.
> > Each pair in the SequenceFile is the IntWritable row number and a
> > VectorWritable.
> > The input and output files may skip rows.
>

--
Lance Norskog
[email protected]
