More than happy to do so; the only caveat is that we're effectively bringing DistributedRowMatrix up from the Hadoop 0.18 API to 0.21, while the rest of Mahout is at 0.20. From what I can tell, 0.21 doesn't really remove anything, so the migration should be fairly painless. But if this is a Mahout-wide goal for the 0.6 release, then we may want to create a new issue for that; DistributedRowMatrix is just one cog in the entire wheel.
Unless, of course, everyone has been using 0.21 or even 0.22 as you have :)

On Sat, May 21, 2011 at 2:52 AM, Sean Owen (JIRA) <[email protected]> wrote:

>
> [ https://issues.apache.org/jira/browse/MAHOUT-537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13037295#comment-13037295 ]
>
> Sean Owen commented on MAHOUT-537:
> ----------------------------------
>
> I could, though honestly, I think the better solution at this point is
> to move to Hadoop 0.21 as part of the next release. It is the current
> release and nearly superseded by 0.22. It has some features we need to
> move forward. It is closer to what many are using in CDH3/4. The only
> drawback I see is that Amazon EMR is on 0.20.2. However we're releasing
> 0.5 now for 0.20.2. And it is 6 months until we would put out a release
> needing 0.21, after which time I imagine 0.22 is out and EMR makes
> available 0.21 -- or if it doesn't, we have to leave behind support.
>
> So let me open an item for that, and I suggest you can proceed using
> 0.21 features here.
> (That is what I am doing for personal projects and it really simplified
> things. I'm on 0.22 now myself.)
>
> > Bring DistributedRowMatrix into compliance with Hadoop 0.20.2
> > -------------------------------------------------------------
> >
> >                 Key: MAHOUT-537
> >                 URL: https://issues.apache.org/jira/browse/MAHOUT-537
> >             Project: Mahout
> >          Issue Type: Improvement
> >          Components: Math
> >    Affects Versions: 0.4, 0.5
> >            Reporter: Shannon Quinn
> >            Assignee: Shannon Quinn
> >             Fix For: 0.6
> >
> >         Attachments: MAHOUT-537.patch, MAHOUT-537.patch,
> >                      MAHOUT-537.patch, MAHOUT-537.patch
> >
> >
> > Convert the current DistributedRowMatrix to use the newer Hadoop
> > 0.20.2 API, in particular eliminate dependence on the deprecated
> > JobConf, using instead the separate Job and Configuration objects.
>
> --
> This message is automatically generated by JIRA.
> For more information on JIRA, see: http://www.atlassian.com/software/jira
>
