Can any committer with knowledge of HOD please review this patch? If there are no committers with such knowledge, I would encourage us to either (a) add a committer to maintain HOD, or (b) reconsider the vote to abandon it as an official contrib. Perhaps Simone and Gianluigi could move it to a separate incubator project?

-Todd

On Fri, Feb 18, 2011 at 6:40 AM, Simone Leo <simone....@crs4.it> wrote:

> I am the co-author (with Gianluigi Zanetti) of HADOOP-6369 -- add Grid
> Engine support to HOD. At CRS4 we've been using (our patched version of)
> HOD since 2008 and we still use it in production. We use Hadoop 0.20.2
> since it was released one year ago.
>
> Simone
>
> On 02/12/11 06:15, Owen O'Malley wrote:
> >
> > On Feb 11, 2011, at 6:17 PM, Nigel Daley wrote:
> >
> >>> a) I don't think hod is actually part of any unit tests, so including
> >>> it would likely only be a burden on the tarball size.
> >>
> >> Not true. HOD has python unit tests and is the reason our builds have
> >> dependencies on python.
> >
> > But Allen's point is that I don't recall ever seeing HOD test failures
> > causing the build to fail.
> >
> >>> b) The edu community uses this quite extensively, evidenced by the
> >>> topic coming up on the mailing lists at least once every two months
> >>> or so and has for years. Can't say that about the other contrib
> >>> modules other than the schedulers and streaming.
> >>
> >> Then they are using old version of Hadoop. AFAICT HOD does not work
> >> with 0.20 or beyond.
> >
> > Out of curiosity, what goes wrong? Clearly nothing major has changed in
> > starting up a mapreduce cluster in a very long time.
> >
> >>> c) The community that does use it has even submitted a patch that
> >>> we've ignored.
> >>
> >> Which means the committers of this project gave up on it long ago.
> >
> > There are also some patches on core Hadoop that have been sitting for a
> > long time, so I don't think that is a valid inference.
> >
> > I would love to hear some of the people who are using HOD speak up and
> > give us their feedback.
> >
> > -- Owen
> >
> --
> Simone Leo
> Data Fusion - Distributed Computing
> CRS4
> POLARIS - Building #1
> Piscina Manna
> I-09010 Pula (CA) - Italy
> e-mail: simone....@crs4.it
> http://www.crs4.it
>

--
Todd Lipcon
Software Engineer, Cloudera