+4.01. This is a terrific idea.

On Fri, Apr 1, 2011 at 1:19 AM, Aaron T. Myers <a...@cloudera.com> wrote:

> Hello Hadoop Community,
>
> Given the tremendous positive feedback we've all had regarding the HDFS,
> MapReduce, and Common project split, I'd like to propose we take the next
> step and further separate the existing projects.
>
> I propose we begin by splitting the MapReduce project into separate "Map"
> and "Reduce" sub-projects. This will provide us the opportunity to tease
> out
> the complex interdependencies between "map" and "reduce" that exist today,
> to encourage us to write more modular and isolated code, which should speed
> releases. This will also aid our users who exclusively run map-only or
> reduce-only jobs. These are important use-cases, and so should be given
> high
> priority.
>
> Given that these two portions of the existing MapReduce project share a
> great deal of code, we will likely need to release these two new projects
> concurrently at first, but the eventual goal should certainly be to be able
> to release "Map" and "Reduce" independently. This seems intuitive to me,
> given the remarkable recent advancements in the academic community
> regarding
> "reduce," while the research coming out of the "map" academics has largely
> stagnated of late.
>
> If this proposal is accepted, and it has the success I think it will, then
> we should strongly consider splitting the other two projects as well. My
> gut
> instinct is that we should split "HDFS" into "HD" and "FS" sub-projects,
> and
> simply rename the "Common" project to "C'Mon." We can think about the
> details of what exactly these project splits mean later.
>
> Please let me know what you think.
>
> Best,
> Aaron
>



-- 
Todd Lipcon
Software Engineer, Cloudera

Reply via email to