Re: Discussions - Re: [VOTE] Release candidate 0.20.203.0-rc0

Steve Loughran Tue, 03 May 2011 07:17:53 -0700

On 03/05/11 01:41, Roy T. Fielding wrote:
>
 I am constantly amazed at how

quiet it is in this project, at least until I remember that
most of the work is done exclusively via jira, unlike any of
my other followed projects that use jira.  I'd suggest that
the right place to hold any discussion is on the dev list,
but I am not on that list because it receives way too many
automated notifications.  Maybe it would help discussion on
dev if notices were sent elsewhere and only discussions were
held on dev.

I've seen this before on the Maven lists, where there's mostly a streamof JIRA changes above anything else:

http://mail-archives.apache.org/mod_mbox/maven-dev/200510.mbox/browser

however, they've got no JIRA issues in their list now, which may implyall changes aren't going to the list, or they arent using it so much:

http://mail-archives.apache.org/mod_mbox/maven-dev/201104.mbox/browser

(pause: bisecting their list shows that in 1.mar.06 they forked JIRA toa separate list to hide the details of ongoing work)

In some ways it's a means of dealing with a large and fast movingcodebase: you subscribe to the issues that matter to you, all thediscussions on a specific feature are archived, etc.


However, it has some flaws

-discouragement of community, you become a group of people working onJIRA issues, rather than on a large integrated project-with work spread across common, hdfs and mapreduce JIRAs and mailinglists, it's hard to keep all the things in your head -it is pretty mucha full time job to do so. And I don't know about the others, but I don'thave the time.-we need a way of gently moving people from those who use hadoop tothose who develop it. To me, every end user is a warm engineeringresource we just need to point at a problem that they care about. Thescale of the project, its complexity, JIRA change rate and testingdifficulties are all barriers to entry -you end up needing a team of people

 * someone to track all the issues and keep the design in their head
 * 1+ person to test
 * 1+ person to code
I don't know about others, but I can't do this on my own.

The attempt to split up into HDFS+MAPREDUCE was one tactic to deal withthis, but it hasn't worked, we just have more mailing lists to track (orin my case, fall behind on).


votewise:

-I'm favour of shipping an apache release of 20.x that has the patchesthat Y! and others have added to deal with scale and availability -andwhich has been tested by them. This will provide an apache release forpeople to use in production systems -because the official apachereleases have lagged the CDH and Y! releases.

-I'd like to see all the changes integrated into trunk too, as itdoesn't make sense for a patch in this branch not to be in trunk.


Steve

Re: Discussions - Re: [VOTE] Release candidate 0.20.203.0-rc0

Reply via email to