Yep, I've heard it's a source of contention...

But I'd like to see how we can minimize the number of patches that the
large companies apply on top of the current production Apache release, so
that the large installations are all running nearly identical code on their
clusters, and we wouldn't need to have a Yahoo or Cloudera repo of their
patch sets made available.

So ideally I'd like to hear what kinds of things Apache needs to do to help
make these distributions less divergent.

In discussing it with people, I've heard that a major issue (not the only
one, I'm sure) is a lack of resources to actually test the Apache releases on
large clusters, and that it is very hard to get this done in short cycles
(hence the large gap between 20.x and 21).

So I thought I would start the thread to see if we could at least identify
what people think the problems are.

On Thu, Oct 21, 2010 at 3:30 PM, Allen Wittenauer

> On Oct 21, 2010, at 12:13 PM, Ian Holsman wrote:
> > Hi guys.
> >
> > I wanted to start a conversation about how we could merge the Cloudera +
> > Yahoo distributions of Hadoop into our codebase,
> > and what would be required.
> *grabs popcorn*
