Re: Install mahout 0.8 with hadoop 2.0

2013-08-14 Thread Ted Dunning
Honest feedback is always welcome on this mailing list.  Don't ever worry about 
flames for that.  

Don't forget that mr v1 is an option with hadoop 2. Confusing as that may be.  

Iterative algos are, as you say, very important.  My current inclination is to 
lean toward a downpour style of implementation. That fits well with yarn but it 
also actually fits reasonably with mr v1.  

Sent from my iPhone

On Aug 13, 2013, at 20:13, Carlos Mundi cmu...@gmail.com wrote:

 Anyway, I apologize if anyone takes offense.  None is meant, so please
 flame me off-list if you must.  But since I self-identify as a member of
 the small demand set Ted Dunning describes, I figure I can chime in.  As
 always, YMMV.


Re: Install mahout 0.8 with hadoop 2.0

2013-08-14 Thread Carlos Mundi
Ted,

The mr v1 option is absolutely viable.  It's just question of how long a
project wants to rely on a legacy tail, and that's everyone's individual
lookout naturally.

Downpour style will help, as you say and surely know better than me; I
should think about that for my situation.

Thanks!
On Aug 14, 2013 12:37 PM, Ted Dunning ted.dunn...@gmail.com wrote:

 Honest feedback is always welcome on this mailing list.  Don't ever worry
 about flames for that.

 Don't forget that mr v1 is an option with hadoop 2. Confusing as that may
 be.

 Iterative algos are, as you say, very important.  My current inclination
 is to lean toward a downpour style of implementation. That fits well with
 yarn but it also actually fits reasonably with mr v1.

 Sent from my iPhone

 On Aug 13, 2013, at 20:13, Carlos Mundi cmu...@gmail.com wrote:

  Anyway, I apologize if anyone takes offense.  None is meant, so please
  flame me off-list if you must.  But since I self-identify as a member of
  the small demand set Ted Dunning describes, I figure I can chime in.
  As
  always, YMMV.



Install mahout 0.8 with hadoop 2.0

2013-08-13 Thread Sergey Svinarchuk
Hi all,

Somebody compile and install mahout with hadoop 2.0? If yes, that what
changes you make in mahout, that it have 100% passed unit tests and
successful work with hadoop 2.0?

Thanks


Re: Install mahout 0.8 with hadoop 2.0

2013-08-13 Thread Ted Dunning
No.  There is very small demand for Mahout on Hadoop 2.0 so far and the
forward/backward incompatibility of 2.0 has made it difficult to motivate
moving to 2.0.

The bigtop guys built a maven profile for 0.23 some time ago.  I don't know
the status of that.

I don't think that the differences are huge ... it is just the standard
Hadoop forklift-the-world upgrade experience.



On Tue, Aug 13, 2013 at 6:49 AM, Sergey Svinarchuk 
ssvinarc...@hortonworks.com wrote:

 Hi all,

 Somebody compile and install mahout with hadoop 2.0? If yes, that what
 changes you make in mahout, that it have 100% passed unit tests and
 successful work with hadoop 2.0?

 Thanks



Re: Install mahout 0.8 with hadoop 2.0

2013-08-13 Thread Sean Owen
I think it all minimally works on Hadoop 2.0.x, though I haven't tried
it recently -- it does require a recompile.

This is different from it working on MRv2 versus MRv1. I'm almost
certain it does not work on MRv2 and doubt it will.

The effort is not large, but it's subtle. A few hacks may fail in
mysterious ways, and certainly to properly use MRv2 you have to switch
to use the newer resource configuration scheme -- in terms of
megabytes not reducer slots and all that.

At least this was most of the work that i remember when I was
rebuilding some of this type of stuff on MRv2 + Hadoop 2.0.x

On Tue, Aug 13, 2013 at 5:58 PM, Ted Dunning ted.dunn...@gmail.com wrote:
 No.  There is very small demand for Mahout on Hadoop 2.0 so far and the
 forward/backward incompatibility of 2.0 has made it difficult to motivate
 moving to 2.0.

 The bigtop guys built a maven profile for 0.23 some time ago.  I don't know
 the status of that.

 I don't think that the differences are huge ... it is just the standard
 Hadoop forklift-the-world upgrade experience.



 On Tue, Aug 13, 2013 at 6:49 AM, Sergey Svinarchuk 
 ssvinarc...@hortonworks.com wrote:

 Hi all,

 Somebody compile and install mahout with hadoop 2.0? If yes, that what
 changes you make in mahout, that it have 100% passed unit tests and
 successful work with hadoop 2.0?

 Thanks



Re: Install mahout 0.8 with hadoop 2.0

2013-08-13 Thread Carlos Mundi
I recently asked the same core question on this list.  I certainly won't
argue with the statistics of small numbers.  But I will hazard a
prediction: the impetus for Mahout to support Hadoop 2 will appear about
the same time the elephant book gets updated for 2.0, provided Twister or
something like it does not get there first.

My own view, from the outside, is that the best use of limited Mahout dev
resources at the moment is stabilizing the core feature-set and positioning
Mahout in the increasingly crowded big analytics space.  I don't know how
easy that is, as I do not know how tightly Mahout is tied to the Hadoop MR
model.

Although it would be hard to overstate the importance of iterative algos,
it is also true that Hadoop MR excels on some very real killer apps.
Making those accessible and robust should take precedence, for now, over
the incremental ROI for chasing Hadoop 2.  I say this as someone who would
very much like to run Mahout against Hadoop 2.

Anyway, I apologize if anyone takes offense.  None is meant, so please
flame me off-list if you must.  But since I self-identify as a member of
the small demand set Ted Dunning describes, I figure I can chime in.  As
always, YMMV.

Cheers,
Carlos
On Aug 13, 2013 9:59 AM, Ted Dunning ted.dunn...@gmail.com wrote:

 No.  There is very small demand for Mahout on Hadoop 2.0 so far and the
 forward/backward incompatibility of 2.0 has made it difficult to motivate
 moving to 2.0.

 The bigtop guys built a maven profile for 0.23 some time ago.  I don't know
 the status of that.

 I don't think that the differences are huge ... it is just the standard
 Hadoop forklift-the-world upgrade experience.



 On Tue, Aug 13, 2013 at 6:49 AM, Sergey Svinarchuk 
 ssvinarc...@hortonworks.com wrote:

  Hi all,
 
  Somebody compile and install mahout with hadoop 2.0? If yes, that what
  changes you make in mahout, that it have 100% passed unit tests and
  successful work with hadoop 2.0?
 
  Thanks