Hmmm. 

That could be. I don't know what Doug wrote, only that he mentioned he had 
updated the docs on it. 

This is really kind of a basic issue.  It just makes sense. 
As you point out, you and Andrew had already noticed this back in 2009 and 
2010. 


I just don't think you took it far enough. Essentially HBase can be used in 
place of a reducer. In terms of a M/R job, M/M using HBase is going to be more 
efficient. (Assuming that you are already running HBase.)   I really can't see 
any reason to use a reducer when using HBase. 
Maybe I'm being stupid, but every example I've looked at can be refactored 
to not use a reducer. 
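
To make the idea concrete, here is a minimal sketch in plain Python. It is not 
real HBase code: FakeTable is a hypothetical stand-in for a table with an 
atomic counter column, and the function names are made up for illustration. 
It only shows that map tasks writing increments straight to a keyed store end 
up with the same totals as a classic map/shuffle/reduce word count.

```python
from collections import Counter, defaultdict


class FakeTable:
    """Hypothetical stand-in for an HBase table with a counter column."""

    def __init__(self):
        self.rows = defaultdict(int)

    def increment(self, row_key, amount=1):
        # The store does the aggregation, keyed by row.
        self.rows[row_key] += amount
        return self.rows[row_key]


def map_only_word_count(splits, table):
    """Each 'mapper' handles one split and writes directly to the table."""
    for split in splits:
        local_counts = Counter(split.split())  # in-mapper combining
        for word, count in local_counts.items():
            table.increment(word, count)


def map_reduce_word_count(splits):
    """Classic version: map emits pairs, shuffle groups them, reduce sums."""
    shuffled = defaultdict(list)
    for split in splits:
        for word in split.split():
            shuffled[word].append(1)
    return {word: sum(ones) for word, ones in shuffled.items()}


splits = ["a b a", "b c", "a"]
table = FakeTable()
map_only_word_count(splits, table)

# Both approaches agree; the map-only one never shuffles or sorts.
assert dict(table.rows) == map_reduce_word_count(splits)
```

In the sketch the store aggregates by row key, which is the job the shuffle 
and reduce phases do in the classic version. 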

I also think you may read a bit more into my posts than I intend. ;-)

-Mike



On May 10, 2012, at 10:28 PM, Stack wrote:

> On Thu, May 10, 2012 at 6:28 PM, Michael Segel
> <michael_se...@hotmail.com> wrote:
>> That section was written by Doug after he and I had the same debate many 
>> moons ago.
> 
> 
> I'm not sure that is correct.  If you git blame that section, you'll
> see that stack and andrew are the authors and that the edits were made
> in 2009 and 2010.
> 
> There is this section in the book but it doesn't seem to have the
> benefit of your input:
> http://hbase.apache.org/book.html#mapreduce.example.summary.noreducer
> 
> 
>> While I can't say with absolute certainty that you shouldn't use a reducer, 
>> I can say that in every situation where I have seen a M/R job writing to 
>> HBase, you end up not wanting to use a reducer. If you want a clear and 
>> concise statement, you can say that the rule of thumb is that you don't want 
>> to use a reducer and that cases where you would need to first use a reducer 
>> are the rare exception.
>> 
> 
> Please file an issue w/ a patch.  It'd be good to get your experience
> into the doc.
> 
>> The reason I ask people to think about this topic is that unless you have a 
>> really good foundation in databases, not relying on a reducer is a bit 
>> counterintuitive. (Which is why I said that you really need to clear your 
>> mind and focus on this issue.)
>> 
> 
> Let's make it so that if you don't have a foundation in dbs, if you
> read the doc., you won't need such a background to get the best of
> hbase.
> 
>> PS. If you care to read the thread, I didn't become condescending until a 
>> certain individual piped up about how refactoring the M/R was a 
>> 'distraction' to the issue at hand.
>> Not to mention his flip response w/ the Google paper?
>> 
> 
> There are a few problems w/ the above.
> 
> + You presume I did not read the thread before responding
> + You presume that the condescending tone started after Dave's
> intercessions (I was not referring to this thread only).
> 
> Michael, fellas like you help move the hbase story along.   Generally,
> I see that you do a great job in this forum and in others.  In my
> previous note, I was just trying to give a pointer that what you might
> consider jest, others can read as condescending or sarcasm.
> 
> St.Ack
> 
