Re: Difference between Hadoop Streaming and "Normal" mode

Adam SI Wed, 13 Aug 2008 07:54:20 -0700

Will coding computational intensive algorithms using c/c++ and usingthem with streaming mode improve the performance ? Just curiosity.


Xiance


On Aug 13, 2008, at 10:56 AM, Gaurav Veda wrote:

Thank you all for the replies. They do clarify things!

Cheers,
Gaurav
On Tue, Aug 12, 2008 at 8:01 PM, Arun C Murthy <[EMAIL PROTECTED]>wrote:
On Aug 12, 2008, at 3:15 PM, Ashish Venugopal wrote:
There is definitely functionality in "normal" mode that is notavailable
in
streaming, like the ability to write counters to instruments jobs. I
personally just use streaming, so I am interested to see if thereare
further key differences...
With hadoop-0.18 (under vote now) you get counters for streaming too:
http://issues.apache.org/jira/browse/HADOOP-1328

As others have pointed out, the fact that your input/output has to be
'textual' is a major difference - lots of applications need binarydata.
This 'stringification' has serious performance implications too, some
benchmarks I did a while ago for Pig put this at nearly 3x.

Arun
Ashish

On Tue, Aug 12, 2008 at 3:09 PM, Gaurav Veda
<[EMAIL PROTECTED]<[EMAIL PROTECTED]>
wrote:
Hi All,
This might seem too silly, but I couldn't find a satisfactoryanswerto this yet. What are the advantages / disadvantages of usingHadoopStreaming over the normal mode (wherein you write your ownmapper and
reducer in Java)? From what I gather, the real advantage of Hadoop
Streaming is that you can use any executable (in c / perl / python
etc) as a mapper / reducer.
A slight disadvantage is that the default is to read (write)from thestandard input (output) ... though one can specify their ownInput and
Output format (and package it with the default hadoop streaming jar
file).

My point is, why should I ever use the normal mode? Streaming seems
just as good. Is there a performance problem or do I have onlylimitedcontrol over my job if I use the streaming mode or some otherissue?
Thanks!
Gaurav
--
Share what you know, learn what you don't !
--
Share what you know, learn what you don't !

Re: Difference between Hadoop Streaming and "Normal" mode

Reply via email to