Travis Brady wrote:
This brings up two interesting issues:
1. Hadoop streaming is a potentially very powerful tool, especially for
those of us who don't work in Java for whatever reason
2. If Hadoop streaming is "at best a jury rigged solution" then that should
be made known somewhere on the wiki. If it's really not supposed to be
used, why is it provided at all?
A set of reasonable performance tests and results would be very helpful
in helping people decide whether to go with streaming or not. Hopefully
we can get some numbers from this thread and publish them? Anyone else
compared streaming with native java?
Best,
Parand