Hi,

Can I split an input file into pieces based on the key (probably on the
hash value of the key)?
Since Hadoop streaming works like a shell pipeline,
this seems to be impossible, but I wanted to double-check
to be sure.
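
To make the idea concrete, here is a minimal Python sketch of the kind of
splitting I have in mind, run outside Hadoop. It assumes tab-separated
key/value lines; the function names and the choice of CRC32 as the hash are
only illustrative and are not something Hadoop provides.

```python
import zlib


def partition_for(key, num_parts):
    """Map a key to a piece number in [0, num_parts).

    CRC32 is used because it gives the same value on every run,
    unlike Python's built-in hash() for strings.
    """
    return zlib.crc32(key.encode("utf-8")) % num_parts


def split_by_key(lines, num_parts):
    """Group tab-separated 'key<TAB>value' lines into num_parts pieces."""
    parts = [[] for _ in range(num_parts)]
    for line in lines:
        # Everything before the first tab is treated as the key.
        key = line.split("\t", 1)[0]
        parts[partition_for(key, num_parts)].append(line)
    return parts
```

With this, every line that shares a key ends up in the same piece, so an
application would only need to open the one piece that matches the key it
is looking up.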

Background: The output (an index file) is so large (more than 10 GB) that
it slows down the applications that use it unless I split it into pieces.

Thanks in advance.
