In Streaming, allow different mappers for different subsets of the input
------------------------------------------------------------------------

                 Key: HADOOP-2277
                 URL: https://issues.apache.org/jira/browse/HADOOP-2277
             Project: Hadoop
          Issue Type: Improvement
          Components: contrib/streaming
            Reporter: arkady borkovsky


The command line may look like this:

-mapper mapper-command-1 -input dir11   -mapper mapper-command-2 -input dir22  
input -dir21

meaning that map phase will apply mapper-command-1 to part files from dir11, 
and the part files from dir22 and dir21 will be processed by mapper-command-2
then all will be shuffled and processed by a single reducer.


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to