Streaming task with special char gets wrong output 
---------------------------------------------------

                 Key: MAPREDUCE-3388
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3388
             Project: Hadoop Map/Reduce
          Issue Type: Bug
          Components: contrib/streaming, mrv2
    Affects Versions: 0.23.0, 0.24.0
            Reporter: Robert Joseph Evans
            Assignee: Robert Joseph Evans
            Priority: Critical


In 0.20.204:
{noformat}
hadoop jar $HADOOP_HOME/hadoop-streaming.jar '-Dmapreduce.job.acl-view-job=*' 
-files 'file:///tmp/InputDir#testlink!@$&*()-_+=' -input input.txt -mapper 
'xargs cat' -reducer cat -output output -jobconf 
mapred.job.name=streamingTest-1000 -jobconf 'mapreduce.job.acl-view-job=*'
{noformat}

Output:
{noformat}
hadoop fs -cat output/*
and-so-is-the-c#
c++-also-supports-object-oriented-concepts
hadoop-apache-org-core  
hadoop-streaming
hod-is-the-part-of-hadoop
it-is-residing-on-apache-under-repos-asf
java-an object-oriented-language
smalltalk-is-also-object-oriented-language
streaming-is-also-part-of-it
{noformat}

For 0.23:
{noformat}
hadoop jar $HADOOP_MAPRED_HOME/hadoop-streaming.jar 
-Dmapreduce.job.acl-view-job=* -files 
'file:///tmp/InputDir#testlink!@$&*()-_+=' -input input.txt  -mapper 'xargs 
cat' -reducer cat -output output -jobconf mapred.job.name=streamingTest-1000 
-jobconf 'mapreduce.job.acl-view-job=*'
{noformat}

Output:
{noformat}
testlink!@$&*()-_+=/input1.txt testlink!@$&*()-_+=/input2.txt
{noformat}

The contents of input.txt are as follows:
{noformat}
hadoop fs -cat Streaming/streaming-1000/input.txt
testlink!@$&*()-_+=/input1.txt
testlink!@$&*()-_+=/input2.txt
{noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to