[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Joseph Evans resolved MAPREDUCE-3388.
--------------------------------------------

    Resolution: Invalid

As it turns out this was an issue with our tests which was masked by a small 
bug in streaming.  I have filed MAPREDUCE-3521 to address this issue in 
streaming.
                
> Streaming task with special char gets wrong output 
> ---------------------------------------------------
>
>                 Key: MAPREDUCE-3388
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3388
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: contrib/streaming, mrv2
>    Affects Versions: 0.23.0, 0.24.0
>            Reporter: Robert Joseph Evans
>            Assignee: Robert Joseph Evans
>            Priority: Critical
>
> In 0.20.204:
> {noformat}
> hadoop jar $HADOOP_HOME/hadoop-streaming.jar '-Dmapreduce.job.acl-view-job=*' 
> -files 'file:///tmp/InputDir#testlink!@$&*()-_+=' -input input.txt -mapper 
> 'xargs cat' -reducer cat -output output -jobconf 
> mapred.job.name=streamingTest-1000 -jobconf 'mapreduce.job.acl-view-job=*'
> {noformat}
> Output:
> {noformat}
> hadoop fs -cat output/*
> and-so-is-the-c#
> c++-also-supports-object-oriented-concepts
> hadoop-apache-org-core  
> hadoop-streaming
> hod-is-the-part-of-hadoop
> it-is-residing-on-apache-under-repos-asf
> java-an object-oriented-language
> smalltalk-is-also-object-oriented-language
> streaming-is-also-part-of-it
> {noformat}
> For 0.23:
> {noformat}
> hadoop jar $HADOOP_MAPRED_HOME/hadoop-streaming.jar 
> -Dmapreduce.job.acl-view-job=* -files 
> 'file:///tmp/InputDir#testlink!@$&*()-_+=' -input input.txt  -mapper 'xargs 
> cat' -reducer cat -output output -jobconf mapred.job.name=streamingTest-1000 
> -jobconf 'mapreduce.job.acl-view-job=*'
> {noformat}
> Output:
> {noformat}
> testlink!@$&*()-_+=/input1.txt testlink!@$&*()-_+=/input2.txt
> {noformat}
> The contents of input.txt are as follows:
> {noformat}
> hadoop fs -cat Streaming/streaming-1000/input.txt
> testlink!@$&*()-_+=/input1.txt
> testlink!@$&*()-_+=/input2.txt
> {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to