Streaming task with special char gets wrong output
---------------------------------------------------
Key: MAPREDUCE-3388
URL: https://issues.apache.org/jira/browse/MAPREDUCE-3388
Project: Hadoop Map/Reduce
Issue Type: Bug
Components: contrib/streaming, mrv2
Affects Versions: 0.23.0, 0.24.0
Reporter: Robert Joseph Evans
Assignee: Robert Joseph Evans
Priority: Critical
In 0.20.204:
{noformat}
hadoop jar $HADOOP_HOME/hadoop-streaming.jar '-Dmapreduce.job.acl-view-job=*'
-files 'file:///tmp/InputDir#testlink!@$&*()-_+=' -input input.txt -mapper
'xargs cat' -reducer cat -output output -jobconf
mapred.job.name=streamingTest-1000 -jobconf 'mapreduce.job.acl-view-job=*'
{noformat}
Output:
{noformat}
hadoop fs -cat output/*
and-so-is-the-c#
c++-also-supports-object-oriented-concepts
hadoop-apache-org-core
hadoop-streaming
hod-is-the-part-of-hadoop
it-is-residing-on-apache-under-repos-asf
java-an object-oriented-language
smalltalk-is-also-object-oriented-language
streaming-is-also-part-of-it
{noformat}
For 0.23:
{noformat}
hadoop jar $HADOOP_MAPRED_HOME/hadoop-streaming.jar
-Dmapreduce.job.acl-view-job=* -files
'file:///tmp/InputDir#testlink!@$&*()-_+=' -input input.txt -mapper 'xargs
cat' -reducer cat -output output -jobconf mapred.job.name=streamingTest-1000
-jobconf 'mapreduce.job.acl-view-job=*'
{noformat}
Output:
{noformat}
testlink!@$&*()-_+=/input1.txt testlink!@$&*()-_+=/input2.txt
{noformat}
The contents of input.txt are as follows:
{noformat}
hadoop fs -cat Streaming/streaming-1000/input.txt
testlink!@$&*()-_+=/input1.txt
testlink!@$&*()-_+=/input2.txt
{noformat}
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira