Hi All,
I am new to Hadoop and am using Python to write MapReduce tasks. To
run a streaming job I am using the following command:
bin/hadoop jar hadoop-0.20.0-streaming.jar -mapper pkg2Cls.py -jobconf
mapred.map.tasks=5 -jobconf mapred.reduce.tasks=0 -input
/usr/test/linecount -output linecountresults -file pkg2Cls.py -file
pkg1Cls.py
This works fine. But now I want to pass the entire directory of my
Python files to the "-file" option, instead of listing each file with a
separate -file option.
How can I do this?
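For reference, one possible workaround is sketched below. It does not pass a directory to -file; it only generates one -file argument per Python file, which is the form already shown to work in the command above. The function name build_streaming_command and the script_dir parameter are hypothetical names chosen for illustration, and the jar name and -jobconf values are simply copied from the command above.

```python
import os


def build_streaming_command(script_dir, mapper, input_path, output_path,
                            streaming_jar="hadoop-0.20.0-streaming.jar"):
    """Build the streaming command as an argument list.

    Hypothetical helper: adds one "-file <path>" pair for every .py file
    found directly inside script_dir (subdirectories are not scanned).
    """
    cmd = [
        "bin/hadoop", "jar", streaming_jar,
        "-mapper", mapper,
        "-jobconf", "mapred.map.tasks=5",
        "-jobconf", "mapred.reduce.tasks=0",
        "-input", input_path,
        "-output", output_path,
    ]
    # Sort the names so the generated command is the same on every run.
    for name in sorted(os.listdir(script_dir)):
        if name.endswith(".py"):
            cmd += ["-file", os.path.join(script_dir, name)]
    return cmd


# Example use (assumes a local directory named "scripts" holds the .py files):
#   import subprocess
#   cmd = build_streaming_command("scripts", "pkg2Cls.py",
#                                 "/usr/test/linecount", "linecountresults")
#   subprocess.call(cmd)
```

The helper returns a list rather than a single string so that it can be handed to subprocess without shell quoting problems.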
Thanks for your help in advance.
Subbarayudu Amanchi.